Senior Database Reliability Engineer
Listed on 2025-12-17
IT/Tech
Data Engineer, Cloud Computing, Systems Engineer
Innovate with purpose
At BILL, we believe in empowering the businesses that drive our economy. By replacing outdated financial processes with innovative tools, we help businesses, from startups to established brands, make smarter decisions and gain control of their operations. And we don’t stop there: we’re creating the future of financial automation so businesses can spend more time on what matters.
Working here means you become part of a vision-driven team that’s ready to tackle challenges and build cutting-edge solutions. We value purpose, drive, and curiosity—and we thrive in a fast-paced, ever-changing environment. Whether in one of our offices in San Jose, CA, Draper, UT, or working remotely, BILLders collaborate to deliver real impact for businesses that need more time in their busy weeks.
BILL builds high-performing teams, and we seek to hire the best talent for every role. We’re committed to building a workplace that fosters inclusion and diverse perspectives, valuing each person’s unique skills and experiences. We’d love to hear from you. You might be just what we’re looking for, whether in this role or another.
Let’s give businesses more time for what matters.
We are seeking a highly skilled Senior Database Reliability Engineer to join our Core Infrastructure Team. This team is central to managing all of BILL’s infrastructure and databases. We champion Infrastructure as Code using Terraform and AWS ECS and ensure robust, real-time alerting via Slack and PagerDuty, leveraging a suite of monitoring tools including Foglight, Datadog (for logging, metrics, and APM), Splunk, and CloudWatch (integrated via Datadog).
In this crucial role, you will be responsible for architecting, building, scaling, and supporting high-throughput data platforms that underpin mission-critical business applications. Success requires deep expertise across enterprise databases, real-time streaming, and distributed systems.
As a key contributor, you will collaborate with engineering, product, and infrastructure teams to design and build resilient, fault-tolerant, real-time data pipelines. A critical focus of this role is ensuring exceptional data consistency and high performance at the database level, primarily by utilizing technologies like Kafka and Flink.
We’d love to chat if you have:
- 5+ years of hands‑on experience with enterprise RDBMS (PostgreSQL/Aurora/MySQL/Oracle), focusing on optimization, security, and access control.
- 3+ years managing databases and CDC streaming services on AWS.
- Experience with Apache/Confluent Kafka (brokers, partitions, offsets, Schema Registry, Connect), including running clusters at scale with exactly‑once or at‑least‑once delivery.
- Familiarity with Flink pipelines (stateful streaming, checkpointing, windows, savepoints).
- Strong experience designing fault‑tolerant data ingestion and streaming architectures.
- Strong experience with replication tools like GoldenGate / Debezium.
- Expertise in IaC with Terraform and GitLab, applying an SDLC approach to infrastructure.
- Strong SQL skills and experience in query optimization.
- Strong knowledge of performance tuning, execution plans, indexing, partitioning, and concurrency.
- Experience implementing High Availability (HA), replication, Change Data Capture (CDC), disaster recovery, and multi‑region database deployments.
- Proficiency in Python and other programming languages.
- Ability to troubleshoot complex platform issues across the database, streaming, and infrastructure layers.
- Experience with monitoring and alerting tools like Datadog, Splunk, and CloudWatch for real-time performance monitoring and issue resolution.
- Familiarity with containers and cloud platforms (Kubernetes/ECS/AWS).
- Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent practical experience.
- Strong communication skills.
- Automation mindset.
- Ownership of end-to-end systems.
The estimated salary range for this role is noted below for our San Jose based role. Our ranges for each role and job level are based on a variety of factors including candidate experience, expertise, and geographic…