
Infrastructure Engineer, Distributed Compute

Job in Austin, Travis County, Texas, 78716, USA
Listing for: Base Power Company
Full Time position
Listed on 2026-06-27
Job specializations:
  • Software Development
    Backend Developer, Cloud Engineer - Software
Salary/Wage Range or Industry Benchmark: 100000 - 130000 USD Yearly

About Base

Base is America’s next-generation power company. We’re rebuilding the foundation of modern civilization, electricity, by deploying a vast network of distributed batteries that is transforming today’s fragile, centralized grid into a resilient and abundant system. We are engineers, operators, and creatives solving some of the most complex, interdisciplinary challenges of our time.

About the Role

Base is deploying thousands of computing nodes across the country, coordinating them as a single distributed system. We're looking for an Infrastructure Engineer to design, build, and operate the horizontal infrastructure that coordinates, orchestrates, and manages this distributed compute network — enabling device communication, task scheduling, state synchronization, and fleet management at scale.

You’ll own the backend systems and APIs that allow thousands of devices to reliably communicate with central infrastructure, track their state, receive updates, and execute coordinated commands. This is systems-level work: designing for failure, scale, cost efficiency, and operational simplicity.

You’ll work closely with device engineers who need reliable communication channels, product teams who need fleet management primitives, operations teams who need visibility and control, and hardware engineers who understand physical constraints. Your infrastructure is the nervous system of this product — it must be fast, reliable, and elegant.

What You’ll Do
  • Design and build the core orchestration and coordination layer that manages device fleet operations — task distribution, state synchronization, health monitoring — with >99.9% availability.
  • Build backend systems that reliably handle device-to-cloud communication at scale, including message routing, acknowledgment, retry logic, and conflict resolution for concurrent updates.
  • Develop APIs and services that allow product teams to query device state, push updates, and execute commands on thousands of devices simultaneously without bottlenecks or data consistency issues.
  • Design architectures that scale horizontally from hundreds to millions of devices without re-architecture, while optimizing compute, storage, and network costs.
  • Implement monitoring, alerting, and operational runbooks that allow the team to understand and troubleshoot distributed system behavior in production.
  • Build reliable async communication patterns using message queues and event streaming, handling ordering guarantees, deduplication, and exactly-once semantics.
  • Own the database and storage layer decisions that support both operational and analytical workloads, knowing when to use relational databases, NoSQL stores, or specialized systems.
  • Partner with hardware and device teams to understand their needs and translate them into scalable, reliable backend services.
  • Write infrastructure-as-code that is maintainable, tested, and reproducible, enabling safe and rapid iteration.
What You’ll Bring
  • 5+ years building backend infrastructure or distributed systems, preferably at scale
  • Strong experience in Go, Python, Java, or equivalent backend languages
  • Deep understanding of distributed systems concepts: eventual consistency, state synchronization, failure handling
  • Experience building APIs and services that handle high scale and high concurrency
  • Familiarity with message queues or event streaming (Kafka, RabbitMQ, SQS, or similar)
  • Solid understanding of databases and data modeling, including when to use relational vs. NoSQL vs. specialized stores
  • Comfort with infrastructure-as-code and cloud platforms (AWS or GCP)
  • Proven ability to own complex systems end-to-end: design, implementation, deployment, and operational support
Nice-to-Haves
  • Experience building device management or IoT backend systems
  • Familiarity with Kubernetes and container orchestration
  • Background in energy, utilities, or other operational technology (OT) domains
  • Experience with distributed tracing and observability at scale (Datadog, Honeycomb, etc.)
  • Knowledge of fleet management, device provisioning, or OTA update systems
  • Exposure to consensus algorithms (Raft, Paxos) or distributed coordination (etcd, Zookeeper)
  • Experience with stream processing frameworks…