×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineer; m​/f​/d

Job in Akron, Summit County, Ohio, 44329, USA
Listing for: Deepslate
Full Time position
Listed on 2026-05-30
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing, SRE/Site Reliability
Salary/Wage Range or Industry Benchmark: 100000 - 125000 USD Yearly USD 100000.00 125000.00 YEAR
Job Description & How to Apply Below
Position: Site Reliability Engineer (m/f/d)

Location: Remote / Berlin (Office available) |
Language Requirement: Fluent German

At Deepslate, we are building Speech to Speech Voice AI models that sound and act indistinguishable from a human. And we believe everyone should be able to use it.

When it comes to text and images, giants like OpenAI and Google have already cracked the code. With video, Veo3, Sora and others are closing the gap rapidly. But with its endless languages, dialects, accents, subtle intonations, and speech melodies voice remains a highly complex unsolved frontier.

That is exactly why we started Deepslate.

Backed by top-tier investors from the Tech and AI sectors, as well as a major German VC fund, we are incredibly well-funded and moving fast.

We are building the future of communication.

We aren't trying to build another standalone platform; instead, we are the intelligence engine powering countless other applications. Whether it's integrated as a module by a CRM provider, plugged into another Voice AI platform, or directly embedded into an enterprise system by our integration partners - our model is everywhere.

Your Role

Our Voice Models need to handle millions of calls, and you will be the guardian of their uptime, performance, and scalability. If our infrastructure goes down, our customers' applications grind to a halt.

Your mission is to build an infrastructure so resilient that potential outages are caught and mitigated before they even happen. You are the bridge between development and operations, ensuring that our massive AI workloads run smoothly, efficiently, and with uncompromising high availability.

What You'll Do:
Bulletproof Our Voice AI Engine

You don't build temporary workarounds; you build automated, scalable fortresses. Your focus is on absolute reliability, deep observability, and crafting an infrastructure that effortlessly keeps pace with our rapid growth.

  • Infrastructure as Code: Design, build, and manage our cloud infrastructure using modern tools (
    Pulumi
    ) to ensure all infrastructure changes are reproducible, secure, and easily auditable.
  • Kubernetes : Orchestrate and optimize our Kubernetes clusters for complex, compute-heavy AI workloads, guaranteeing maximum efficiency and fault tolerance.
  • Deep Observability & Monitoring: Implement a flawless monitoring setup. Using Datadog and Open Telemetry
    , you will make the black box of our distributed systems transparent, hunting down latency spikes or bottlenecks before they impact users.
  • Incident Response & Reliability: Establish and manage our on-call and alerting processes (using Pager Duty
    ) and champion a culture of blameless post-mortems so the same mistake never happens twice.
  • Release Confidence: Build and maintain highly automated Integration Testing and deployment pipelines. No code goes live without rigorous validation of its impact on system stability.
Elevate Our Engineering Quality:
  • SLAs, SLOs & SLIs: Define and monitor our service-level metrics, turning reliability into a measurable, core component of our product development cycle.
  • Automation First: Ruthlessly automate away toil (repetitive, manual work) so the engineering team can focus on innovation instead of maintenance.
  • Security & Compliance: Ensure our infrastructure is not only highly available but also locked down and hardened against external threats.
What We’re Looking For:

Must-Haves:

  • Kubernetes: Deep, hands-on experience in setting up, managing, and scaling self-hosted Kubernetes clusters in production.
  • Infrastructure as Code: Strong experience with modern IaC, ideally with Pulumi (or deep Terraform knowledge alongside a willingness to adopt Pulumi).
  • Observability: You are a pro with Datadog and Open Telemetry
    . You know exactly how to effectively monitor distributed systems across tracing, metrics, and logs.
  • Alerting & Incident Management: Proven experience with Pager Duty (or similar tools) and a track record of building a healthy, sustainable on-call culture.
  • Integration Testing & CI/CD: Hands-on experience setting up robust testing and deployment pipelines for complex, microservice-based architectures.
  • Fluent German skills: (spoken and written).
  • Startup Mindset: You are comfortable navigating the chaos of an early-stage codebase. If a process is missing or unstructured, you roll up your sleeves and build it.
  • Extreme Ownership: You aren't just looking to blindly process Jira tickets. You proactively identify where the infrastructure is burning—or where it will burn in the future—and you take action.

Nice-to-Haves:

  • Experience managing GPU workloads and scaling AI/ML infrastructures.
  • Background in network optimization (highly critical for latency-sensitive voice streaming protocols like WebRTC or Web Sockets).
  • Previous experience building high-availability systems in a fast-paced B2B/API-first environment.

Deepslate is an equal opportunity employer. We welcome applications from all qualified candidates regardless of gender, nationality, ethnic origin, religion, disability, age, or sexual orientation.

#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary