×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineer

Remote / Online - Candidates ideally in
Germany, Pike County, Ohio, USA
Listing for: Cerebras
Full Time, Remote/Work from Home position
Listed on 2026-02-07
Job specializations:
  • IT/Tech
    Cloud Computing, SRE/Site Reliability, Systems Engineer, IT Support
Salary/Wage Range or Industry Benchmark: 53053 - 76633 USD Yearly USD 53053.00 76633.00 YEAR
Job Description & How to Apply Below
Location: Germany

Site Reliability Engineer

Location: Permanent
• Full time
• Remote

Engineering Team

We’re on the lookout for a Site Reliability Engineer!

45-65K EUR | Full Remote (Latam) | Series A startup backed by top US VCs.

At Agentero we believe in simple and smart solutions for complex problems.

We are building cutting-edge technology to help insurance agents serve their customers more effectively and help them grow their businesses. We do so through a data-driven platform that provides insurance agents with market access to digital carriers.

Agentero is a remote-first Silicon Valley startup with a top-talent team spread across Spain and the US. We ve raised a $13.5M Series A, bringing our total funding to over $20M with participation of top investors like Foundation Capital (Uber, Netflix) or USV (Twitter, Mongo

DB) and Mundi Ventures (WeFox). It s going to be huge! This is just the start…

🚀 The Opportunity

We re in search of a skilled Site Reliability Engineer based in Latin America to join our engineering team. This role works aligned with US business hours, enabling our follow-the-sun on-call model across our distributed team.

You'll work on improving our observability stack, creating runbooks that transform incident learnings into automation, and building infrastructure improvements that scale with our growth. This role is ideal for someone who is passionate about reliability, allergic to manual toil, and believes that every incident is an opportunity to make the system better.

📣 What You'll Do
  • Observability & Monitoring — You will design and implement monitoring solutions that alert on symptoms rather than outages, giving us early warning before our customers are impacted.

  • Runbooks & Incident Response — You will create and maintain runbooks that document every action, turning findings into repeatable processes and eventually into automation. You ll participate in blameless post-mortems to prevent incidents from ever happening again.

  • Infrastructure Improvements — You will build and maintain our cloud infrastructure using Infrastructure-as-Code principles, collaborating with backend engineers to improve service reliability and reduce manual work.

  • On-Call (Business Hours) — You will participate in a business-hours on-call rotation aligned with the US timezone (roughly 9am-6pm EST/PST). Our distributed team across LATAM, US, and Spain enables a follow-the-sun model with no midnight pages.

  • Engineering Excellence — You will help us build a culture of reliability that the whole team is proud of, championing automation, documentation, and continuous improvement.

  • 👤 What We're Looking For (Must-haves)
    • Based in Latin America with availability to work aligned with US business hours (EST or PST).

    • At least 4 years of relevant experience in SRE, Dev Ops, Platform Engineering, or Infrastructure roles.

    • Proficiency with Infrastructure-as-Code tools (Terraform preferred).

    • Experience with cloud platforms (AWS or GCP).

    • Strong Linux systems administration and troubleshooting skills.

    • Programming ability in Go, Python, or similar languages. This means you ve solved problems by writing code to automate your way out of them.

    • Familiarity with observability and monitoring tools (Datadog, Prometheus, Grafana, or similar).

    • Ownership of your work and enjoy the autonomy of managing projects across design, implementation, and production.

    • Great team player: humble, empathetic, open mindset.

    • Strong verbal and written communication skills in English.

    🌟 Nice-to-haves
    • Experience with GCP platform in particular with Cloud Run, Cloud Spanner, Cloud Monitoring.

    • Background in incident management and writing effective runbooks.

    • Experience with CI/CD pipelines and deployment automation.

    • Contributions to internal tooling or developer experience improvements.

    • You believe CI servers, push-button deploys, metrics dashboards, and centralized logging are not just "nice to haves", they re critical infrastructure that rapidly pays for itself.

    💎 Why You Should Join Agentero

    Salary
    : 45-65K EUR + equity.
    Remote-first
    :
    Work from anywhere in Latin America.
    Home office setup budget
    .
    Training and development budget
    .


    Business-hours on-call
    :
    We use follow-the-sun across our distributed…

    To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
    (If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
     
     
     
    Search for further Jobs Here:
    (Try combinations for better Results! Or enter less keywords for broader Results)
    Location
    Increase/decrease your Search Radius (miles)

    Job Posting Language
    Employment Category
    Education (minimum level)
    Filters
    Education Level
    Experience Level (years)
    Posted in last:
    Salary