×
Register Here to Apply for Jobs or Post Jobs. X

Principal Site Reliability Engineer

Remote / Online - Candidates ideally in
Mesa, Maricopa County, Arizona, 85201, USA
Listing for: Blue River Technology
Full Time, Remote/Work from Home position
Listed on 2025-11-27
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing
Salary/Wage Range or Industry Benchmark: 166000 - 293000 USD Yearly USD 166000.00 293000.00 YEAR
Job Description & How to Apply Below

Summary

We are looking for a Principal Site Reliability Engineer to join the CVML Platform team at Blue River Technology. You will work to create a hybrid infrastructure, integrating edge devices, on-premises, and cloud resources to a cohesive CVML & Robotics foundation. You will work on cost effectiveness, transparency, and security aspects of the platform, focusing on speed and quality of solutions and services provided.

You will work with both your peers and stakeholders from other teams to achieve alignment on the platform's vision and technologies. You must show initiative and the ability to organize your work schedule, and be comfortable with supporting the application needs of multiple teams, systems, and products.

  • Employment Type
    :
    Full-Time
  • Work Location
    :
    Remote in the United States
  • Visa sponsorship is available for this position on a case-by-case basis.
Job Responsibilities

A combination, not necessarily all-inclusive, of the following:

  • System Design:
    Architect and implement various cloud and on-premise applications, systems, and infrastructure.
  • Hybrid system integration:
    Integrate extremely diverse systems, configure stable integration, uptime, and monitoring.
  • Edge device integration: work with edge devices of various formats and integrate them with on-prem and cloud workflows, including networking, low-level OS, and electrical/control integration.
  • Low-level performance optimization: optimize the performance and throughput of the system at the file system, networking, and software levels.
  • High-level optimisation of cost and stability: optimize cost, operational stability, and supportability of highly diverse platforms and tech stack.
  • Product Mindset:
    Collaborate with cross-functional teams to design, develop, and maintain robust, scalable, and user-friendly web and mobile data-intensive applications.
  • System Integration:
    Build tools that enable users to easily move between different applications and platforms to utilize the strengths of each in a coherent ecosystem.
  • Collaboration:

    Work closely with cross-functional teams, including data scientists, analysts, software engineers, and product managers, to understand data requirements and deliver data solutions that align with business goals.
  • Documentation:
    Create and maintain technical documentation, including data flow diagrams, architecture designs, and standard operating procedures.
  • Technology Evaluation:
    Stay up-to-date with industry trends and emerging technologies related to data engineering, recommending and implementing new tools and frameworks as appropriate.
Required Experience and Skills
  • 8+ years of experience building infrastructure with K8S, AWS, and bare metal.
  • 8+ years of experience working with Python and Go (with production experience).
  • 8+ years of experience working with infra automation tools:
    Terraform / Terragrunt (or Pulumi / CDK).
  • 8+ experience with Linux-based systems and networks, and a deep understanding of internal components, networking, and security aspects.
  • Has a track record of building and maintaining scalable systems in production environments.
  • Experience in building CI/CD pipelines using Git Hub Actions (or Git Lab / Jenkins) for application release and deployment.
  • Experience in using AWS ECS, EKS, IAM, EC2, and RDS at production scale.
  • Deep understanding of Kubernetes and its internals (kubelet, CRDs, etc) and experience with building and extending clusters from scratch.
  • Strong problem-solving skills and ability to troubleshoot complex infrastructure and networking issues.
  • Excellent communication skills to collaborate effectively with technical and non-technical stakeholders.
  • Attention to detail and commitment to producing high-quality, well-documented code.
Preferred Experience and Skills
  • Experience with standard SQL, No

    SQL, and MPP databases.
  • Experience with writing production Kubernetes operators.
  • Airflow, Kubeflow, or other orchestration system experience.
  • Can understand some C++ and/or Rust, or talk with people who do.
  • Prior experience in the autonomy and robotics space is a huge plus.

Only individual applicants will be considered. We do not work with unsolicited third-party agencies or proxy interview services.

At Blue River,…

To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary