
Principal Software Engineer, DevOps

Remote / Online - Candidates ideally in
Warren, Macomb County, Michigan, 48088, USA
Listing for: Utilidata
Remote/Work from Home position
Listed on 2026-05-30
Job specializations:
  • IT/Tech
    Data Engineer, Cloud Computing
Salary/Wage Range or Industry Benchmark: 180,000 - 210,000 USD yearly
Job Description & How to Apply Below

Utilidata is a fast-growing NVIDIA-backed edge AI company enabling greater visibility and control of power utilization in energy-intensive infrastructure, like the electric grid and data centers. Karman, the company’s distributed AI platform powered by a custom NVIDIA module, is transforming the way utility companies operate the grid edge and will enable data centers to unlock more compute for the same provisioned power.

We are seeking a DevOps Engineer to help design, build, and operate Utilidata’s off‑device platform that ingests, processes, and serves data flowing from edge AI devices. The role will build and maintain infrastructure across on‑premises and cloud environments, bridging edge deployments with cloud‑based data processing to support analytics, operations, and ML workloads. This is a hands‑on development role with technical leadership responsibilities and company‑wide impact.

This engineer will architect and maintain the systems that keep our platform running, set technical direction for infrastructure and deployment practices, and mentor engineers. This engineer will partner closely with on‑device and ML teams to ensure our off‑device platform is resilient, well‑instrumented, and ready to scale. This is a remote position based in the United States, working with distributed teams across the country.

Responsibilities
  • Oversee the deployment and management of containerized applications using Kubernetes, ensuring optimal performance and availability
  • Contribute to strategic planning regarding how the infrastructure solutions evolve to match the requirements of Data Center partners
  • Lead the design, implementation, and maintenance of scalable and reliable systems on AWS and/or on‑premises
  • Utilize Terraform for infrastructure as code to automate the provisioning and management of cloud resources
  • Monitor system performance and uptime, ensuring systems meet established service level objectives (SLOs)
  • Support SOC2 security compliance requirements for data handling
  • Mentor and guide team members in DevOps practices, promoting a culture of reliability and excellence
  • Advocate for automation of operational tasks to enhance efficiency and reduce manual intervention
  • Collaborate with cross‑functional teams to build and maintain CI/CD pipelines
  • Troubleshoot and resolve complex production issues, conducting root cause analysis and implementing corrective actions
  • Participate in on‑call rotations and incident response teams
  • Assist in capacity planning, performance tuning, and technical decision‑making
  • Drive continuous improvement initiatives for processes and infrastructure
Minimum Qualifications
  • 8+ years of development experience including extensive experience in platform engineering, SRE, or distributed systems, with clear senior or principal‑level impact
  • Experience designing and operating infrastructure across on‑premises and cloud environments
  • Strong proficiency in container orchestration, particularly Kubernetes
  • Strong proficiency with AWS services and architecture
  • Hands‑on experience with Terraform for infrastructure automation
  • Familiarity with monitoring tools (Prometheus, Grafana, or similar) and observability best practices
  • Excellent problem‑solving skills, leadership abilities, and attention to detail
  • Strong communication and collaboration skills, with experience in driving technical outcomes
  • Willingness to travel up to 20% of the time
Enhanced Qualifications (Nice to Have)
  • Bachelor’s degree in Computer Science, Engineering, or a related field
  • Experience supporting or enabling MLOps platforms, model deployment pipelines, or ML‑adjacent infrastructure
  • Experience with AI workload scheduling using Kubernetes
  • Knowledge of Apache Spark for large‑scale data processing
  • Knowledge of database technologies (SQL, NoSQL)
  • Understanding of networking concepts and security best practices
Salary Range

$180,000 to $210,000 base compensation, depending on experience, plus stock options. Salary will be commensurate with an individual's skills, training, and years of experience, and in line with internal compensation bands.

Location

This position can be performed remotely from anywhere in the United States.

Our Commitments

Utilidata values the diversity of our team.…
