×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineer, Energy Software

Job in Richmond Hill, Ontario, Canada
Listing for: Tesla
Full Time position
Listed on 2026-06-01
Job specializations:
  • Software Development
Job Description & How to Apply Below
Position: Staff Site Reliability Engineer, Energy Software
What To Expect
Tesla is looking for a Site Reliability Engineer to build, enhance, and scale the infrastructure that underpins our Energy IoT applications. These applications provide real-time monitoring, optimization, and control for Tesla’s industry‑leading energy products, including Powerwall, Megapack, Solar Roof, Supercharger, Wall Connector, Autobidder, and Virtual Power Plants.

We are a high‑impact team that values curiosity, learning, mentorship, open discourse, and making disciplined decisions by weighing trade‑offs. Our work supports over 50 engineers and directly affects millions of customers.

If you enjoy thinking in systems and tackling challenges related to the availability, reliability, scalability, and security of distributed software, this role is for you.

You’ll work with and deepen your expertise in Linux, Networking, Kubernetes, on‑premises data centers, AWS, Terraform, Prometheus, Helm, Git Hub Actions, Postgre

SQL, Cloud

Native

PG, Kafka, Influx

DB, Scala, and Rust.

Join us in accelerating the world’s transition to sustainable energy.

What You’ll Do

Envision and implement changes that improve system reliability

Conduct deep investigations into new technologies and resolve unexpected issues that arise during operation

Provide guidance on system architecture and security best practices

Review, digest, and distill complex code and technical topics to ensure clarity and accessibility for all engineers

Provide technical leadership, foster collaboration, and drive key initiatives to completion

Uphold team values, including engineering excellence, curiosity, bias for action, self‑awareness, inclusivity, and openness

What You’ll Bring

Minimum 2+ years of relevant industry experience

Experience in developing, scaling, and maintaining infrastructure for distributed systems, including IoT applications

Proficiency in many of the following:
Linux, Networking, Kubernetes, on‑premises data centers, AWS, Terraform, Prometheus, Helm, Git Hub Actions, Postgre

SQL, and Kafka

Strong understanding of system design principles and the challenges of ensuring availability, reliability, scalability, and security in distributed software systems

Effective verbal and written communication skills

Ability to navigate uncertainty and loosely defined problem statements

Strong analytical and problem‑solving skills, with the ability to evaluate trade‑offs and make well‑reasoned decisions

Collaborative mindset with a willingness to learn, mentor, and engage in open discussions

#J-18808-Ljbffr
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary