×
Register Here to Apply for Jobs or Post Jobs. X

Sr. IT Linux Site Reliability Engineer

Job in Redmond, King County, Washington, 98052, USA
Listing for: SPACE EXPLORATION TECHNOLOGIES CORP
Full Time position
Listed on 2025-12-12
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing, SRE/Site Reliability
Job Description & How to Apply Below

Space

X was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today Space

X is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars.

SR. LINUX SITE RELIABILITY ENGINEER

Space

X is looking for an experienced engineer with deep working knowledge of Kubernetes and related containerized technologies. This employee will be a member of the Information Technology Linux Infrastructure team and will provide expertise in Kubernetes design, maintenance, scaling and optimization in support of critical business functions. The ideal candidate will be flexible and flourish in a fast paced and challenging environment.

They should be a self-starter, self-motivator and possess ingenuity to excel at this position.

RESPONSIBILITIES
  • Install, manage, scale and optimize Kubernetes and RKE clusters using Ansible, Terraform and adjacent technologies in production environments.
  • Work closely with other Space

    X engineers to gather requirements, research, evaluate, design, plan, deploy, and support software platforms and related technologies running in Kubernetes within a world-class environment that meets the needs of the demanding Space

    X engineering teams. Build highly resilient, high-performance, scalable, and robust systems.
  • Exercise a high degree of personal responsibility for the processes, systems, and tools you create and manage; all supporting the goal of making humanity an interplanetary species.
  • Make recommendations, justify, and implement improvements using an accepted change control methodology.
  • Work within a diverse group to design and deliver creative solutions and resolve problems in a timely and proactive manner by interacting with internal business units.
  • Define, document and follow standards and best practices for systems design, testing, and implementation.
  • Foster an environment of collaboration and cross-training, upskilling the team in Kubernetes expertise and ensuring peers are developed into capable engineers.
  • Drive scripting, self-service and automation to develop solutions to reduce administrative overhead and TOIL.
  • Participate in on-call rotation to handle urgent after-hours work when necessary.
BASIC QUALIFICATIONS
  • Bachelor’s degree in Computer Science or a STEM discipline and 5+ years of systems engineering experience; OR 7+ years of systems engineering experience in lieu of a degree.
  • Experience deploying and supporting Linux servers in physical and virtualized environments (e.g. VMware via automation).
  • Experience with the Linux shell as well as configuring and extending Linux instances (e.g. kernel modules, cgroups, pki, iptables, interfaces).
  • Experience supporting and scaling containerized applications in Linux environments.
  • Experience using automation frameworks (e.g. Ansible, Terraform) to manage provisioning and post-provisioning life cycles of infrastructure and Kubernetes installations.
PREFERRED SKILLS AND EXPERIENCE
  • Expertise in creating repeatable, reliable, scalable systems architectures, with high availability, fault tolerance, performance tuning, monitoring, and statistics/metrics collection.
  • Expertise in source code version control tools such as Git and Subversion and collaborating on source code via Pull Requests and other Git-based workflows.
  • Strong understanding of Linux Container Runtime.
  • Experience implementing configuration management provisioning and workflow automation solutions via Infrastructure as Code, CI/CD and Git Ops (e.g. Ansible, AWX/Tower, Vagrant, Puppet, Redfish, Jenkins, cloud-init, ArgoCD, etc).
  • Experience writing test automation to ensure backwards compatibility of feature and change development for automation processes and Kubernetes deployments.
  • Experience with programming and scripting languages such as Python and Golang to develop software solutions and integrate with external systems to implement automation against RESTful API services.
  • Experience installing, configuring and troubleshooting Kubernetes internals, CNI, CRI and CSI plugins (e.g. Docker, Cri-O, Ceph, Cilium), load balancing (e.g. Metal

    LB), Service Mesh (e.g. Istio)…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary