Senior System Software Engineer - Infrastructure
Listed on 2025-12-18
-
IT/Tech
Cloud Computing, Systems Engineer
Senior System Software Engineer - Infrastructure
Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIA engineer, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work.
JobDetails
- Seniority Level: Mid‑Senior level
- Employment Type:
Full‑time
- Designing, deploying, and maintaining scalable AWS infrastructure using EKS, EC2, S3, and related services.
- Managing and optimizing Kubernetes clusters for high availability, resilience, and performance.
- Creating and maintaining Git Lab CI/CD pipelines to automate build, test, and deployment workflows.
- Developing automation scripts and Infrastructure as Code (IaC) templates with Terraform.
- Monitoring system performance and implementing logging, metrics, and alerting through LGTM, Prometheus, Datadog, or Splunk.
- Implementing Dev Sec Ops best practices, embedding security scans, compliance checks, and secret management in the CI/CD lifecycle.
- Supporting platform observability, diagnosing production incidents, and enhancing self‑service for developer teams.
- Collaborating with cross‑functional teams to streamline delivery and improve developer productivity.
- BS/MS in Computer Science and/or equivalent experience.
- 12+ years of hands‑on experience building/supporting complex services.
- Strong hands‑on experience with AWS services (VPC, IAM, EC2, EKS, Lambda, Cloud Watch).
- Deep knowledge of Kubernetes internals, Helm charts, and container orchestration principles.
- Proficiency with Git Lab CI/CD or equivalent pipeline automation tools.
- Experience implementing Git Ops workflows (ArgoCD, FluxCD).
- Strong foundation in scripting languages such as Python, Bash, or Go.
- Familiarity with networking, load balancing, and security in cloud‑native environments.
- Experience enforcing cloud and container security standards and compliance practices.
- Excellent documentation, problem‑solving, and communication skills for cross‑team alignment.
- Managed multi‑cloud and hybrid Kubernetes clusters across AWS, GCP, and Azure.
- Contributed to open‑source Dev Ops projects, including Kubernetes and Git Lab initiatives.
- Earned certifications such as CKA, AWS Dev Ops Engineer, and Git Lab Certified Specialist.
- Applied AI/ML tools and AIOps platforms for predictive monitoring and automation.
- Led Dev Ops teams in platform engineering, chaos testing, disaster recovery, and process optimization.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until December 13, 2025.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Reference
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).