×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineer ML platform - W2

Job in Sunnyvale, Santa Clara County, California, 94087, USA
Listing for: Saransh Inc
Full Time position
Listed on 2026-06-18
Job specializations:
  • IT/Tech
    Cloud Computing: Infrastructure & Operations, Machine Learning/ ML Engineer, Data Engineering, SRE/Site Reliability
Salary/Wage Range or Industry Benchmark: 125000 - 150000 USD Yearly USD 125000.00 150000.00 YEAR
Job Description & How to Apply Below
Position: Site Reliability Engineer with ML platform - Only W2

Overview

Title:

Site Reliability Engineer SRE – ML platform

Location:

Austin, TX or Sunnyvale, CA

Employment type:

Full-time
• Seniority:
Mid-Senior level
• ONLY W2

Responsibilities
  • Continuous Deployment using Git Hub Actions, Flux, Kustomize
  • Design and implement cloud solutions, build MLOps on cloud AWS
  • Data science model containerization, deployment using docker, VLLM, Kubernetes
  • Communicate with a team of data scientists, data engineers and architects, document the processes
  • Develop and deploy scalable tools and services for our clients to handle machine learning training and inference
  • Knowledge of ML models and LLM
Qualifications
  • 6+ years of experience in ML Ops with strong knowledge in Kubernetes, Python, MongoDB and AWS
  • Good understanding of Apache SOLR
  • Proficient with Linux administration
  • Knowledge of ML models and LLM
  • Ability to understand tools used by data scientists and experience with software development and test automation
  • Ability to design and implement cloud solutions and ability to build MLOps pipelines on cloud solutions (AWS)
  • Experience working with cloud computing and database systems
  • Experience building custom integrations between cloud-based systems using APIs
  • Experience developing and maintaining ML systems built with open-source tools
  • Experience with MLOps Frameworks like Kubeflow, MLFlow, Data Robot, Airflow etc., experience with Docker and Kubernetes
  • Experience developing containers and Kubernetes in cloud computing environments
  • Familiarity with one or more data-oriented workflow orchestration frameworks (Kubeflow, Airflow, Argo, etc.)
  • Ability to translate business needs to technical requirements
  • Strong understanding of software testing, benchmarking, and continuous integration
  • Exposure to machine learning methodology and best practices
  • Good communication skills and ability to work in a team
#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary