Machine Learning Engineer SFO, CA; Hybrid
Job in
San Francisco, San Francisco County, California, 94199, USA
Listed on 2026-01-08
Listing for:
ESR Healthcare
Full Time
position Listed on 2026-01-08
Job specializations:
-
IT/Tech
AI Engineer, Machine Learning/ ML Engineer, Data Engineer, Data Scientist
Job Description & How to Apply Below
Duration: 1+ year
Job Description:
NOTE:
Must be good in communication and technical skills + Minimum of 8 years of work experience needed.
Role Overview
We are seeking a skilled Machine Learning Engineer to design, develop, and deploy advanced AI/ML models, with a focus on Generative AI, RAG architectures, and large-scale machine learning applications. You will work on end-to-end ML pipelines, integrating state-of-the-art tools like OpenAI, Anthropic Claude, and vector databases to deliver high-quality solutions for real-world business challenges.
Key Responsibilities
• Machine Learning, Generative AI & RAG Development:
Build and fine-tune large language models (LLMs) using frameworks such as OpenAI GPT or Anthropic Claude.
Design and implement RAG pipelines for scalable, real-time applications leveraging vector databases like Pinecone, Weaviate, Open search.
Develop prompt engineering strategies to optimize model outputs for specific use cases.
Design and deploy scalable ML models that integrate with existing systems.
• End-to-End ML Pipeline:
Architect, train, and deploy machine learning pipelines for NLP and multimodal AI solutions.
Conduct data preprocessing, feature engineering, and exploratory data analysis for training embeddings for semantic search and document retrieval tasks.
• Model Deployment & Optimization:
Deploy ML models in production environments using cloud platforms like AWS Sage Maker, ECS or equivalent tools.
Ensure scalability, reliability, and low latency in production systems while monitoring model performance.
Implement CI/CD pipelines for ML models using Docker, Kubernetes, MLflow.
Ensure APIs and ML services handle high traffic with minimal latency.
• Security & Compliance:
Ensure ML APIs follow best practices for authentication, authorization, and data privacy.
• Collaboration & Integration:
Work closely with cross-functional teams including data scientists, software engineers, and product managers to align ML solutions with business objectives.
Work with data engineers to design feature stores and streaming pipelines.
Integrate ML outputs into enterprise systems while ensuring seamless user experiences.
• Research &
Innovation:
Stay updated on advancements in generative AI, LLMs, embeddings, and RAG technologies to enhance existing systems.
Experiment with new algorithms and frameworks to drive innovation in AI-powered applications.
Required
Skills & Qualifications
• Technical Expertise:
Minimum of 8 years of work experience with at least 4 years in Python; familiarity with frameworks like PyTorch, Tensor Flow, and libraries like Hugging Face Transformers.
Hands-on experience with LLMs (e.g., OpenAI GPT models, Anthropic Claude) and fine-tuning techniques.
Strong understanding of RAG architectures and vector database integration (e.g., Open search, Pinecone, Weaviate).
• API Development:
FastAPI, Flask, Django
• Containerization:
Docker, AWS ECS, Kubernetes
• Cloud & Data Tools:
Experience with cloud platforms such as AWS (Sage Maker preferred), GCP Vertex AI, or Azure ML for deploying ML models.
Familiarity with SQL or No
SQL databases for data extraction and preprocessing tasks.
• Problem-Solving
Skills:
Ability to design scalable solutions for complex problems involving unstructured data and large analytical skills with a focus on optimizing ML workflows for performance and efficiency.
•
Soft Skills:
Excellent communication skills to collaborate effectively with technical and non-technical stakeholders.
A passion for learning and staying ahead in the rapidly evolving field of artificial intelligence.
Preferred Qualifications
• Experience building conversational AI systems or chatbots using generative AI technologies.
• Experience with building REST API using frameworks such as FastAPI.
• Experience with SQL and No
SQL database/store (Postgres, Dynamo
DB, Open search etc.)
• Knowledge of NLP techniques such as sentiment analysis, topic modeling, or summarization tasks.
• Familiarity with serverless architectures (e.g., AWS Lambda) or ECS for scalable ML deployment.
• Bachelor’s or Master’s degree in Computer Science, Data Science, Mathematics, or related fields.
#JLjbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×