×
Register Here to Apply for Jobs or Post Jobs. X

DevOps Engineer AI Ops

Job in Atlanta, Fulton County, Georgia, 30383, USA
Listing for: Highbrow LLC
Full Time position
Listed on 2025-12-27
Job specializations:
  • IT/Tech
    AI Engineer, Cloud Computing, Data Engineer, Machine Learning/ ML Engineer
Salary/Wage Range or Industry Benchmark: 100000 - 130000 USD Yearly USD 100000.00 130000.00 YEAR
Job Description & How to Apply Below
Position: DevOps Engineer with AI Ops Experience

Job Title :

-

Dev Ops Engineer with AI Ops Experience

Employment Type

:

- W2

Duration :

- Long Term

Visa Type :

- All Visa applicable which are ready for W2

Location
- Atlanta, GA (Day-1 Onsite)

Exp required :

-
5+ years of experience in Dev Ops engineering, with at least 3 years specializing in AI Ops or supporting ML/AI model deployment and infrastructure.

Job Description:
  • 5+ years of experience in Dev Ops engineering, with at least 3 years specializing in AI Ops or supporting ML/AI model deployment and infrastructure.
  • Proven experience in designing, implementing, and managing CI/CD pipelines and ML Ops frameworks to automate AI/ML workflows.
Technical

Skills:

  • Proficiency in cloud platforms (AWS, GCP, Azure) with hands-on experience in deploying AI/ML models and utilizing AI/ML services (e.g., AWS Sage Maker, Google AI Platform).
  • Strong skills in containerization and orchestration tools such as Docker and Kubernetes, especially for deploying machine learning models at scale.
  • Experience with infrastructure-as-code tools like Terraform, Cloud Formation, or Ansible to manage and provision cloud and on-premise environments.
  • Proficiency in CI/CD tools (e.g., Jenkins, Git Lab CI, Circle

    CI) to build automated pipelines for AI/ML model training, testing, and deployment.
  • Solid understanding of monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack) for model performance tracking and infrastructure observability.
  • Strong programming and scripting skills in Python, Bash, and YAML for automating workflows and integrating services.
AI Ops and MLOps

Skills:

  • Experience with MLOps best practices, including model versioning, automated retraining, and model governance for reliable and reproducible AI pipelines.
  • Hands-on experience with model monitoring tools (e.g., MLflow, Kubeflow, or TFX) to track model performance, drift, and retraining needs.
  • Familiarity with data pipelines and orchestration tools (e.g., Apache Airflow, Prefect) for managing data and model workflows.
  • Knowledge of model deployment strategies (e.g., blue-green deployments, canary releases) to ensure reliable AI/ML model deployment with minimal downtime.
  • Experience with A/B testing and experiment tracking to evaluate model performance in production and measure the impact on business KPIs.
Dev Ops and Automation

Skills:

  • Ability to design and manage scalable infrastructure to support machine learning workloads, ensuring cost efficiency, performance, and security.
  • Proficiency in automating testing and deployment processes for data and model pipelines to support fast, reliable releases.
  • Familiarity with serverless architectures and cloud-native tools for AI, allowing for flexible and efficient resource management.
  • Experience with security best practices, including role-based access control, data encryption, and compliance requirements for data-sensitive applications.
Communication and Collaboration

Skills:

  • Excellent communication skills with the ability to collaborate closely with data scientists, ML engineers, and software development teams.
  • Proven ability to document infrastructure, CI/CD pipelines, and MLOps processes, ensuring transparency and knowledge sharing across teams.
  • Strong problem-solving skills and a proactive approach to troubleshooting, particularly in managing and resolving deployment and performance issues.
  • Ability to train and mentor team members on MLOps tools, best practices, and model deployment techniques.
Additional Qualifications:
  • Experience with data security and governance standards, especially related to machine learning applications in regulated industries.
  • Familiarity with AI ethics and compliance, including model fairness, transparency, and risk management.
  • Knowledge of advanced monitoring and alerting tools and techniques to ensure the reliability of AI systems in production.
  • Strong interest in staying up-to-date on the latest advancements in MLOps and AI Ops to continuously improve infrastructure and processes.
#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary