DevOps/MLOps Engineer
Ashburn, Loudoun County, Virginia, 22011, USA
Listed on 2026-05-16
IT/Tech
Cloud Computing, Systems Engineer, SRE/Site Reliability
DevOps/MLOps Engineer
Niyam is seeking a DevOps/MLOps Engineer to join our team in support of our work with a federal client. This role is responsible for designing, automating, and maintaining scalable infrastructure and deployment pipelines that support both traditional software development and AI/ML model lifecycles. The ideal candidate brings a strong foundation in DevOps practices, cloud‑native technologies, and MLOps frameworks, with a focus on automation, reliability, and secure operations within regulated environments.
This position requires close collaboration with software engineers, data scientists, and cybersecurity teams to deliver resilient and compliant solutions.
This full‑time position is hybrid, based in Ashburn, VA. This position is contingent upon award of contract.
What We Offer
- Flexible Work Hours: Life doesn’t always fit into a 9‑to‑5 schedule. We offer flexibility to help you manage your work‑life balance effectively.
- Remote Work: Niyam understands the value of flexibility. We offer remote work.
- Career Growth: Niyam is not just a job; it’s a career journey. We provide a supportive environment for your professional development and offer fully paid opportunities for training and advancement within the company.
- Great People: Our people are the blueprint of who Niyam is to the industry and community.
- Great Environment: Niyam fosters a great environment where innovation, collaboration, and personal growth thrive.
- Diversity & Inclusion: We believe in the strength of diverse perspectives. Your unique ideas are welcomed and celebrated every day at Niyam.
Responsibilities
- Design, implement, and maintain robust CI/CD pipelines to support continuous integration and delivery of both application code and AI/ML models across development, testing, and production environments.
- Automate infrastructure provisioning, configuration management, and deployment processes using Infrastructure as Code (IaC) tools to ensure consistency, scalability, and repeatability.
- Manage and optimize cloud‑based environments, leveraging platforms such as AWS, Azure, or GCP to support high availability, fault tolerance, and cost efficiency.
- Implement and manage containerization and orchestration technologies (e.g., Docker, Kubernetes) to support scalable, portable, and resilient application and model deployments.
- Monitor system performance, availability, and reliability using centralized logging, metrics, and alerting tools; proactively identify and resolve performance bottlenecks and system issues.
- Ensure seamless integration and promotion of code and models across development, testing, staging, and production environments through automated workflows and release management processes.
- Collaborate with data scientists and ML engineers to operationalize machine learning models, enabling versioning, reproducibility, and continuous model delivery through MLOps best practices.
- Implement and enforce security best practices across the DevOps lifecycle, including secure configurations, vulnerability management, and compliance with federal security standards.
- Support system reliability engineering (SRE) practices, including incident response, root cause analysis, and continuous improvement of system resilience.
- Document infrastructure, pipelines, and operational procedures to support maintainability, auditability, and compliance with federal standards and accreditation requirements.
Qualifications
- US Citizenship with ability to obtain a Public Trust.
- Bachelor’s degree or higher in Computer Science, Engineering, Information Technology, or a related technical discipline from an accredited institution.
- Minimum of 4 years of experience in DevOps, Site Reliability Engineering (SRE), MLOps, or a related field supporting enterprise or mission‑critical systems.
- Hands‑on experience designing and maintaining CI/CD pipelines using tools such as Jenkins, GitLab CI/CD, GitHub Actions, or similar.
- Experience with Infrastructure as Code (IaC) tools such as Terraform, CloudFormation, or Ansible.
- Experience working with cloud platforms such as AWS, Microsoft Azure, or Google Cloud Platform…