Technical Lead - AI OPs
Listed on 2026-06-20
-
IT/Tech
AI Engineer (Applied/Software), Machine Learning/ ML Engineer
Technical Lead - AI OPS
We are seeking a highly skilled and visionary Technical Lead, AIOps, to join our growing AI Operations team. In this pivotal role, you will drive the transformation of IT operations through the strategic application of Artificial Intelligence, Machine Learning, and Big Data analytics. You will architect and deliver cutting‑edge automated solutions that leverage observability platforms, advanced ML algorithms, and robust data analysis frameworks to improve system resilience, production availability, and operational efficiency.
This is a technical leadership position at the intersection of innovation and operations. You will lead a team of high‑performing Site Reliability and AIOps engineers, guiding them through complex technical challenges while fostering a culture of collaboration, continuous improvement, and innovation.
- AIOps Platform Design and Delivery:
Design and implement a robust, enterprise‑grade AIOps platform supporting production operations teams across the full incident lifecycle – from initial observation through engagement and resolution. - Observability and Intelligent Monitoring:
Integrate observability platforms such as Dynatrace, Moogsoft, and Splunk with AI/ML capabilities to enable early anomaly detection, trend analysis, and actionable operational insights. - Agentic AI Development:
Build and maintain an agentic AI‑powered virtual assistant that delivers instant, intelligent responses to operational queries – including incident summaries, root‑cause analysis, and recommended remediation steps. - Real‑Time Dashboards and Metrics:
Design and maintain interactive dashboards providing real‑time visibility into key operational metrics such as MTTA and MTTR, enabling proactive decision‑making across engineering and leadership teams. - Generative AI and Automation:
Collaborate with the GAIT Center of Excellence to implement GenAI‑based solutions that automate report generation and streamline operational workflows. Partner with Lines of Business development teams to automate routine tasks and reduce manual intervention. - AI/ML Algorithm Architecture:
Architect and implement AI/ML algorithms tailored to boost IT operational efficiency, predict system failures, and recommend preventive actions before issues impact end users. - Team Leadership and Stakeholder Engagement:
Lead, mentor, and develop a team of Site Reliability and AIOps engineers. Engage effectively with both technical and non‑technical stakeholders to ensure AIOps tools are understood, valued, and widely adopted across the organization.
- Bachelor's Degree Required
- 5 Years Required; 7 Years Preferred
- Sedentary Work
9IC
Required Skills- At least 5 years of experience in a Technology Role
- Experience with programming languages – Python and/or Java
- Hands‑on experience with cloud platforms (AWS, Azure, or Google Cloud)
- Experience using observability tools such as Moogsoft, Dynatrace, or Splunk
- At least 7+ years of experience in a technology role
- Familiarity with data processing frameworks such as Apache Kafka and automation tools such as Ansible Tower
- Solid understanding of machine learning algorithms and data analysis techniques
- Practical experience designing and working with agentic systems and automation agents
- Demonstrated ability to lead and develop high‑performing engineering teams
- Excellent problem‑solving, communication, and interpersonal skills
- Master's degree in computer science, Data Science, AI/ML, or a related field
- Professional certifications in cloud, AI/ML, or data engineering (e.g., AWS Certified Machine Learning Specialty, Google Professional Data Engineer)
- Experience with Dev Ops practices and CI/CD pipelines
- Familiarity with Agile development methodologies
- Agile Methodology
- Continuous Integration and Deployment
- Data Analysis
- Debugging
- Dev Ops
- Enterprise Application Integration
- Operating Systems Management
- Problem Solving
- Programming
- Software Development
- Software Development Life Cycle
- Web Application Development
$130,000/yr – $154,000/yr
Equal OpportunityWe are an Equal Opportunity Employer. TIAA does not discriminate against any candidate or employee on the basis of age, race, color, national origin, sex, religion, veteran status, disability, sexual orientation, gender identity, or any other legally protected status. Our full EEO & Non-Discrimination statement is on our, and you can read more about your rights and view government notices.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).