Lead DevOps Engineer
Job in
Frisco, Collin County, Texas, 75034, USA
Listed on 2025-12-17
Listing for:
Thomson Reuters
Full Time position
Job specializations:
- IT/Tech: Cloud Computing, Systems Engineer, Cybersecurity, IT Support
Job Description & How to Apply Below
New Position:
This position is open due to an existing vacancy to support our evolving business needs.
We are seeking an experienced, hands-on Lead DevOps Engineer to join the Corporates Tax and Trade team in Toronto to drive infrastructure excellence and operational efficiency across our cloud platforms. This role combines deep expertise in multi‑cloud environments with modern AI/ML infrastructure management and microservices architecture.
About the Role
As a Lead DevOps Engineer, you will:
- Lead the design, implementation, and optimization of our DevOps practices while enabling our teams to deliver reliable, scalable solutions. This role is crucial to building, automating, and maintaining our cloud infrastructure and CI/CD pipelines, and to ensuring the reliability and scalability of our applications. You’ll play a key part in fostering a culture of operational excellence, security, and continuous delivery, working closely with development and product teams.
- Infrastructure as Code (IaC): Design, implement, and manage scalable, secure, and highly available cloud infrastructure primarily on AWS, with an understanding of best practices for GCP environments (e.g., using Terraform, CloudFormation).
- Automation & CI/CD: Develop and maintain robust CI/CD pipelines (e.g., GitLab CI/CD, GitHub Actions, Jenkins, AWS CodePipeline) to automate software delivery, testing, and deployment processes.
- Linux System Administration: Provide expert‑level administration, troubleshooting, and optimization for Linux‑based systems, ensuring stability, security, and performance.
- Monitoring & Observability: Implement comprehensive monitoring, logging, and alerting solutions (e.g., Prometheus, Grafana, ELK Stack, CloudWatch, Datadog) to ensure application health, performance, and proactive issue detection.
- Networking & Security: Configure and manage cloud networking components (VPCs, subnets, routing, security groups, firewalls) and implement security best practices (IAM, encryption, least privilege).
- Troubleshooting & Incident Response: Act as a subject‑matter expert for production issues, performing root cause analysis, implementing preventative measures, and participating in on‑call rotations (as required).
- Collaboration: Work closely with software engineers, data scientists, and product managers to understand their needs and provide reliable, efficient, and secure infrastructure solutions.
- Continuous Improvement: Identify and implement improvements to existing systems, tools, and processes to enhance efficiency, reduce costs, and improve reliability.
- Documentation: Create and maintain clear, concise documentation for infrastructure, processes, and playbooks.
You are a fit for the role of Lead DevOps Engineer if your background includes:
- 7+ years of experience in DevOps, Site Reliability Engineering, or Infrastructure Engineering roles, with at least 2 years in a lead or senior capacity.
- Deep expertise in Linux System Administration: Command‑line proficiency, shell scripting, process management, networking, file systems, user/group management, and security best practices.
- Strong proficiency with AWS: Experience with core AWS services such as EC2, S3, RDS, VPC, IAM, Lambda, EKS/ECS, and CloudWatch, and an understanding of well‑architected principles.
- Familiarity with GCP (Google Cloud Platform): Hands‑on experience with at least a few core GCP services (e.g., GCE, GCS, GKE, Cloud Functions, IAM) is a significant advantage.
- Expertise in Infrastructure as Code (IaC) tools: Proven experience with Terraform (preferred), AWS CloudFormation, Pulumi, or similar.
- Solid experience with CI/CD tools and methodologies: e.g., GitLab CI/CD, GitHub Actions, Jenkins, or similar.
- Proficiency in at least one scripting language: Python (preferred), Bash, Go, or similar.
- Experience with containerization and orchestration: Docker and Kubernetes (EKS, GKE, or self‑managed).
- Understanding of networking fundamentals: TCP/IP, DNS, Load Balancing, VPNs.
- Experience with monitoring and logging tools: Prometheus, Grafana, Datadog, Splunk, CloudWatch, or similar.
- Strong problem‑solving skills: Ability to diagnose complex issues across various layers of the…