Senior Support Engineer
Tata Consultancy Services (TCS) is an equal prospect employer, and embraces diversity in race, nationality, ethnicity, gender, age, physical ability, neurodiversity, and sexual orientation, to create a workforce that reflects the societies we operate in. Our continued commitment to Culture and Diversity is reflected in our people stories across our workforce and implemented through equitable workplace policies and processes.
Roles and Responsibilities- Design and implement observability-as-code solutions using Terraform to deploy monitoring pipelines, dashboards, and alerting strategies across distributed systems.
- Drive observability improvements leveraging industry-leading tools (Dynatrace, ELK, Splunk, Pager Duty) to achieve real-time performance insights and comprehensive system visibility.
- Instrument applications for end-to-end observability, implementing distributed tracing, metrics collection, and log aggregation across Node.js and .NET microservices and event-driven architectures.
- Troubleshoot complex incidents in production environments, diagnosing root causes across multiple service layers, databases, caches, and APIs under load using SLISLO frameworks.
- Investigate and resolve Azure Kubernetes Service (AKS) infrastructure, ensuring reliability and scalability of containerized workloads with deep proficiency in Terraform and Azure managed services (SQL MI, Redis, Functions, Event Grid).
- Translate business requirements into observable, resilient systems that meet defined SLIsSLOs and drive reliability improvements.
- Automate operational tasks to reduce toil and improve system resilience through infrastructure-as-code and CICD best practices.
- Lead incident response and remediation for mission-critical systems, conducting blameless postmortems and building resilience through chaos engineering and tabletop exercises.
- Collaborate cross-functionally with development, platform, and business teams to improve service availability, scalability, and operational excellence.
- Must-have 8 years hands‑on experience in observability, SRE, or Dev Ops roles with proven expertise across infrastructure and application-level reliability.
- Deep expertise in observability tooling Dynatrace, ELK, Splunk, and Pager Duty demonstrated understanding of observability principles (instrumentation, correlation IDs, SLISLO frameworks).
- Advanced proficiency with Azure Kubernetes Service (AKS), Terraform, and Azure.
Salary Range - CA $ 100,000 - CA $ 120,000 Per Year
Tata Consultancy Services Canada Inc. is committed to meeting the accessibility needs of all individuals in accordance with the Accessibility for Ontarians with Disabilities Act (AODA) and the Ontario Human Rights Code (OHRC). Should you require accommodation during the recruitment and selection process, please inform Human Resources.
#J-18808-LjbffrTo Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search: