Metrics Platform Site Reliability Engineer

Job in Atlanta, Fulton County, Georgia, 30383, USA
Listing for: Capgemini
Full Time position
Listed on 2025-12-08
Job specializations:
  • IT/Tech
    Cloud Computing, SRE/Site Reliability

Job Location – Atlanta, GA

Responsibilities
  • Manage and mentor a team of Site Reliability Engineers
  • Define and implement SRE strategies and best practices in alignment with organizational objectives
  • Monitor clients' service level agreements (SLAs), service level objectives (SLOs) and service level indicators (SLIs)
  • Lead initiatives to improve system reliability, availability, scalability and performance
  • Collaborate with development and operations teams to ensure reliability and resiliency goals are met
  • Implement and improve incident management processes to minimize downtime and ensure timely resolutions
  • Review and contribute to the architecture of critical systems ensuring they meet reliability and performance goals
  • Drive observability practices by implementing robust monitoring, logging and alerting systems
  • Implement and maintain monitoring systems to proactively identify potential issues and alert engineers to problems before they impact users
  • Respond to incidents and outages, diagnose problems and implement solutions to minimize downtime and restore service
  • Automate repetitive tasks and processes to improve efficiency and reduce manual effort
  • Identify and address performance bottlenecks to ensure systems run efficiently and effectively
  • Manage and maintain the underlying infrastructure including servers, networks and cloud resources
  • Plan for future capacity needs to ensure systems can handle anticipated workloads
  • Develop and maintain processes for deploying software updates and releases
  • Work closely with developers, operations teams and other stakeholders to ensure system reliability and availability
  • Maintain clear and concise documentation of systems, processes and procedures
  • Identify areas for improvement and implement changes to enhance system reliability and performance
Skills Required
  • Proficiency in writing Splunk queries and alerts is a must
  • Hands‑on experience with at least one APM tool (New Relic, AppDynamics, Honeycomb or Datadog) is a must
  • Expertise in automation tools and scripting languages (Python or JavaScript) is a must
  • Proficiency in a scripting language (Python or Node.js) is a must
  • Proficiency in at least one cloud platform (AWS, GCP or Azure) is a must
  • Strong understanding of distributed systems, microservices architecture and container orchestration tools (e.g., Kubernetes)
  • Experience with monitoring tools such as Prometheus and Grafana is a must
Benefits
  • Flexible work
  • Healthcare including dental, vision, mental health, and well‑being programs
  • Financial well‑being programs such as 401(k) and Employee Share Ownership Plan
  • Paid time off and paid holidays
  • Paid parental leave
  • Family building benefits like adoption assistance, surrogacy, and cryopreservation
  • Social well‑being benefits like subsidized back‑up child/elder care and tutoring
  • Mentoring, coaching and learning programs
  • Employee Resource Groups
  • Disaster Relief

Disclaimer

Capgemini is an Equal Opportunity Employer encouraging diversity in the workplace. All qualified applicants will receive consideration for employment without regard to race, national origin, gender identity/expression, age, religion, disability, sexual orientation, genetics, veteran status, marital status or any other characteristic protected by law.

This is a general description of the Duties, Responsibilities and Qualifications required for this position. Physical, mental, sensory or environmental demands may be referenced in an attempt to communicate the manner in which this position traditionally is performed. Whenever necessary to provide individuals with disabilities an equal employment opportunity, Capgemini will consider reasonable accommodations that might involve varying job requirements and/or changing the way this job is performed, provided that such accommodations do not pose an undue hardship.

Capgemini is committed to providing reasonable accommodations during our recruitment process. If you need assistance or accommodation, please reach out to your recruiting contact.
