×
Register Here to Apply for Jobs or Post Jobs. X

Staff Engineer - SRE - Retail & Pharmacy

Job in Woonsocket, Providence County, Rhode Island, 02895, USA
Listing for: Hispanic Alliance for Career Enhancement
Full Time position
Listed on 2025-12-06
Job specializations:
  • IT/Tech
    Systems Engineer, IT Support
Job Description & How to Apply Below

At CVS Health, we’re building a world of health around every consumer and surrounding ourselves with dedicated colleagues who are passionate about transforming health care. As the nation’s leading health solutions company, we reach millions of Americans through our local presence, digital channels and more than 300,000 purpose-driven colleagues - caring for people where, when and how they choose in a way that is uniquely more connected, more convenient and more compassionate.

And we do it all with heart, each and every day.

The Staff Engineer - SRE, Retail & Pharmacy will implement and maintain comprehensive observability solutions, providing real-time insights into the performance and overall health of systems to proactively identify and address potential issues. This role is responsible for investigating and resolving incidents quickly during critical situations and performing root cause analysis to prevent future recurrence. You will collaborate with cross-functional teams to build robust monitoring, alerting, and telemetry solutions, enabling proactive issue detection and resolution across distributed systems.

As a senior member of the SRE team, you will drive best practices, mentor others, and shape the strategic evolution of our observability ecosystem in a complex, edge-centric architecture.

What You Will Do:
  • Observability Strategy & Implementation
    • Design and implement comprehensive observability solutions tailored for edge computing environments, including monitoring, logging, tracing, and metrics collection, to provide deep visibility into system performance and health across distributed remote facilities
    • Define and maintain Service Level Indicators (SLIs), Service Level Objectives (SLOs), and business KPIs to measure and enhance system reliability in edge and centralized infrastructure
    • Build and optimize dashboards, visualizations, and alerting systems to enable real-time insights and rapid incident response for edge nodes and remote facilities
    • Implement distributed tracing and log aggregation systems to troubleshoot complex issues in edge computing environments
  • System Reliability & Performance
    • Collaborate with engineering teams to ensure applications and infrastructure at edge locations are designed with observability in mind, incorporating best practices for instrumentation and monitoring in resource-constrained environments
    • Drive proactive identification of issues in edge facilities through advanced observability tools, reducing Mean Time to Detect (MTTD) and Mean Time to Resolve (MTTR) across distributed systems
    • Lead incident postmortems, analyzing root causes specific to edge environments and implementing observability-driven improvements to prevent recurrence
  • Tooling & Automation
    • Develop and maintain tools, scripts, and automation to enhance observability pipelines, optimizing for the unique challenges of edge computing, such as bandwidth limitations and intermittent connectivity
    • Evaluate and integrate industry-standard observability tools (e.g., Prometheus, Grafana, ELK Stack, Open Telemetry) and recommend solutions tailored for edge computing use cases
    • Optimize observability data storage, retention, and querying to balance performance, cost, and scalability across a large number of remote facilities
  • Leadership & Collaboration
    • Mentor and guide junior SREs and engineers on observability best practices for edge computing, fostering a culture of reliability and proactive monitoring
    • Partner with solution, engineering, and business teams to align observability efforts with business objectives, ensuring seamless operation of edge and centralized systems
    • Lead cross-functional initiatives to improve observability, reliability, and operational efficiency across distributed edge infrastructure
  • Continuous Improvement
    • Stay current with emerging observability trends, tools, and methodologies, particularly those suited for edge computing and distributed systems, and advocate for their adoption
    • Contribute to the development of observability standards, runbooks, and documentation tailored for edge environments to ensure consistency and scalability
    • Drive cost optimization for observability infrastructure while…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary