Senior Site Reliability Engineer
Listed on 2026-02-06
Software Development
Software Engineer, Cloud Engineer - Software, DevOps
Senior Site Reliability Engineer - Waltham, MA
Dentsply Sirona is the world’s largest manufacturer of professional dental products and technologies, with a 130-year history of innovation and service to the dental industry and patients worldwide. Dentsply Sirona develops, manufactures, and markets a comprehensive solutions offering including dental and oral health products as well as other consumable medical devices under a strong portfolio of world-class brands. Dentsply Sirona’s products provide innovative, high-quality and effective solutions to advance patient care and deliver better and safer dentistry.
Dentsply Sirona's Waltham, MA location is hiring a Sr. Site Reliability Engineer to join a global team that will ensure system reliability and performance. Together, this team will act as 24/7 emergency 2nd/3rd level support for products, restoring services ASAP when downtime occurs. This role is partially remote, providing a mix of working remotely and in the office.
KEY RESPONSIBILITIES
- Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding.
- Partner with development and operations teams to improve services through rigorous testing and release procedures; perform root cause analyses and implement solutions.
- Partner with architecture teams.
- Improve existing systems through automation and uplifts.
- Participate in system design consulting and platform management.
- Balance feature development speed and reliability with well-defined service-level objectives.
ACCOUNTABILITIES
- Run the production environment by monitoring availability and taking a holistic view of system health.
- Build software and systems to manage platform infrastructure and applications.
- Improve reliability and quality of products in our microservice architecture.
- Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement.
- Act as 24/7 emergency 2nd/3rd level support for products; restore services ASAP when downtime occurs.
EDUCATION AND EXPERIENCE
- Bachelor's or Master's degree in Computer Science or Software Engineering or relevant experience.
- At least 5 years' experience in a Site Reliability Engineering / Platform Engineering / DevOps role or similar.
- Excellent troubleshooting skills and proven experience resolving production downtime with immediate and long-term solutions.
- A deep understanding of algorithms, data structures, complexity analysis and software design.
- Good analytical skills coupled with excellent communication skills; professional English is required, German is a bonus.
- At least Google Associate Cloud Engineer certification; higher certifications are a bonus.
TECHNICAL SKILLS
- Experience with Kubernetes and GCP cloud both as an admin and user.
- Previous software development experience in one of: Golang, C++, or any other modern programming language; Flutter experience is a bonus.
- Extensive knowledge of relational databases, file systems and Linux.
- Familiarity with monitoring tools (e.g. Datadog) and project tracking software (e.g. Jira).
- Proficiency in building / maintaining CI and CD pipelines.
- Experience working with container orchestration platforms such as Kubernetes.
- Good understanding of systems automation and IT Security.
Dentsply Sirona is an Equal Opportunity/Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, age, sexual orientation, disability, or protected Veteran status.