×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineer II- CTJ - Secret

Job in Redmond, King County, Washington, 98053, USA
Listing for: Microsoft Corporation
Full Time position
Listed on 2026-06-03
Job specializations:
  • IT/Tech
    Systems Engineer, Cybersecurity, Cloud Computing
Job Description & How to Apply Below
Overview The IDEAS organization's mission is to unlock the power of data to deliver actionable insights and personalized experiences  work supports Microsoft 365, Azure, Windows, and other platforms by enabling reliable, secure, and compliant data services. As part of this team, you will collaborate with partners across the company-including product engineering, data science, and operations-to solve complex problems using modern data platforms, cloud analytics, and AI-assisted tooling.

As a Site Reliability Engineer (SRE), you will focus on automation, incident response, and data-driven reliability improvements for services operating in regulated government cloud environments. You will contribute to live site operations, partner closely with engineering teams, and help evolve systems to operate reliably and  IDEAS? Joining IDEAS means contributing to how Microsoft uses data to deliver reliable, secure, and impactful services.

You will work on meaningful systems, collaborate with diverse teams, and help shape platforms that serve customers at global scale. If you are motivated by improving reliability through engineering, data, and collaboration, we encourage you to apply. Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals.

Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. Responsibilities
* Participate as a Designated Responsible Individual (DRI) in a 24x7 on-call rotation, monitoring service health, responding to incidents within defined SLAs, and contributing to post-incident reviews and learning.
* Design, build, and maintain automation for deployment, operations, and incident mitigation to improve reliability and reduce manual effort.
* Instrument services for observability; collect and analyze telemetry and health signals; and use data to guide reliability and performance improvements.
* Collaborate with engineering partners and stakeholders to align on goals, share operational insights, and deliver user-focused solutions.
* Apply engineering best practices for development, scaling, and operational excellence to meet performance and customer requirements.
* Support compliance with security, privacy, and accessibility requirements throughout service onboarding and ongoing operations.
* Continuously learn and adopt industry practices and internal tools to improve reliability, performance, and observability. Qualifications

Required Qualifications:

* Master's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration
* OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 2+ years technical experience in software engineering, network engineering, or systems administration
* OR equivalent experience. Bachelor's Degree in Computer Science, or related technical discipline with proven experience coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python *
  • OR equivalent experience.
    * Experience with automation, live site operations, and incident response in large-scale cloud or distributed systems.
    * Proficiency in at least one programming or scripting language (for example: C#, Java, Python, or Power Shell).
    * Strong analytical and problem-solving skills, including experience using telemetry and operational data to inform decisions.
    * Effective written and verbal communication skills, and experience collaborating across teams and disciplines.
    * Ability to meet Microsoft, customer, and/or government security screening requirements, including passing the Microsoft Cloud Background Check upon hire and periodically thereafter. Other Requirements:
    Security Clearance Requirements:
    Candidates must be able to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:
    * The successful candidate must have an active U.S. Government Secret Security Clearance. Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. Failure to maintain or obtain the appropriate clearance and/or customer screening requirements may result in employment action up to and including termination.
    * Clearance Verification:
    This position requires successful verification of the stated security clearance to meet federal government customer requirements. You will be asked to provide clearance verification information prior to an offer of employment. Microsoft Cloud Background Check:
    This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years…
  • To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
    (If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
     
     
     
    Search for further Jobs Here:
    (Try combinations for better Results! Or enter less keywords for broader Results)
    Location
    Increase/decrease your Search Radius (miles)
    0
    200
    Filters
    Education Level
    Experience Level (years)
    Posted in last:
    Salary