Software Engineering SMTS - Cloud Reliability Job New York New York USA,IT/Tech

Location: New York

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.

Job Category:
Software Engineering

Job Details About Salesforce

Salesforce is the #1 AI CRM, where humans with agents drive customer success together. Here, ambition meets action. Tech meets trust. And innovation isn't a buzzword - it's a way of life. The world of work as we know it is changing and we're looking for Trailblazers who are passionate about bettering business and the world through AI, driving innovation, and keeping Salesforce's core values at the heart of it all.

Ready to level-up your career at the company leading workforce transformation in the agentic era? You're in the right place! Agentforce is the future of AI, and you are the future of Salesforce.

Job Title: Senior Member of Technical Staff (SMTS) - Site Reliability Engineer (Cloud Automation)

Location: New York, NY;
San Francisco, CA

About the Team

The Cloud Platform Engineering team builds and operates the highly available, active-active mission-critical infrastructure that powers Salesforce treat the internal cloud as a product designed to maximize developer velocity through automation-first thinking and a strict "No Ticket-Ops" philosophy. We are defining the next generation of platform engineering by running a LEAN, innovative team that leverages AI as much as humanly possible.

By integrating AI agents directly into our Git Ops workflows and our enterprise WorkOS (Slack), we aim to build a smart, secure platform that our internal developers love.

The Shared Team DNA

While every member of our team has a distinct focus area, we are all "T-shaped" engineers who learn from one another. Regardless of your title, you must share our collective passion for:

Customer Focus:
Treating internal developers as our primary customers and prioritizing their velocity and user experience.
Automation:
Eradicating manual toil and "ticket-ops" via Git Ops and AI-augmented workflows.
Security:
Believing that security should be "shifted left" and built into the code, not bolted on as an afterthought.
SRE Mindset:
Engineering for failure, prioritizing self-healing systems, and maintaining an 99.999% availability standard.
Observability:
Relying on telemetry, centralized logging, and Chat Ops to proactively identify and resolve issues.

About the Role

While the Architect determines how our platform should be designed, you are the engineer who actually builds the cloud infrastructure engine. As a Senior SRE focusing on Cloud Automation, you will partner closely with our enterprise CI/CD teams to seamlessly integrate our platform's capabilities into the developer workflow. You are responsible for building the infrastructure vending machines and the enterprise-grade infrastructure-as-code (IaC) modules that abstract away cloud complexity.

You will empower internal developers to provision secure, compliant environments in minutes via self-service Chat Ops workflows.

Your Impact - Responsibilities

The Vending Machine:
Build, maintain, and scale the automated provisioning workflows that orchestrate the creation of new, fully governed multi-account cloud environments.
"Golden Modules":
Author, test, and maintain a library of pre-approved Infrastructure-as-Code templates that internal developers will consume. Ensure these modules enforce our strictest standards by default.
Shift-Left Integrations:
Partner with the enterprise CI/CD team to plug our platform's automated security scanning, Policy-as-Code evaluations, and cost-estimation checks directly into the developer's Pull Request process.
Resilience & Observability:
Implement data-plane-driven automated failover mechanisms, and develop integrations that connect our provisioning tools to our enterprise WorkOS (Slack) for real-time operational intelligence.

Minimum Qualifications

Bachelor's degree in Computer Science, Computer Engineering, Software Engineering or relevant work experience.
7+ years of software engineering or Site Reliability Engineering experience in large-scale cloud environments.
Expert-level proficiency in Infrastructure-as-Code (strictly Terraform) and…


Increase/decrease your Search Radius (miles)



Job Posting Language