Platform & Production Operations Engineer
Listed on 2026-06-13
IT/Tech
SRE/Site Reliability, Cloud Computing
About Smart Access
Smart Access is the AI execution layer for supply chains. We're a Frontline Execution Platform that helps the world's largest warehouses, manufacturers, and logistics operations close the gap between defined standards and actual frontline behavior. Our system connects standards, observations, coaching, and intelligence into a single execution loop that drives consistent, measurable performance on the floor.
We work with operational leaders, frontline supervisors, safety teams, and continuous improvement leaders to systematically close the execution gap — the distance between what standards say should happen and what actually happens. As supply chains adopt AI, Smart Access is positioned to become the operational intelligence layer that AI agents rely on to drive frontline action.
The company just closed a Series A funding round, marking a significant milestone in its growth journey. This is a pivotal moment to join: the team is lean, the trajectory is steep, and the roles being hired now will be foundational to how Smart Access scales. If you thrive in a high-ownership, high-impact environment and want to help build something from the ground up, this is the opportunity.
The Role
Smart Access is seeking a Senior Platform & Production Operations Engineer to help build, operate, and scale the infrastructure that powers our AI-enabled frontline execution platform.
This is a hands-on role that combines elements of cloud infrastructure, DevOps, platform engineering, and production operations. You will be responsible for maintaining reliable production systems, improving deployment and operational processes, strengthening observability and monitoring, and partnering closely with engineering teams to ensure our platform scales securely and efficiently.
As an early member of a growing engineering organization, you will have significant ownership and influence over how our infrastructure, deployment practices, operational tooling, and production processes evolve.
This role is ideal for an engineer who enjoys both building systems and operating them in production.
What You'll Do
- Own and improve production operations across Smart Access cloud environments
- Monitor, troubleshoot, and resolve production incidents and performance issues
- Improve reliability, availability, scalability, and operational excellence across the platform
- Build and maintain cloud infrastructure and deployment automation
- Partner with application engineers to support development, testing, deployment, and production readiness
- Develop monitoring, alerting, observability, and operational dashboards
- Participate in incident response, root cause analysis, and operational reviews
- Improve security, access controls, secrets management, and operational compliance practices
- Optimize cloud infrastructure utilization and cost efficiency
- Create operational runbooks, documentation, and repeatable support processes
- Become deeply familiar with the Smart Access application stack and operate independently within 6–9 months
Required Qualifications
- 8+ years of experience operating and supporting production SaaS applications
- Experience managing cloud-based applications in production environments
- Strong experience with Google Cloud Platform (GCP)
- Experience supporting highly available, multi-tenant SaaS platforms
- Experience with CI/CD pipelines and deployment automation
- Experience with infrastructure-as-code tools, preferably Terraform
- Strong troubleshooting and production incident management skills
- Experience implementing and operating monitoring, observability, alerting, and incident response processes
- Experience working closely with software engineering teams in agile environments
- Strong written and verbal communication skills
- Ability to operate independently and take ownership in a fast-moving startup environment
Preferred Qualifications
- Experience operating applications built on Google Cloud Run
- Experience supporting applications built with FastAPI and Django
- Experience with containerized application deployments
- Experience supporting AI-enabled or data-intensive applications
- Experience with observability platforms such as New…