More jobs:
Site-Reliability Engineer; W2 - Locals to AZ In-person
Job in
Scottsdale, Maricopa County, Arizona, 85261, USA
Listed on 2026-05-18
Listing for:
Saransh Inc
Full Time
position Listed on 2026-05-18
Job specializations:
-
IT/Tech
Cloud Computing: Infrastructure & Operations, Systems Engineer, SRE/Site Reliability, IT Support
Job Description & How to Apply Below
Job Title
Site‑Reliability Engineer
LocationScottsdale, AZ (Onsite from Day
1)
Contract (W2)
Interview Process1 level of internal evaluation.
3 Levels of Client Interviews – 2 Telephonic and 1 In-person. The last round can be held either in Richardson, Texas or Scottsdale, Arizona.
NOTE: Local candidates only as the client interview is in person.
Core Required Skills- Python – Yes
- Helm – Yes
- Google Cloud Platform – Yes
- Kubernetes – Yes
7+ years of relevant experience.
Mandatory Skills- Google Cloud Platform (GCP) Containerization, Kubernetes
- Infrastructure as Code (Terraform), CI/CD (Git Hub Actions), and Helm
- Automation and scripting using Python, Ansible, and Node.js
- Monitoring and observability with Prometheus and Grafana
- Linux systems and troubleshooting
- Service reliability/operation experience running large-scale, high-performance applications in a hybrid environment (on-prem and cloud).
- Experience in writing automation scripts and building dashboards for Application Performance management to manage Transaction journeys.
- Experience working with programming languages such as Go, Python, Java, Rust etc.
- Working knowledge of at least one database – Oracle, SQL Server, Redis, Clickhouse, Postgre
SQL, Mongo or any time-series databases. - Experience in transitioning platforms to the cloud and containerization on GCP and Rancher.
- Experience maintaining containerized apps in GKE/RKE/AKE environments.
- Experience implementing Cloud observability using OTEL to enable real‑time monitoring, distributed tracing and incident resolution.
- Experience working with specific Graph
QL Frameworks (Apollo, Prisma, Hasura etc…). - Experience using knowledge of networking protocols such as TCP/IP, HTTP, DNS, Load balancing and service mesh to troubleshoot issues in high‑pressure situations.
- Proven experience managing application availability, building creative solutions to manage repetitive activities, improving gating and detection for applications at every touchpoint for a 24x7 high‑availability platform exposed to critical clients and customers.
- Working knowledge of monitoring tools – Splunk, App‑Dynamics, Grafana/Prometheus and Dynatrace.
- Experience with tools like Rally, Confluence and other CI/CD extenders.
- Hands‑on experience with implementing in‑memory caching solutions. Experience with Redis database is a plus.
- Excellent debugging skills across a variety of integrated technical platforms on API gateway.
- Hands‑on with GCS, Cloud SQL, Spanner and Firestore.
- Extensive experience in enterprise level infrastructure and operations.
- Experience in high availability and distributed systems, Linux and Windows administration, troubleshooting and support.
- Monitor and troubleshoot Hashi Corp Vault environments, ensuring minimal downtime and rapid recovery from incidents.
- Working knowledge of Vertex AI, Gen AI and Big Query.
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×