DevOps Engineer
Job in New York, New York County, New York, 10261, USA
Listed on 2026-06-12
Listing for: Scale.jobs
Full Time position
Job specializations: IT/Tech, SRE/Site Reliability, Cloud Computing
Job Description & How to Apply Below
About the Role
The role focuses on the architecture, automation, and reliability of high‑traffic production environments, ensuring that infrastructure scales seamlessly alongside growing user demand. This position sits at the intersection of software engineering and systems operations, bridging the gap between application code and cloud resources. The engineer will be responsible for building internal tooling and self‑service platforms that enable product teams to deploy rapidly while maintaining strict uptime and security standards.
Success in this role means reducing operational friction through robust CI/CD pipelines and proactive observability.
Responsibilities
- Design and maintain infrastructure as code (IaC) using Terraform or Pulumi to manage multi‑region cloud environments on AWS or GCP
- Orchestrate containerized applications using Kubernetes, including managing EKS/GKE clusters, ingress controllers, and service meshes
- Develop and optimize automated CI/CD pipelines using GitHub Actions, GitLab CI, or Jenkins to streamline the path to production
- Implement comprehensive monitoring, logging, and tracing solutions using Prometheus, Grafana, and the ELK stack to improve system observability
- Collaborate with backend engineers to troubleshoot complex distributed systems issues and perform root cause analysis on production incidents
- Automate routine operational tasks through Python or Go scripting to eliminate manual toil and improve system consistency
Requirements
- 3–7 years of experience in DevOps, Site Reliability Engineering, or Infrastructure Engineering in a production cloud environment
- Hands‑on expertise with infrastructure as code tools, specifically Terraform, and container orchestration with Kubernetes
- Strong proficiency in Linux systems administration and networking fundamentals (DNS, TCP/IP, load balancing, VPCs)
- Experience writing production‑quality code in Python, Go, or Ruby for automation and internal tooling
- Familiarity with cloud‑native security best practices, including IAM policy management and secret management (Vault)
- Bonus: Experience with service mesh technologies (Istio/Linkerd), serverless architectures, or managing large‑scale PostgreSQL/NoSQL databases