×
Register Here to Apply for Jobs or Post Jobs. X

Infrastructure Engineer

Job in Roseland, Essex County, New Jersey, 07068, USA
Listing for: Advanced Tech Placement
Full Time position
Listed on 2026-05-30
Job specializations:
  • IT/Tech
    SRE/Site Reliability, Systems Engineer, Cloud Computing
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below

Infrastructure Engineer

We are seeking a highly skilled Infrastructure Engineer to help design, build, automate, and operate scalable, high‑availability production infrastructure in a fast‑paced enterprise technology environment. This individual will play a key role in driving reliability, automation, cloud infrastructure strategy, operational excellence, and AI‑enabled engineering practices across mission‑critical systems.

Responsibilities
  • Design, build, automate, and support large‑scale, highly available cloud infrastructure environments
  • Manage and optimize containerized production platforms and orchestration environments
  • Develop and maintain Infrastructure as Code (IaC) solutions using tools such as Terraform or Pulumi
  • Build automation tooling, operational utilities, and platform enhancements using Python or Go
  • Drive infrastructure reliability, scalability, observability, and resiliency initiatives
  • Partner closely with engineering, product, security, AI/ML, and platform teams to support enterprise‑wide initiatives
  • Implement and maintain monitoring, logging, alerting, and performance management solutions
  • Troubleshoot complex production issues and proactively identify systemic risks or operational weaknesses
  • Lead infrastructure improvements with a focus on reversibility, risk mitigation, and minimizing production blast radius
  • Create operational standards, automation frameworks, and deployment strategies that improve engineering velocity and reliability
  • Support AI‑driven infrastructure operations, intelligent automation initiatives, and AI‑assisted engineering workflows
  • Evaluate and implement emerging AI‑enabled operational tooling to improve efficiency, incident response, automation, and developer productivity
  • Collaborate with engineering teams supporting AI/ML workloads, data platforms, and model deployment pipelines
  • Own infrastructure initiatives end‑to‑end, including architecture, implementation, rollout, rollback planning, and operational support
Requirements
  • 5 years of experience in Infrastructure Engineering, Dev Ops, Site Reliability Engineering, or similar roles supporting large‑scale production environments
  • Hands‑on experience operating containerized production environments and orchestration platforms in enterprise or high‑growth environments
  • Strong experience with Kubernetes, Helm, and Infrastructure as Code tools such as Terraform or Pulumi
  • Experience supporting cloud infrastructure environments, preferably AWS
  • Proficiency in Python or Go for automation, tooling, and infrastructure development
  • Strong experience with monitoring, observability, and logging platforms such as Prometheus, Grafana, ELK, or equivalent technologies
  • Experience implementing resilient infrastructure designs focused on scalability, reliability, rollback strategies, and operational safety
  • Strong understanding of infrastructure tradeoffs involving reliability, cost optimization, deployment velocity, and operational risk
  • Demonstrated experience leveraging AI‑assisted engineering tools and agentic AI workflows within day‑to‑day development and operational practices
  • Experience utilizing AI‑enabled platforms such as Claude Code, Codex, Git Hub Copilot, or similar tools to improve automation, troubleshooting, deployment efficiency, and operational workflows
  • Familiarity with infrastructure requirements supporting AI/ML environments, including compute scalability, data processing pipelines, model deployment, or GPU‑enabled workloads is highly desirable
Required Skills
  • Excellent communication and cross‑functional collaboration skills
  • Strong analytical and problem‑solving capabilities
  • Ability to challenge assumptions, identify operational gaps, and recommend innovative infrastructure solutions
  • Proven ownership mindset with experience leading infrastructure initiatives from concept through production deployment
  • Strong organizational skills with the ability to prioritize and execute in fast‑paced environments
  • Passion for continuous improvement, emerging technologies, and modern AI‑enabled operational practices
Preferred Skills
  • Software engineering background with experience building and maintaining production‑grade applications, services, libraries, or internal frameworks
  • Ability to read, troubleshoot, and modify application codebases supporting infrastructure platforms
  • Experience bridging infrastructure engineering and software development practices
  • Experience building reusable platform tooling, developer enablement frameworks, or internal infrastructure products
  • Experience supporting enterprise‑scale cloud transformation or modernization initiatives
  • Exposure to MLOps, AI infrastructure, vector databases, model serving frameworks, or intelligent automation platforms
  • Experience supporting AI/ML engineering teams through scalable infrastructure and deployment automation
#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary