Sr Platform Engineer; Contractor
Job in Newport Beach, Orange County, California, 92659, USA
Listed on 2026-02-16
Listing for: Pacific Asset Management, LLC
Contract position
Job specializations:
- IT/Tech: Cloud Computing, Systems Engineer, SRE/Site Reliability
Job Description & How to Apply Below
* Design, build, and maintain scalable, reliable, and secure cloud infrastructure on AWS that supports the organization's internal development platform and production workloads.
* Implement and manage Infrastructure as Code (IaC) using Terraform to ensure consistency, repeatability, and version control of infrastructure.
* Build and maintain CI/CD pipelines and deployment automation to enable efficient and reliable software delivery across multiple environments.
* Establish and maintain platform observability through comprehensive monitoring, logging, and alerting solutions to ensure system reliability and performance.
* Design and implement container orchestration platforms using Kubernetes or similar technologies to support microservices architecture and workload management.
* Implement and enforce security best practices across the platform, including network security, access controls, secrets management, and compliance with industry standards and regulations.
* Optimize platform performance, cost, and resource utilization through continuous analysis and improvement of infrastructure components.
* Collaborate with development teams to define platform requirements, SLAs, and service level objectives (SLOs) that support application needs.
* Build self-service platform capabilities and APIs that enable development teams to provision and manage infrastructure resources independently.
* Develop disaster recovery and business continuity strategies, including backup solutions and incident response procedures.
* Provide technical leadership and mentorship to platform and operations teams on infrastructure best practices and emerging technologies.
* Create and maintain comprehensive documentation for platform architecture, runbooks, and operational procedures.
* **5+ years** of experience in platform engineering, site reliability engineering, DevOps, or infrastructure engineering roles.
* Proven experience designing, deploying, and managing production-grade cloud infrastructure on AWS (required).
* Strong hands-on experience with Terraform for Infrastructure as Code (IaC) (required).
* Extensive experience with **containerization technologies** (Docker, Podman) and **container orchestration platforms** (Kubernetes, ECS, or similar).
* Deep understanding of **CI/CD principles** and hands-on experience building and maintaining automated deployment pipelines using tools like Jenkins, GitLab CI, GitHub Actions, or Azure DevOps.
* Strong proficiency in Python for automation and tooling development (required).
* Experience with **configuration management tools** like Ansible, Puppet, Chef, or Salt.
* Strong understanding of **networking concepts**, **security principles**, and **cloud architecture best practices**.
* Excellent **troubleshooting and problem-solving skills** with the ability to diagnose and resolve complex infrastructure issues.
* Excellent **communication and collaboration skills** with the ability to work effectively with cross-functional teams and stakeholders.
* Ability to write clear, concise, and comprehensive **technical documentation**.
* Advanced proficiency in Python for building platform automation, tooling, and infrastructure management solutions.
* Experience with Go is a plus.
* Experience with **service mesh technologies** (Istio, Linkerd, Consul) and API gateway solutions.
* Hands-on experience with **observability and monitoring tools** such as Prometheus, Grafana, Datadog, New Relic, ELK Stack, or Splunk.
* Knowledge of **database administration and management** for both SQL and NoSQL databases in cloud environments.
* Experience implementing **GitOps workflows** using tools like ArgoCD, Flux, or similar.
* Strong understanding of **cloud-native architecture patterns**, **microservices design**, and **distributed systems**.
* Experience with **secrets management solutions** (HashiCorp Vault, AWS Secrets Manager, Azure Key Vault).
* Knowledge of **compliance frameworks and security standards** (SOC 2, ISO 27001, HIPAA, PCI-DSS).
* Experience with **infrastructure cost optimization** and **FinOps practices**.
* Familiarity with **chaos engineering principles** and **resiliency testing**.
* Strong **incident management** and **on-call experience** with proven ability to lead incident response and post-mortem activities.
* AWS Certification (Solutions Architect, DevOps Engineer, or SysOps Administrator) is highly preferred.
* **Linux/Unix:** Expert-level understanding of Linux/Unix systems, system administration, and performance tuning
* **Python (Required):** Expert-level proficiency for infrastructure automation and tooling development; Bash scripting experience
* **Terraform (Required):** Expert-level proficiency for managing cloud infrastructure; CloudFormation familiarity is a plus
* **AWS (Required):** Extensive hands-on experience with VPC, EC2, ECS/EKS, Lambda, RDS, S3, IAM, CloudWatch, and the AWS Well-Architected Framework
* **Containerization:** Deep expertise in Docker and…