More jobs:
Platform Engineer
Job in
Richmond, Henrico County, Virginia, 23214, USA
Listed on 2026-06-02
Listing for:
Apex Systems
Full Time
position Listed on 2026-06-02
Job specializations:
-
IT/Tech
Systems Engineer, Cloud Computing, SRE/Site Reliability
Job Description & How to Apply Below
Job#: 3032578
Platform EngineerApex Systems is seeking an experienced Platform Engineer to help build, automate, and operate the cloud platform foundation supporting a major migration from on‑premise systems to AWS. This role focuses on delivering secure, reliable, and scalable platform services—including networking, compute, storage, container platforms, and automation tooling—while enabling application teams through self‑service, Infrastructure as Code, and modern cloud‑native patterns. You will work closely with Cloud Architecture, Dev Sec Ops , Networking, and Security teams to ensure all platform components meet organizational security requirements.
CloudPlatform Engineering & Core Infrastructure
- Build and operate foundational AWS services, including VPCs, Transit Gateway, Direct Connect/VPN, IAM, KMS, Cloud Watch, and ECS/EKS.
- Implement secure, scalable compute, storage, and networking patterns aligned with Landing Zone architecture.
- Deploy and manage container platforms (ECS Fargate, EKS, or Kubernetes variants) supporting modernization of legacy middleware.
- Build platform services such as service mesh, API gateways, logging pipelines, and centralized monitoring.
- Implement secrets management, encryption, and secure service‑to‑service communication.
- Support migration of VMware, Windows, Linux, and middleware workloads into AWS using standardized platform patterns.
- Ensure platform components meet organizational and Zero Trust requirements for authentication, authorization, logging, and auditability.
- Apply AI‑assisted observability, anomaly detection, and predictive alerting to improve platform reliability.
- Build and maintain IaC using Terraform, Cloud Formation, and Ansible for AWS and hybrid environments.
- Develop reusable Terraform modules, Cloud Formation templates, and platform blueprints for consistent provisioning.
- Implement Git‑based IaC workflows with automated plan/apply pipelines (Git Hub, Git Lab, Azure Repos).
- Automate provisioning of accounts, networks, compute, and platform services using AWS Service Catalog, AFT, or custom automation.
- Implement drift detection and automated remediation using Terraform Cloud/Enterprise, Atlantis, or AWS native tools.
- Build runbook automation using AWS Systems Manager, Ansible Automation Platform.
- Enable self‑service provisioning for application teams through templates, catalogs, and automation workflows.
- Use generative AI to accelerate IaC creation, documentation, and operational runbook generation.
- Build centralized logging, metrics, and tracing pipelines using Cloud Watch, Open Telemetry, Prometheus/Grafana, Elastic Stack, or Datadog.
- Implement alerting, incident response workflows, and operational dashboards for platform services.
- Support SRE practices including SLOs/SLIs, error budgets, and blameless incident reviews.
- Implement automated health checks, scaling policies, and resilience patterns for platform workloads.
- Integrate platform services with CI/CD pipelines to ensure consistent deployment and operational readiness.
- Apply AI/ML for log correlation, predictive scaling, and automated incident triage.
- Maintain platform runbooks, operational standards, and architecture decision records.
- Hands‑on experience building and operating AWS infrastructure (networking, compute, storage, IAM, monitoring).
- Strong proficiency with Terraform, Cloud Formation, and Ansible.
- Experience with container platforms (ECS, EKS, or Kubernetes).
- Experience automating infrastructure provisioning and configuration.
- Familiarity with hybrid networking (Direct Connect, VPN, Transit Gateway).
- Experience with centralized logging, monitoring, and observability tooling.
- Understanding of security controls, secrets management, and compliance frameworks.
- Experience supporting application teams through platform services or self‑service tooling.
- Experience with a broad range of AWS services, including Cloud Front, S3, Cloud Map, Data Sync, Cloud Trail, App Mesh, SQS, Guard Duty, AWS Inspector, Route 53, Security Groups, Subnets, Network ACLs, WAF, IAM, and VPC…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×