Senior Platform Engineer
Job in
Conley, Clayton County, Georgia, 30027, USA
Listed on 2026-06-30
Listing for:
Cox Automotive
Full Time
position Listed on 2026-06-30
Job specializations:
-
IT/Tech
Job Description & How to Apply Below
Cox Automotive's Engineering & Technology organization is building a centralized Enterprise AI Integration Platform - the governed infrastructure layer that allows every team in the organization to connect AI agents (Claude, Copilot Studio, Claude Code) to enterprise data sources in a secure, observable, and self-service way. This Senior Platform Engineer will own the technical implementation of that platform: the central AI gateway server, the on-demand connector provisioning engine, the identity-driven session layer, and the observability stack that gives the organization full visibility into every AI tool call made against production data.
This is one of a small number of roles at Cox Automotive working at the intersection of enterprise infrastructure engineering and Model Context Protocol (MCP) - the protocol rapidly becoming the standard interface between AI agents and enterprise systems. The work is novel, the surface area is broad, and the organizational impact is significant.
What You'll Do:
- Design, implement, and optimize the central AI Gateway MCP server - the single governed endpoint through which all AI client connections route, built on FastAPI + uvicorn for high-concurrency enterprise workloads
- Build and maintain the Redis Elasti Cache session layer that binds Microsoft Entra identity to role-resolved MCP tool sets, including token lifecycle management, sliding TTL extension, per-user quota enforcement, and distributed rate limiting
- Implement the on-demand connector provisioning engine - a system that provisions compute containers with enterprise client drivers, establishes VPC-internal network paths, and retrieves credentials from AWS Secrets Manager automatically when a user's AI agent declares intent to access a data source
- Build enterprise system connectors as MCP tool sets:
Oracle DB, SharePoint Graph API, Rally, Service Now, and a vendor connector approval pipeline with ECR container image scanning and an Aurora-backed connector registry - Implement comprehensive automated testing: unit, integration, load testing (1,000+ concurrent users), and chaos testing for connector fault tolerance
- Build and maintain the full observability stack: structured logging, Prometheus metrics, Kinesis Firehose Open Search indexing, and Grafana dashboards for per-user, per-tool, per-session audit trails
- Design and implement CI/CD pipelines for all platform components via Git Hub Actions, with automated container image builds, ECS task definition updates, and blue/green deployments
- Own security controls:
Entra OIDC token validation, PII masking on all tool responses, WAF rule management, Secrets Manager integration with autorotation, and OWASP-aligned secure API design - Maintain and extend the existing Snowflake MCP codebase that forms the foundation of the platform, including session management, RBAC, PII masking, configuration management, and secrets integration modules
- Develop troubleshooting and diagnostic tools for production support
- Create documentation, runbooks, and operational playbooks for platform support and maintenance
Minimum Requirements:
- Bachelor's degree in a related discipline and 4 years' experience in a related field. The right candidate could also have a different combination, such as a master's degree and 2 years' experience; a Ph.D. and up to 1 year of experience; or 16 years' experience in a related field
- Python development (4+ years) with advanced async/await patterns, FastAPI, multiprocessing, and production performance optimization
- Model Context Protocol (MCP) - hands-on implementation experience with MCP servers, tool definitions, and client integration patterns; ability to read and extend the protocol specification independently
- AWS platform depth: ECS Fargate task lifecycle management, Elasti Cache Redis (TLS, clustering, eviction policies), Secrets Manager, Route 53, ALB (sticky sessions, TLS termination), ECR, Aurora Postgres, SSM Parameter Store, Cloud Watch, Kinesis Firehose
- Microsoft Entra / Azure AD integration: OIDC federation, group membership extraction via Graph API or JWT claims, RBAC pattern implementation
- Database integration and optimization:
Oracle, PostgreSQL, Snowflake, SQL Server - including connection pooling, query optimization, and schema introspection - Distributed systems patterns: circuit breakers, retry with exponential backoff, bulkhead isolation, Redis-backed distributed state, graceful degradation
- Container platform:
Docker multi-stage builds, ECS task definitions, non-root container security, health endpoint implementation - REST API security: JWT validation, rate limiting, input validation, PII detection and masking
- Observability: structured JSON logging, Prometheus client instrumentation, distributed tracing concepts, Cloud Watch Logs Insights
- Version control and CI/CD (Git, Git Hub Actions, automated testing pipelines)
- High-concurrency MCP server development with proven experience supporting enterprise-scale concurrent sessions
- Snowflake advanced…
Position Requirements
10+ Years
work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×