Senior Infrastructure Engineer
Listed on 2026-05-27
-
IT/Tech
Systems Engineer, Cloud Computing, SRE/Site Reliability
Gradial helps marketers and creatives move from idea to execution faster. Our platform turns intent into action, automating website updates, design system migrations, and ongoing content optimization while preserving brand integrity across every touchpoint.
Backed by leading investors, we’re building software that adapts to the user, not the other way around. We move with urgency, operate with ownership, and solve hard problems from first principles. If you want to do ambitious work, take real responsibility, and help define the future of AI‑native content operations, you’ll do your best work here.
The RoleAs a Senior Infrastructure Engineer at Gradial, you will architect and evolve the systems that power our AI‑driven content operations platform. This role is ideal for someone who thrives in startup‑to‑scale‑up environments and brings a deep understanding of how to make infrastructure reliable, secure, and scalable.
You’ll play a critical role in building on our core systems, supporting rapid product iteration and ensuring the platform is built for growth. If you’ve owned infrastructure in production, guided system evolution, and want to shape the future AI, we’d love to meet you.
What You’ll Own- Design and maintain scalable, secure, and resilient infrastructure to support Gradial’s AI platform.
- Lead Kubernetes cluster management, CI/CD pipelines, observability tooling, and infrastructure‑as‑code efforts.
- Anticipate scaling needs and proactively evolve infrastructure architecture to support growth and reliability.
- Take full ownership of real‑time, compute‑intensive services: designing, deploying and maintaining to meet high performance standards with minimal oversight.
- Establish and enforce best practices for system reliability, performance monitoring, and disaster recovery.
- Evaluate and implement infrastructure automation tools to improve deployment velocity and reduce operational burden.
- Act as a strategic voice on infrastructure investment, technical debt management, and long‑term scalability planning.
- 5+ years of experience in Dev Ops, SRE or platform engineering roles.
- Proven track record designing and operating large‑scale, production‑grade infrastructure.
- Deep expertise in Kubernetes, cloud‑native architecture, and container orchestration.
- Proficiency with infrastructure‑as‑code (e.g., Terraform, Git Ops), CI/CD tooling, and monitoring stacks (e.g., Prometheus, Grafana).
- Experience in high‑growth environments, especially scaling infrastructure from early product‑market fit to maturity.
- Strong communication skills and a collaborative, ownership‑driven mindset.
- Familiarity with AI/ML infrastructure, including GPU provisioning and model deployment.
- Prior experience supporting cloud or multi‑cloud architectures.
- Comfort with Type Script or Python to support tooling and operational scripts.
The salary range for this position is $130,000 – $200,000 annually
. Final compensation will be determined based on factors such as experience, skills, and qualifications. In addition to base salary, this role may be eligible for performance‑based bonuses and equity awards. Gradial offers a comprehensive benefits package, including medical, dental & vision insurance, 401K retirement plan, paid time off, paid sick leave and other employee wellness programs.
- Embrace AI as a core tool for problem‑solving, creativity and scale.
- Show a strong work ethic, high ownership and bias toward action.
- Communicate with clarity and curiosity.
- Thrive in fast‑paced, hyper‑growth environments; where building is always better than maintaining the status quo.
- Meaningful equity and competitive salary
- Comprehensive health, dental and vision coverage
- Fast‑paced environment with autonomy and ownership
- Real impact, zero bureaucracy
- A front‑row seat to building category‑defining AI infrastructure
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).