Senior Kubernetes Platform Engineer
Listed on 2026-02-07
IT/Tech
Cloud Computing, Systems Engineer, SRE/Site Reliability, IT Project Manager
Overview
WHO WE ARE
Apex Fintech Solutions (Apex) powers innovation and the future of digital wealth management by building tech-forward solutions that help simplify, automate, and facilitate access to financial markets for all. Our robust suite of fintech software enables us to support clients such as Stash, Betterment, SoFi, Webull, and eToro, among many others; collectively, Apex powers access to the stock market for more than 22 million end customers.
At Apex, we are changing how the securities industry operates by reinventing the status quo, which was manual, slow, and accessible only by the ultra-wealthy. We're digitizing and democratizing systems so that everyone has an opportunity to invest.
When you're at Apex, you drive this change. You're part of a global team with a clear vision: to be the trusted technology that powers the digital economy. Our offices in Austin, Dallas, Chicago, New York, Portland, Belfast, and Manila are home to over 1,000 employees.
Together, we're shaping the future of financial innovation. Embrace change. Solve big. Win together. And be G.R.E.A.T. (grit, results, empathy, accountability, and teamwork) with Apex.
We're proud to be recognized for the innovative work we do, the purpose-driven nature of our work, and the collaborative culture we've created. Here are just a few of the many awards we've recently received:
Best Places to Work
2026, 2025, 2024, 2023
Presented by Built In
Wealth Tech of the Year
2025
Presented by US FinTech Awards
The World's Top 250 Fintech Companies
2024
Presented by CNBC
As a Senior Site Reliability Engineer, you'll play a pivotal role in our platform organization, driving the full lifecycle management of Kubernetes clusters, from design and deployment to maintenance and continuous improvement. You'll focus on ensuring the reliability, security, and performance of our Kubernetes environments through automation and best practices. You'll also collaborate closely with our Developer Experience and Release Engineering teams to deliver a reliable continuous delivery system for our application teams.
Tooling Development: Build and enhance automation tools and libraries for generating Kubernetes manifests, eliminating manual errors and improving deployment efficiency.
Cluster Management: Deploy, configure, and maintain Kubernetes clusters and supporting infrastructure, ensuring high availability, security, and performance.
Monitoring & Troubleshooting: Set up and manage monitoring and alerting systems (Datadog), proactively identify issues, and resolve incidents quickly.
Security & Compliance: Implement best practices, conduct regular audits, and ensure compliance with relevant industry standards and regulations.
Documentation: Maintain clear and comprehensive documentation of configurations, procedures, and best practices, and provide training to other teams on platform tools, usage, and best practices.
Collaboration: Work closely with application teams to streamline and support Kubernetes-based deployments, and partner with other platform teams to solve infrastructure challenges and develop robust solutions.
Continuous Improvement: Stay current with cloud-native technologies; proactively improve existing code, processes, and systems, and advocate for enhancements to processes and platform architecture.
On-Call Support: Participate in an on-call rotation to respond to and resolve production incidents, maintaining system reliability and minimizing downtime.
Bachelor's degree in Computer Science or Information Technology (or equivalent work experience) required
5+ years of software development experience (Go, Java, Python, etc.)
2+ years of hands-on Kubernetes experience (GKE, EKS, RKE, etc.)
Experience with Infrastructure as Code (IaC) tools and concepts (Terraform, CloudFormation, Pulumi)
Experience with CI/CD tools and pipelines (GitHub Actions, ArgoCD, FluxCD), and GitOps practices
Proficient in deploying and managing Kubernetes clusters on cloud platforms (Google Cloud, AWS, Azure) and on-premises environments
Solid understanding of container technologies (Docker,…