×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineer; req

Job in Honolulu, Honolulu County, Hawaii, 96814, USA
Listing for: CATHEXIS
Full Time position
Listed on 2025-12-05
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer
Job Description & How to Apply Below
Position: Site Reliability Engineer (req-191)

Team CATHEXIS elevates the government contracting experience through rapid response, deep skill, and thoughtful problem-solving and communication. Our core capabilities are our top-tier program and project management, data analytics, and audit services, the backbone of which is our integrated approach to operational excellence.

You worked hard to get to where you are. You strive to make every day better than the day before. So do we. Team CATHEXIS operates with an all-in mindset. We are working together to create a company that supports our shared values and individual goals. Our values are centered around Respect, Engagement, Customer Service, Integrity, Teamwork, and Excellence in everything we do for our employees, clients, partners, and communities.

We believe success is best when we listen and lead with empathy; model high standards of ethics to provide a rewarding candidate experience; work hard, have fun, and appreciate the strengths we all bring to the team; and empower our employees to create innovative and trusted results.

We are looking for a dynamic Site Reliability Engineer (SRE) to join our team at Joint Base Pearl Harbor-Hickam. The Site Reliability Engineer (SRE) will manage, monitor, and optimize clusters on Kubernetes. Together, we’re accelerating our clients’ digital transformation through the building and deployment of data-driven, scalable AI solutions. The ideal candidate will have a deep understanding of Kubernetes, Cloud Infrastructure, and Infrastructure as Code (IaC) practices.

You will be responsible for ensuring the reliability and scalability of our clients’ Kubernetes clusters and Cloud Infrastructure.

Responsibilities
  • Monitor and Manage Kubernetes Clusters:
    Ensure the stability, health, and scalability of Kubernetes Clusters, deploying applications and services on Kubernetes
  • Kubernetes Management:
    Deploy, monitor, and scale applications on Kubernetes clusters. Maintain Helm charts, manage services, and ensure resource allocation for optimal cluster performance
  • Containerization & Deployment:
    Design and maintain Docker-based microservices architecture, ensuring consistent and reproducible deployments across staging, QA, and production environments
  • Cloud Infrastructure Management:
    Work with leading Cloud Platforms (AWS, Azure and/or GCP) to set up, configure, and manage infrastructure resources using Infrastructure as Code (Terraform, Cloud Formation, etc.)
  • Monitoring & Incident Response:
    Set up monitoring solutions, define alerts, and manage the incident response process for any issues related to Jenkins or Kubernetes clusters
  • Automate Infrastructure Processes:
    Build automation tools for scaling, monitoring, and maintaining infrastructure using modern tools like Terraform, Ansible, Linux, or equivalent
  • Collaborate Across Teams:
    Work closely with development, services, and operations teams to ensure a seamless integration between application development, deployment, and infrastructure
  • Security & Compliance:
    Ensure all systems follow best practices in terms of security and compliance with relevant regulations. This includes role-based access, encryption, and automated vulnerability scanning
Requirements
  • Active SECRET clearance or higher is required
  • Bachelor’s degree (or equivalent) in computer science or related discipline
  • A minimum of two (2) years of experience working with on-premise and off-premise cloud environments
  • Experience with AWS and/or Azure
  • Hands-on experience with a range of open-source technologies, such as Linux, Docker, Kubernetes, K8s, Terraform, Helm, Postgre

    SQL, or similar technologies
  • Ability to program (structured and OOP) using one OR more high-level languages, such as Python, Java, C/C++, Ruby, and Java Script
  • Experience with distributed storage technologies such as NFS, HDFS, Ceph, and Amazon S3, as well as dynamic resource management frameworks (Apache Mesos, Kubernetes, Yarn)
  • Proactive approach to identifying problems, performance bottlenecks, and areas for improvement
  • Ability to lead and work independently in an Agile/Scrum environment
  • Real passion for developing team-oriented solutions to complex engineering problems
  • Ability to thrive in an autonomous,…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary