Kafka DevOps Engineer
Job in
San Francisco, San Francisco County, California, 94199, USA
Listed on 2026-06-29
Listing for:
EITACIES Inc.
Full Time
position Listed on 2026-06-29
Job specializations:
-
IT/Tech
SRE/Site Reliability, Cloud Computing: Infrastructure & Operations, AWS
Job Description & How to Apply Below
Benefits
- Company parties
- Health insurance
- Opportunity for advancement
- 8+ years of overall IT industry experience.
- 5+ years of hands‑on experience with Kafka or No
SQL technologies. - Strong programming skills in Python and/or Java, with a focus on automation and tooling.
- Experience with CI/CD pipelines and Infrastructure as Code (IaC) tools such as Git, Cloud Formation, and Terraform.
- Experience with at least one cloud platform: AWS, Azure, or Kubernetes‑based environments.
- Experience building AI‑powered solutions, MCP Servers, Agentic AI systems, or GenAI‑based automation tools.
- Strong Linux/Unix administration and troubleshooting experience.
- Excellent analytical, debugging, problem‑solving, verbal, and written communication skills.
- Experience with Dev Ops and Site Reliability Engineering (SRE) practices.
- Strong production support, incident management, issue triaging, and root cause analysis experience.
- Experience with Docker and Kubernetes administration, deployment, and performance tuning.
- Knowledge of security best practices, vulnerability management, CVE analysis, and monitoring cloud/system/device logs.
- Experience designing self‑service platforms and operational automation solutions.
- Build, manage, and support Kafka and No
SQL platforms in production environments. - Design, implement, and maintain scalable platform architectures and deployment solutions.
- Develop and maintain automation tools for infrastructure provisioning, monitoring, and operational workflows.
- Integrate AI/GenAI capabilities into operational tools and platform management processes.
- Design and implement CI/CD pipelines and Infrastructure as Code solutions.
- Execute and manage code deployments across development, testing, staging, and production environments.
- Troubleshoot and resolve platform, infrastructure, and application issues across all environments.
- Monitor system performance, reliability, availability, and security, and drive continuous improvements.
- Collaborate with development, operations, and architecture teams to improve platform efficiency and developer productivity.
- Drive operational excellence through automation, observability, reliability engineering, and proactive issue resolution.
A self‑driven engineer who can take ownership of complex platform challenges, build innovative automation and AI‑driven solutions, communicate effectively with stakeholders, and operate with minimal supervision in a fast‑paced production environment.
Required Skills- Amazon Web Services (AWS): 2‑5 Years
- Amazon Web Services S3 (AWS S3): 2‑5 Years
- Amazon Web Services EKS (AWS EKS): 2‑5 Years
- Apache Kafka: 2‑5 Years
- Artificial Intelligence:
At least 1 year - AWS‑EC2: 2‑5 Years
- Git Hub: 2‑5 Years
- Kubernetes: 2‑5 Years
- No
SQL: 5‑10 Years - Python: 5‑10 Years
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×