SRE DevOps Engineer
Job in Sunnyvale, Santa Clara County, California, 94087, USA
Listed on 2026-01-10
Listing for:
Redolent Infotech Pvt. Ltd.
Full Time position
Job specializations:
- IT/Tech
- Cloud Computing, Data Engineer
Job Description & How to Apply Below
Overview
One of our direct clients is urgently looking for an SRE DevOps Engineer in Sunnyvale, CA.
TITLE: SRE DevOps Engineer
LOCATION: Sunnyvale, CA
Duration: 6 to 12+ Months
Rate: DOE
Key Skills
- Splunk
- Grafana
- SRE
- Cloud
- DevOps
- Azure
- Docker
- Kubernetes
- Java (Basic)
- Python (Scripting)
You’ll sweep us off our feet if…
- Able to dig into issues on our eCommerce site and identify root causes, with experience supporting and triaging production incidents
- Subject matter expert in creating dashboards, alerting, and monitoring
- Experience with Application development and root cause analysis
- Develops innovation strategies, processes, and automation; failover experience
- Drives the execution of multiple business plans and projects
- Experience in driving high availability across multiple organizations
- Experience in putting together architecture diagrams
- Experience in managing workloads in private and public data centers
- Infrastructure experience that involves setup, scale, and decommissioning
- Prior cloud experience, planning and driving efficiencies
- Automation and CI/CD experience
- Application container experience using Kubernetes
- Experience with event streaming platforms like Kafka is a plus
- Experience with analytics and monitoring platforms like Grafana, Graphite, MMS, or Splunk is a plus
- Supporting Java full-stack backend application components in a massively scalable, high-performance, multi-tenant, international eCommerce platform with multiple microservices deployed in a cloud environment, and root-causing every reactive or proactive production issue
- Leads and participates in medium- to large-scale, complex, cross-functional projects
- Partners with architects and development leads to produce high-level designs that accelerate the customer experience, recommending out-of-the-box engineering best practices
- Proactively identifies areas to drive automation, speed, and innovation
- Troubleshoots business and production issues by gathering information, performing root cause analysis, engaging support teams, developing solutions, and documenting actions
- Provides support to the business by responding to user questions, concerns, and issues, researching and identifying needed solutions, determining implementation designs, and guiding on implications of new systems
- Assists in providing guidance to small groups of engineers for assigned projects with pertinent documents, directions, examples, and timelines
- Demonstrates up-to-date expertise and applies it to action plans, meeting customer and business needs
- Models compliance with company policies and supports ethics and integrity in business processes
- Provides and supports the implementation of business solutions by building relationships with stakeholders and monitoring progress
- Hands-on experience debugging HTTP 5xx and 4xx errors
- Java/Spring and Node/Python experience is required
- Creating database objects (tables, views, indexes)
- CI/CD automation and implementation experience
- Kubernetes and Docker experience is a plus
- Implementing database structures such as tables and indexes
- Reviewing and tuning SQL scripts
- Reviewing database structure changes provided by application developers and data modelers
- Working with application developers to tune the performance of the database
- Experience creating best-in-class application availability metrics and dashboards
- Managing infrastructure scale, setup and decommissioning
- Experience with public cloud (Azure, GCP) and private data centers
- Driving P1 production incident calls, communicating to the point, summarizing action plans for each owner, and following up until closure
- Ability to make the right prioritization decisions and drive operational excellence with innovative ideas, without much guidance or supervision
- Ability to build and run tools necessary for operational success
- Documenting SOPs for repetitive issues and building knowledge base articles for the team’s benefit