More jobs:
Job Description & How to Apply Below
Location:
Bangalore
Experience:
3–8 Years
Employment Type:
Full-Time
About the Role We are seeking a highly motivated Production Support Engineer (PSE) to support and maintain large-scale production systems and data platforms. The ideal candidate will have strong debugging and troubleshooting skills, experience handling production incidents, and the ability to work effectively in high-pressure environments.
This role involves monitoring critical data pipelines, managing L1/L2 production support activities, performing root cause analysis, and collaborating with engineering and infrastructure teams to ensure system reliability and operational excellence.
Key Responsibilities
Pipeline Operations & Monitoring Monitor, maintain, and support large-scale, high-throughput data streaming pipelines.
Ensure smooth execution and availability of production data workflows.
Proactively identify and address operational issues before they impact business operations.
L1/L2 Technical Support & Incident Management Act as the primary point of contact for production support issues related to data platforms and pipelines.
Acknowledge, triage, troubleshoot, and resolve production incidents within defined SLAs.
Coordinate with engineering, infrastructure, and cross-functional teams to resolve complex production issues.
Participate in incident response, escalation management, and service restoration activities.
Debugging & Root Cause Analysis Analyze application and system logs to identify and troubleshoot issues.
Debug Java/Scala-based applications and investigate production failures.
Perform Root Cause Analysis (RCA) for incidents and outages.
Implement corrective actions, workarounds, and preventive measures to minimize recurring issues.
Operational Excellence Follow established SOPs and incident management processes.
Contribute to operational automation using scripting tools.
Drive continuous improvements in monitoring, alerting, and support processes.
Required Qualifications & Skills Technical Skills 3–8 years of experience in Production Support, Platform Operations, Site Reliability Engineering (SRE), Dev Ops, or related roles.
Strong experience supporting large-scale production systems and data platforms.
Hands-on experience with big data frameworks such as Spark, Flink, and YARN .
Strong debugging and troubleshooting skills in Java or Scala applications.
Proficiency in Linux/Unix environments.
Experience with Shell Scripting and Python for operational automation.
Strong understanding of production monitoring, incident management, and root cause analysis.
Soft Skills Excellent analytical and problem-solving abilities.
Strong communication and stakeholder management skills.
Ability to work effectively in high-pressure, production-critical environments.
Customer-focused mindset with strong ownership and accountability.
Preferred Skills
Experience with Google Cloud Platform (GCP) .
Familiarity with Docker and Kubernetes .
Experience with monitoring and alerting tools such as Prometheus and Grafana .
Prior experience supporting large-scale consumer-facing or e-commerce platforms.
Exposure to distributed systems and high-volume data processing environments.
What We're Looking For Strong production debugging and troubleshooting expertise.
Hands-on experience managing production incidents and performing RCA.
Ability to debug Java applications and Linux systems effectively.
Experience working with large-scale data processing ecosystems.
Excellent communication skills and ability to collaborate across teams.
Preferred Notice Period Immediate Joiners or candidates serving up to 15 days' notice period are highly preferred.
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×