Job Description & How to Apply Below
This role is ideal for professionals who don't just patch issues, but thrive on troubleshooting complex distributed system architectures, analyzing deep application logs, and collaborating closely with Technical Architects and Core Engineering to ensure permanent system reliability, scalability, and performance.
The ideal candidate must possess advanced code-level debugging skills, a proactive site-reliability mindset, and a willingness to work in a 24/7 rotational shift environment, including night shifts.
What We Are Looking For:
4–6 years of experience in Application/Production Support or Development with a strong lean toward production engineering.
Bachelor’s degree in Computer Science, IT, or equivalent practical software engineering experience.
Sound knowledge of Design patterns and SOLID Design Principles to ensure code-level patches and fixes are robust and scalable.
Excellent debugging, root-cause analysis (RCA), and incident-handling capabilities.
The ability to "over-communicate" with exceptional articulation and stakeholder management skills during high-pressure outages.
Strong curiosity and a passion for automation, with a habit of looking for modern ways to solve legacy system problems.
Strong ownership mindset, attention to detail, and ability to thrive in a fast-paced environment.
Willingness to work in rotational/night shifts to ensure continuous production uptime.
Key Responsibilities:
Advanced Engineering & Code-Level Resolution
Directly implement code-level enhancements, bug fixes, and stability patches to address underlying issues in Java-based microservices applications.
Go beyond surface-level fixes to analyze complex system behaviors, collaborating with Technical Architects to evaluate deep-tech infrastructure flaws contributing to recurring outages.
Collaborate with development and architecture teams to design more scalable, data-driven analytical solutions based on production-floor insights.
Incident Management & Production Support
Monitor and support production applications and services to troubleshoot and resolve production issues within defined SLAs.
Analyze application logs and debug Java-based microservices applications.
Work comprehensively on incident management, problem management, and service requests.
Escalate critical production issues backed by detailed technical analysis.
Participate in deployment validations and production monitoring activities.
Ensure timely communication and transparent status updates during critical incidents.
Technical Skills Required:
Strong experience in Java/J2EE and debugging Java-based microservices.
Extensive experience in analyzing logs, debugging complex system behaviors, and troubleshooting production issues.
Hands-on experience with production monitoring and observability tools (e.g., Datadog or similar) and ticketing systems.
Strong knowledge of SQL and database querying; familiarity with large-scale data systems (e.g., MongoDB, Elasticsearch) and distributed caching (e.g., Redis).
Good understanding of application deployment, release processes, and cloud environments (AWS).
Position Requirements
10+ years of work experience