Site Reliability Engineer II; Remote
Job Description & How to Apply Below
Agile Lab is a company founded in 2014 with the mission to create value for its customers in data‑intensive environments through customisable solutions that establish performance‑driven processes, sustainable architectures and automated platforms based on data governance best practices.
Having delivered over 100 successful Elite Data Engineering initiatives, we have used this experience to create Witboost: a modular, technology‑agnostic platform that enables modern organisations to discover, value and produce their data in both traditional environments and fully compliant Data Mesh architectures.
With a highly skilled team of over 260 data engineers based in Europe, Agile Lab helps organisations with their data‑driven transformation.
Take a look at the handbook to discover our core values and processes.
💼
The opportunity:
We are looking for a Site Reliability Engineer II (SRE II) to join our growing team. You will play a key role in maintaining the reliability, observability, and operational efficiency of enterprise‑level distributed systems.
In this role, you’ll coordinate a small technical team (3–4 people) in managing microservices in complex production environments. You will be involved in monitoring, incident management, release coordination, and performance tuning, with a strong focus on Open Shift platforms.
You’ll also work closely with multiple cross‑functional teams to ensure high availability and performance of our cloud‑native services.
This role includes on‑call availability.
💰 RAL:
38.5K-48.5K
💻 Responsibilities:
Ensure high reliability of microservices running in Open Shift environments
Lead and coordinate a technical team of 3–4 engineers for operational excellence
Manage incident resolution and ticketing workflows via Service Now
Collaborate with development teams to drive performance optimization and tuning
Design, configure and maintain monitoring dashboards (Grafana, Prometheus, etc.)
Coordinate with Service Control Room to maintain effective alerting and response
Oversee release processes of new features, hotfixes, and updates in production
🛠️ Requirements:
Degree in Computer Engineering, Computer Science, or a related field
Proven experience in Application Maintenance Services (AMS): minimum 2 years
In‑depth knowledge of Open Shift and microservices in cloud‑native environments
Ability to technically and operationally lead a team of 3–4 people
Experience in release management, monitoring, and incident resolution
Excellent communication and cross‑functional coordination skills
Strong initiative, operational autonomy, and results‑oriented mindset
Fluency in Italian (mandatory requirement)
Monitoring & Observability:
Grafana, Prometheus, Kibana, Jaeger, Datadog, Open Telemetry
Cloud/Dev Ops:
Open Shift, Git Lab, Jenkins
Data & Messaging:
Kafka, Mongo
DB, Ignite
Ticketing & ITSM:
Service Now
🙌🏻 We offer:
Full Remote or hybrid working in our offices:
Milan, Turin, Padua, Bologna, Catania and Rende;
Real work life balance;
Training monthly budget (time and money);
Support of a buddy in the first week of work;
Benefits and corporate welfare programs: company prizes and welcome pack with all the equipment you need to work;
Agile Nomads
Experience:
opportunity to work for 2 weeks abroad;
Referral bonus, if you bring people as talented as you;
The opportunity to attend one conference per year;
A company rated 4.8 out of 5 for employee satisfaction on Glassdoor and certified as a Great Place to Work
Inclusive environment where you can be who you really are;
Stimulating environment oriented to growth, both professional and personal.
😊 How we work:
We don't like hierarchies: we work as a team;
We don't like bureaucracies, we prefer sense of responsibility;
We like data, certainly, so anything that is measurable;
We want to make a positive change in our industry;
Empathy, humility, collaboration, and willingness to challenge ourselves are the basis of our work.
Please note:
Only candidates based in European time zones (CEST or similar) will be considered for this position;
#J-18808-Ljbffr
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×