Join to apply for the Platform Reliability Engineer role at J&M Group
Job Description
Technical Skills
- Cloud Platforms: Azure, AWS
- Infrastructure as Code (IaC): Terraform, ARM templates, CloudFormation
- Scripting Languages: Python, PowerShell, Bash
- Monitoring & Observability: Azure Monitor, Log Analytics, Prometheus
- CI/CD Tools: Azure DevOps, GitHub Actions
- Platform Services: Compute, Storage, Networking, Data Plane Infrastructure
- Security & Compliance: Access control models, cloud security practices
- Platform Governance: Unity Catalog (nice to have)
- Operational Excellence: SRE principles, SLOs, SLIs
- Automation & Cost Optimization: Platform automation, cost reduction strategies
- Effective communication and cross-team collaboration
- Strong problem-solving and analytical mindset
- Proactive, independent, and team-oriented work style
- Attention to detail
- 3-6 years in platform engineering, SRE, or infrastructure roles
- Bachelor's degree in Computer Science, IT, or related field
- Experience in agile or iterative development environments
- Certifications (nice to have): Azure Administrator, Azure DevOps Engineer, AWS Solutions Architect
Job Summary
- We are looking for a skilled and motivated Platform Reliability Engineer to support and optimize our platform services. This role bridges the gap between infrastructure services and the platform capabilities required by development and operations teams. The engineer will contribute to automation, reliability, cost optimization, and service excellence of core platform components hosted in the cloud (Azure/AWS). This is a hands-on technical role with a focus on enabling reliable, secure, and scalable platform foundations for enterprise-scale workloads.
- Support the design and implementation of core platform services that enable development teams to build, deploy, and operate applications reliably.
- Develop Infrastructure as Code (IaC) templates and scripts using tools like Terraform or ARM to automate provisioning and configuration.
- Monitor and maintain platform services including compute, storage, networking, and data plane infrastructure for scalability and performance.
- Collaborate with development, cloud engineering, and security teams to ensure platform alignment with architectural standards and security requirements.
- Implement observability practices using tools for monitoring, logging, and alerting to support performance tuning and incident detection.
- Troubleshoot platform-related incidents, perform root cause analysis, and document findings for continuous improvement.
- Participate in deployment activities, ensuring proper controls and validations are in place when promoting workloads to production.
- Support optimization initiatives to reduce costs across services such as compute, storage, Synapse, and platform integration tools.
- Contribute to ongoing platform modernization efforts, including migration from legacy configurations to unified governance models such as Unity Catalog.
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- 3-6 years of experience in platform engineering, SRE, or related infrastructure roles.
- Practical experience with Azure or AWS cloud services, particularly related to infrastructure and platform-level resource management.
- Proficiency in Infrastructure as Code (IaC) tools such as Terraform, ARM templates, or CloudFormation.
- Hands-on experience with monitoring and observability solutions (e.g., Azure Monitor, Log Analytics, Prometheus).
- Familiarity with CI/CD pipelines and release processes (e.g., Azure DevOps, GitHub Actions).
- Strong scripting skills (Python, PowerShell, or Bash) to automate tasks and workflows.
- Understanding of access control models, security practices, and compliance in cloud platforms.
- Familiarity with SRE principles and operational excellence metrics (SLOs, SLIs).
- Experience working in agile or iterative environments.
- Effective communicator with the ability to coordinate across platform, security, cloud, and development teams.
- Strong problem-solving mindset with attention to detail.
- Proactive and collaborative team player, able to work independently and drive issues to resolution.
- Exposure to Unity Catalog or similar data governance tooling in the context of platform services.
- Experience supporting platform migrations or re-architecture projects.
- Certification in Azure Administrator, Azure DevOps Engineer, or AWS Solutions Architect.
- This role is ideal for someone with a strong technical foundation who is ready to take on ownership of platform-level…