Software Engineer, HPC Scheduling
Listed on 2026-05-31
Software Development
Software Engineer, Cloud Engineer - Software, DevOps
About the Company
North Mark Compute & Cloud (NMC²) is backed by dedicated leadership and investment, with a clear mission. Its goal is to scale and enhance the high-performance computing (HPC) and cloud infrastructure that supports its clients’ research, production, and delivery, enabling breakthroughs that shape the industries of tomorrow. Its engineers build critical infrastructure to eliminate friction in scientific research, simulations, analysis, and decision-making, accelerating discovery and driving faster innovation.
The Position
The HPC Scheduling team develops and manages a large high-performance compute (HPC) platform to enable the business to conduct complex research. We are seeking a highly motivated person to join our team to help us continue to push the envelope running batch workloads on Kubernetes. The ideal candidate will have an active interest in Kubernetes and batch computing, a broad range of experience with software engineering and development, as well as experience managing large-scale infrastructure and complex tooling environments.
The main focus will be on Armada, an exciting open source CNCF project built and maintained by the team, which we use to solve multi-cluster Kubernetes batch job scheduling. You’ll join an experienced team, working at the cutting edge of ML workloads and at scale.
- Design and develop high-quality software solutions using procedural programming languages, with a focus on Golang.
- Build and maintain highly scalable, highly available and globally distributed systems to support large‑scale research workloads.
- Manage and optimise data interactions across relational and non‑relational databases, particularly PostgreSQL.
- Develop and operate containerised applications within Kubernetes, ensuring effective orchestration and workload scheduling.
- Support, tune and troubleshoot Linux‑based systems as part of our core compute platform.
- Apply core networking knowledge to debug, optimise and enhance platform connectivity and performance.
- Diagnose and resolve complex technical issues across infrastructure and software layers independently.
- Apply solid software architecture principles, computer science fundamentals and data structure knowledge to guide design decisions and code quality.
- Drive continuous improvement by contributing to CI/CD pipelines and engineering best practices.
- Stay up to date with emerging technologies and apply new knowledge across disciplines.
- Experience with developing Kubernetes components, such as controllers and operators.
- Experience with event‑driven programming and message queues, such as Apache Kafka and Pulsar.
- Experience with high-performance computing, Kubernetes, or DAG (Directed Acyclic Graph) workflows.
- Experience running systems at scale using a cloud provider, ideally AWS.
- Use of operational and runtime tools and practices, including monitoring and logging with systems such as Prometheus and Grafana.
- Experience operating or using job scheduling systems, such as SLURM.
- Must be legally authorized to work in the United States without the need for employer sponsorship, now or at any time in the future.
- Company‑Paid Lunch Stipend: Lunch is provided via Grub Hub.
- Employer‑Paid Medical in a High Deductible Health Plan, Dental and Vision benefits for employees and families.
- 16 weeks of Paid Parental Leave.
- Employee Assistance Program.
- Life Insurance, Short‑Term Disability and Long‑Term Disability.
- 401(k): Company will match 100% of your contributions up to 6%.
- Optional Employee‑Paid Benefits: Medical insurance in our PPO plan and a variety of other benefits such as Health Savings Accounts (with Company Contribution!), Flexible Spending Accounts, Supplemental Life Insurance, Wellhub, etc.
- Time Off: 25 days of Paid Time Off plus 12 company holidays.
North Mark is an equal employment opportunity employer. The company's policy is not to discriminate against any applicant or employee based on race, color, religion, national origin, gender, age, sexual orientation, gender identity or expression, marital status, mental or physical disability, or genetic information, or any other basis protected by applicable law. The firm also prohibits harassment of applicants or employees based on any of these protected categories.