Member of Technical Staff, ML Systems

Job in Cambridge, Middlesex County, Massachusetts, 02140, USA
Listing for: Netpreme
Full Time position
Listed on 2026-01-01
Job specializations:
  • IT/Tech
    AI Engineer, Systems Engineer
Job Description & How to Apply Below

The Role

We’re looking for a motivated LLM Systems Engineer willing to explore new and unconventional inference systems based on emerging hardware. This role is part engineering, part research: you’ll be responsible for finding and prototyping algorithms suited to our inference hardware, as well as guiding our hardware team on product definition. The ideal candidate has a proven track record in ML systems research and is very familiar with industry-standard LLM inference systems.

This role will be performed on‑site from one of our offices in Santa Clara, CA or Boston, MA.

Essential Duties And Responsibilities
  • Prototype and optimize emerging ML inference systems.
  • Develop novel memory models for expandable vRAM.
  • Write efficient GPU kernels for data movement.
  • Perform design‑space exploration, implementation, and benchmarking of inference engines, both in simulations and on real hardware.
Qualifications
  • MS or PhD in computer systems, ideally with a focus on LLM inference and/or distributed systems.
  • Prior experience contributing to core LLM inference infrastructure (vLLM, SGLang, TensorRT, etc.).
  • Prior experience in accelerator programming (e.g. CUDA, JAX/Pallas, ROCm).
  • Advanced computer architecture and performance engineering skills are a big plus.
Compensation & Benefits
  • Competitive compensation commensurate with experience, including base salary, incentive‑based bonus, and early‑stage equity grant.
  • Comprehensive benefits including health, dental, vision, and life insurance.
  • Well‑equipped, sunny offices in Santa Clara, CA and Boston, MA.
  • Relocation assistance and visa sponsorship.
  • Perks include a daily lunch stipend, 401k match, and more.
  • A collaborative, continuous‑learning work environment with smart, dedicated colleagues developing the next generation of architecture for high‑performance computing.
The Opportunity
  • Impact:
    We are tackling a fundamental challenge at the infrastructure layer: unlocking greater AI capability while dramatically improving efficiency. The work we do here compounds across state‑of‑the‑art AI models, systems, and real‑world applications.
  • Timing:
    Joining now means real ownership of the company and meaningful influence over product direction and execution. You’ll work from first principles, move quickly from insight to execution, and see your contributions directly reflected in what we build.
  • Culture:
    You’ll work alongside a group of people who care deeply about rigor, clarity, and impact. We value thoughtful disagreement, fast learning, and intellectual fearlessness. This is a place where strong ideas shine, curiosity is encouraged, and growth is a daily practice.
Seniority Level

Mid-Senior level

Employment Type

Full‑time

Job Function

Engineering and Information Technology

Industries

Computer Networking Products
