×
Register Here to Apply for Jobs or Post Jobs. X

Sr. Software Engineer, Systems Infrastructure

Job in Mountain View, Santa Clara County, California, 94039, USA
Listing for: LinkedIn
Full Time position
Listed on 2026-05-26
Job specializations:
  • IT/Tech
    AI Engineer, Machine Learning/ ML Engineer, Systems Engineer, Data Engineer
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below
Position: Sr. Staff Software Engineer, Systems Infrastructure

Sr. Staff Software Engineer, Systems Infrastructure

  • Full-time
  • Workplace Type:
    Hybrid

Linked In is the world’s largest professional network, built to create economic opportunity for every member of the global workforce. Our products help people make powerful connections, discover exciting opportunities, build necessary skills, and gain valuable insights every day.

We’realso committed toprovidingtransformational opportunities for our own employees by investing in their growth. We aspire to create a culture that’sbuilt on trust, care, inclusion, and fun – where everyone can succeed.

Join us to transform the way the world works.

This role will be based in Sunnyvale or Mountain View, CA.

At Linked In, our approach to flexible work is centered on trust and optimized for culture, connection, clarity, and the evolving needs of our business. The work location of this role is hybrid, meaning it will be performed both from home and from a Linked In office on select days, as determined by the business needs of the team.

Join Linked In’s Serving Foundations team and work on one of the most critical layers of our AI platform—powering large-scale model inference across all major AI use cases. This team sits at the center of Linked In’s AI stack and is responsible for making our models faster, more efficient, and more scalable at production scale.

This is a deeply technical, systems-focused position at the intersection of machine learning, compilers, and hardware. You will work across the full stack—from model graphs and optimization techniques to runtime systems, kernels, and GPU execution—to push the limits of performance and efficiency.

You will lead efforts to optimize large-scale inference systems serving billions of requests, driving improvements in latency, throughput, and cost. This includes advancing GPU utilization, designing custom kernel and operator optimizations, improving model efficiency through quantization and compression, and shaping how models are compiled and executed in production environments.

As a Sr. Staff engineer, you will operate with a high degree of autonomy and influence, identifying bottlenecks across the system and driving end-to-end solutions across model, runtime, and infrastructure layers. Your work will directly impact how AI systems perform at Linked In scale and will help define the future of our AI serving platform.

Responsibilities
  • Lead end-to-end optimization of large-scale AI inference systems across model, runtime, and hardware layers
  • Design and implement GPU-efficient solutions, including kernel optimization, operator fusion, and memory optimization
  • Apply model optimization techniques such as quantization, pruning, and mixed precision to improve performance and efficiency
  • Optimize model execution using ML compilers and runtimes (e.g., Tensor

    RT, XLA, TVM, Triton)
  • Build and scale low-latency, high-throughput inference systems for both online and offline workloads
  • Identify and resolve bottlenecks across distributed systems and model serving pipelines
  • Set technical direction and influence best practices for AI performance and efficiency across teams
Basic Qualifications
  • BS/BA in Computer Science or related technical field or equivalent experience
  • 8+ years of experience in software engineering with a focus on systems and performance
  • Experience building or optimizing large-scale production ML systems
  • Experience programming in Python, C++, or similar languages
  • Experience working with distributed systems and large-scale infrastructure
Preferred Qualifications
  • Deep expertise in GPU programming and optimization (CUDA, Triton, or similar)
  • Experience with model optimization techniques such as quantization, pruning, and compression
  • Experience with ML compilers or runtimes such as Tensor

    RT, XLA, TVM, Torch Inductor, or similar
  • Hands-on experience with kernel-level or operator-level optimization
  • Experience building or scaling high-performance inference systems, including LLM serving
  • Understanding of latency, throughput, and cost tradeoffs in production ML systems
  • Background in high-performance computing or hardware-aware optimization
Suggested Skills
  • AI/ML Systems and Infrastructure
  • GPU and Performance…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary