×
Register Here to Apply for Jobs or Post Jobs. X

Software Engineer, ML Inference

Job in Bellevue, King County, Washington, 98009, USA
Listing for: Cognitiv
Full Time position
Listed on 2025-12-08
Job specializations:
  • IT/Tech
    AI Engineer, Machine Learning/ ML Engineer
Salary/Wage Range or Industry Benchmark: 200000 - 270000 USD Yearly USD 200000.00 270000.00 YEAR
Job Description & How to Apply Below
Position: Staff Software Engineer, ML Inference

Staff Software Engineer, ML Inference

Talent Acquisition @ Cognitiv - Join Us to Build the Future of AI-Powered Advertising 🚀

Are you ready to revolutionize the advertising industry?

At Cognitiv, we are not just another AdTech company—we are industry trailblazers redefining media buying with our Deep Learning Advertising Platform. Since 2015, we have harnessed the power of cutting‑edge deep learning technology and data science to transform how brands connect with their customers. Our mission? To bring intelligence to advertising and deliver unparalleled precision, relevance, and impact at scale.

With our innovative platform, advertisers enjoy unprecedented flexibility—whether it is activating Dynamic Deals through their preferred DSP, leveraging our managed service DSP, or utilizing our industry‑first Context

GPT product. As a part of Cognitiv, you will be at the forefront of AI‑driven advertising solutions, driving change and achieving remarkable growth in a rapidly evolving industry.

Now, we’re growing!

Location:

Hybrid MTW in Bellevue.

The Role

We are searching for one of the absolute best ML inference engineers in the industry—someone excited to architect and scale a cutting‑edge inference system that becomes the backbone of Cognitiv’s ML‑driven products.

In this role, you will define what inference means to Cognitiv and lead the cross‑organizational effort to bring that vision to life. You’ll build performance‑critical systems powering real‑time decision‑making for some of the world’s biggest brands, while helping shape the future of AI in AdTech.

This role is foundational. It is high‑impact. And it is a rare opportunity to build both the system and the team around one of the most strategic technical pillars in the company.

What You’ll Do
  • Build and Optimize Inference Systems: Implement and optimize large‑scale ML inference systems using both industry‑standard frameworks and in‑house technologies.
  • Lead Cross‑Team Technical Initiatives: Drive major organization‑wide technical programs that advance Cognitiv’s ML inference capabilities.
  • Evaluate and Advance ML Breakthroughs: Identify emerging ML inference technologies and partner with Product to build business cases for new capabilities.
  • Deliver Production‑Grade ML Solutions: Collaborate with Engineering, Research, and Product to design and integrate high‑performing ML solutions into production systems.
  • Raise the Engineering Bar: Mentor engineers through code reviews, design reviews, and pair programming to elevate technical quality.
  • Set Engineering Standards: Define and automate best‑in‑class standards for coding, testing, observability, and security across inference systems.
  • Own the Full Development Lifecycle: Take end‑to‑end ownership of services including planning, design, execution, testing, and release.
  • PyTorch / Lib Torch
  • C++17 or later
  • Managed languages: C#, Java
  • Cloud: AWS, GCP, or Azure
Who You Are
  • Expert in PyTorch/Lib Torch: 4+ years of experience with modern PyTorch/Lib Torch and awareness of the latest ecosystem innovations.
  • Skilled in Neural Network Optimization: 4+ years optimizing models through quantization, parallelism, tiling, and related techniques.
  • Strong C++ Engineer: 4+ years programming in C++17 or later, with deep knowledge of performance and memory considerations.
  • Clear, Influential Communicator: Able to shape organization‑wide technical narratives and drive alignment across teams.
  • End‑To‑End Owner: Comfortable owning services through the full development lifecycle, from design to release.
  • Technically Educated: Bachelor’s or advanced degree in Computer Science, Engineering, Math, Physics, or a related field.
Bonus Points If You Have
  • Experience with GPU/hardware acceleration for inference (e.g., NVIDIA Tensor

    RT)
  • Experience with containers (Docker, Kubernetes)
  • Familiarity with Infrastructure‑as‑Code (Terraform, Ansible)
  • Experience with advanced ML architectures (two‑tower models, teacher‑student learning)
  • Experience with Rust
  • Experience with MLOps systems (monitoring, lifecycle management, automation)
What We Offer

Salary: $200,000 - $270,000 USD Base Salary + Equity

Compensation is based on experience, skills, and other factors. Base salary is just…

To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary