×
Register Here to Apply for Jobs or Post Jobs. X
More jobs:

Senior Machine Learning Engineer; GPU​/CUDA

Job in Chicago, Cook County, Illinois, 60290, USA
Listing for: Fintal Partners
Full Time position
Listed on 2026-06-03
Job specializations:
  • Engineering
    AI Engineer
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below
Position: Senior Machine Learning Engineer (GPU / CUDA)

A market-leading high-frequency trading firm is seeking Senior Machine Learning Engineers to join a specialist performance engineering team focused on low-level optimization for large-scale AI workloads.

This role is heavily focused on GPU performance, CUDA kernel optimization, and systems-level acceleration work later in the ML pipeline. The team works on extracting maximum performance from modern hardware architectures to support highly demanding training and inference workloads.

You will work close to the metal, optimizing critical components across CUDA, C++, memory management, and GPU execution paths. The work combines deep systems engineering with cutting-edge machine learning infrastructure.

Key Responsibilities:
  • Develop and optimize CUDA kernels for high-performance ML workloads
  • Improve GPU utilization, memory efficiency, and execution performance
  • Profile and optimize bottlenecks across training and inference pipelines
  • Work on compiler/runtime-level optimizations and kernel fusion strategies
  • Collaborate with ML systems and infrastructure teams on end-to-end acceleration
  • Build highly optimized C++ components for latency and throughput-sensitive systems
Requirements:
  • Strong C++ and CUDA development experience
  • Deep understanding of GPU architecture and performance optimization
  • Experience profiling and debugging GPU workloads using tools such as Nsight
  • Knowledge of PyTorch internals, Triton, NCCL, CUTLASS, or similar frameworks
  • Strong systems programming background with focus on performance engineering
  • Experience working on high-throughput or low-latency distributed systems
  • Computer Science, Mathematics, Physics, Engineering, or related technical degree preferred

This is an opportunity to work on some of the most technically challenging AI infrastructure problems in the industry, within an environment that values engineering excellence, autonomy, and performance.

#J-18808-Ljbffr
Position Requirements
10+ Years work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary