Edge AI ML Kernel Performance Engineer
Job in
Vancouver, BC, Canada
Listed on 2026-06-19
Listing for:
Amazon Development Centre Canada ULC
Full Time
position Listed on 2026-06-19
Job specializations:
-
IT/Tech
AI Engineer (Applied/Software), Machine Learning/ ML Engineer
Job Description & How to Apply Below
In this role, you will operate at the hardware-software boundary, collaborating with cross-functional teams to design and implement kernels for quantization-aware training and low-bit inference. Your work will involve analyzing kernel performance, identifying bottlenecks, and optimizing performance for cloud and edge deployments using modern GPU accelerators.
Key Responsibilities:
• Design CUDA and Triton kernels for efficient model training
• Conduct performance analysis to resolve training bottlenecks
• Implement kernel optimizations for compression tasks
• Create a kernel development harness for profiling
• Maintain a comprehensive training kernels library
Requirements:
• 3+ years in software development
• 2+ years in design or architecture of systems
• Experience with CUDA and ML kernels
• Proficient in Python, Java, C++
• Understanding of GPU memory hierarchies
Drive edge AI advancements with optimized kernel development at Amazon Devices.
#J-18808-Ljbffr
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×