More jobs:
Cerebras Systems Full Stack LLM Engineer
Job Description & How to Apply Below
The Inference Core Model Bringup team needs an experienced engineer who excels in fast-paced environments to support large-scale ML applications. Your expertise will span from model architecture adjustment to runtime integration, ensuring incredible performance and effective debugging across the entire Cerebras software stack.
Key Responsibilities:
• Lead the bring up of ML models on Cerebras CSX systems
• Conduct performance tuning and optimization across the AI toolchain
• Identify and debug issues in model codes, IRs, and hardware utilization
• Propose enhancements for better automation in model deployments
Requirements:
• Bachelor's, Master's, or PhD in Computer Science or related discipline
• Strong familiarity with deep learning frameworks such as Tensor Flow
• Proven skills in performance profiling and debugging
• Experience with LLVM or MLIR compiler technologies
• Comfort with high-level coding in C/C++ and optimizations
Drive the future of AI application performance through your expert contributions at Cerebras Systems.
#J-18808-Ljbffr
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×