Senior Machine Learning Engineer
Listed on 2026-01-01
Senior Audio / Speech ML Engineer: Voice-Controlled Robotics | Edge AI | Autonomy
Location: Austin, TX
Salary: $150,000 - $225,000 + Early Stage Equity
We are working with a rapidly growing, well-funded startup operating at the cutting edge of autonomous robotics and voice-based human–machine interaction. Backed by globally recognized investors, the team is building conversational AI systems that enable operators to control multiple autonomous platforms through natural language.
They are now hiring a Senior Audio / Speech Machine Learning Engineer to own the end-to-end speech capability—from data collection through deployment on edge hardware in real-world environments.
What You’ll Do
- Build speech recognition and audio understanding models (STT / STS) from the ground up
- Own the entire audio ML pipeline: data, training infrastructure, deployment, optimization
- Design model architectures for noisy, real-world environments (wind, radio noise, variable microphones)
- Deploy and optimize models on resource-constrained edge devices (e.g., NVIDIA Jetson)
- Define and improve WER and related performance metrics
- Run field tests and data-collection campaigns in live environments
- Influence audio hardware decisions and quality requirements
- Stay current with speech recognition, multimodal, and edge-AI research
Requirements
- Degree in CS, ML, AI, Audio Engineering, or related field
- Proven track record building and shipping audio ML systems end-to-end
- Strong knowledge of audio DSP and frequency-domain processing
- Expert-level Python for ML (training loops, pipelines, experiment tracking)
- Experience with production deployment, quantization, and optimization
- Willingness to travel for field testing and data capture
Nice to Have
- Experience working on audio-focused ML products or speech tech
- Experience deploying on ARM or NVIDIA Jetson platforms
- ROS / ROS2 for robotics integration
- Multimodal AI (speech + vision + language)
Compensation and Benefits
- $150,000 - $225,000
- Equity (early stage)
- Strong PTO
- 401k
- Life Insurance
- Short/Long-Term Disability
- Wellness Programs
If interested, please click Apply Now for immediate consideration.