Senior or MLE - Droid Perception; Onboard
Listed on 2026-07-01
-
Software Development
AI Engineer (Applied/Software)
About Zipline
Do you want to change the world? Zipline is on a mission to transform the way goods move. Our aim is to solve the world’s most urgent and complex access challenges by building, manufacturing and operating the first instant delivery and logistics system that serves all humans equally, wherever they are. From powering Rwanda’s national blood delivery network and Ghana’s COVID-19 vaccine distribution, to providing on-demand home delivery for Walmart, to enabling healthcare providers to bring care directly to U.S. homes, we are transforming the way things move for businesses, governments and consumers.
The technology is complex but the idea is simple: a teleportation service that delivers what you need, when you need it. Through our technology that includes robotics and autonomy, we are decarbonizing delivery, decreasing road congestion, and reducing fossil fuel consumption and air pollution, while providing equitable access to billions of people and building a more resilient global supply chain.
Join Zipline and help us to make good on our promise to build an equitable and more resilient global supply chain for billions of people.
About You and The RoleZipline is operating the world’s largest autonomous logistics network—delivering critical medical and commercial goods globally with high reliability, precision, and scale. As we expand into increasingly complex, safety-critical environments, the systems behind our autonomy stack must be robust, adaptable, and deeply integrated—especially at the intersection of perception and deployment.
We're hiring senior and staff perception engineers to join our Droid team
, the group responsible for the autonomy that powers Zipline’s backyard delivery experience. This team owns the full stack of onboard, offboard and cloud-side perception systems enabling precise and reliable delivery in complex customer backyards. You’ll build realtime 3D perception models that capture geometry, scene semantics and preference for both delivery and package pickup. You’ll develop across the entire perception stack, from optimizing the onboard TensorRT engines to building a data flywheel that finds interesting samples from our long-tail of customer deliveries.
You will work closely with the planner team to make sure we build the right system, rather than just the best perception model.
This is not a research role—you’ll be expected to move fast, ship production-grade systems, and find clever ways to apply state-of-the-art techniques to tangible, high-impact problems.
What You’ll Do- Implement, train and evaluate real-time 3D perception models that work with two or more cameras across one or more timesteps
- Run these models onboard a resource-constrained computer, finding ways to optimize and reduce compute and memory footprints
- Build visualization, introspection and eval tooling to deeply understand model performance both on test datasets as well as “in the wild”
- Help design and implement data selection pipelines that identify the most valuable data from the field, then help our annotation teams label these faster through the use of prelabeling or pseudo-ground-truthing these samples.
- Work closely with the droid planner team, building a strong interface between the two subsystems and tracking the right metrics to ensure we’re always hill-climbing towards a better overall system
- Stay up to date with research in the field, drive experimentation, and help keep Zipline’s modeling stack in lockstep with powerful new paradigms in real-time compute-constrained 3D perception
- At least 5+ years of experience building and deploying deep learning-based perception systems, particularly in 3D geometry, semantic understanding
, or mapping from cameras - Strong understanding of classical computer vision (e.g. camera calibration, epipolar geometry, structure-from-motion, SGBM stereo) and the ability to blend it with modern ML approaches.
- Expertise and depth with robotics fundamentals: you should be able to reason about reference frames, matrix math, SE(3) manifolds and probabilistic sensor fusion
- Hands-on experience training, iterating on, and optimizing CNN and transformer…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).