×
Register Here to Apply for Jobs or Post Jobs. X

Research Robotics​/Computer Vision Engineer

Job in San Mateo, San Mateo County, California, 94409, USA
Listing for: Lightspeed
Full Time position
Listed on 2026-06-12
Job specializations:
  • Engineering
    Robotics, AI Engineer (Applied/Software)
Salary/Wage Range or Industry Benchmark: 60000 - 80000 USD Yearly USD 60000.00 80000.00 YEAR
Job Description & How to Apply Below

Overview

At Skild AI, we are building the world's first general purpose robotic intelligence that is robust and adapts to unseen scenarios without failing. We believe massive scale through data-driven machine learning is the key to unlocking these capabilities for the widespread deployment of robots within society. Our team consists of individuals with varying levels of experience and backgrounds, from new graduates to domain experts.

Relevant industry experience is important, but ultimately less so than your demonstrated abilities and attitude. We are looking for passionate individuals who are eager to explore uncharted waters and contribute to our innovative projects.

Position Overview

Skild AI, Inc. seeks a Research Robotics/Computer Vision Engineer in San Mateo, CA responsible for developing perceptive, intelligent, and adaptable robotic systems capable of learning and performing tasks with a focus on 3D computer vision and autonomous navigation. This includes designing perception pipelines, optimizing SLAM systems, and creating learning-based algorithms for robust robotic control in real-world environments.

Specific duties include:

  • implementing perception on robots to enable safe exploration and navigation in real world environments in collaboration with the locomotion team
  • reconstructing an entire scene in 3D using monocular images, estimating camera poses, optimizing and streamlining 3D SLAM
  • developing a set of software tools for localization of a robot using only visual inputs
  • building robust software to enable life-long mapping on a robot via optimally merged pose-graphs
  • visual servoing wrt objects detected/ tracked to control robot motion
  • researching novel techniques to detect and cater to glare during robotic mapping and navigation
  • building infrastructure and pipeline and collecting data to enable streaming of hand movements for training robot manipulation tasks such as pick and place
  • maintaining a camera and 2D lidar based navigation stack, including fixing bugs, adding new customer feature requests, and ensuring successful deployments
Responsibilities
  • implementing perception on robots to enable safe exploration and navigation in real world environments in collaboration with the locomotion team
  • reconstructing an entire scene in 3D using monocular images, estimating camera poses, optimizing and streamlining 3D SLAM
  • developing a set of software tools for localization of a robot using only visual inputs
  • building robust software to enable life-long mapping on a robot via optimally merged pose-graphs
  • visual servoing wrt objects detected/ tracked to control robot motion
  • researching novel techniques to detect and cater to glare during robotic mapping and navigation
  • building infrastructure and pipeline and collecting data to enable streaming of hand movements for training robot manipulation tasks such as pick and place
  • maintaining a camera and 2D lidar based navigation stack, including fixing bugs, adding new customer feature requests, and ensuring successful deployments
Minimum Requirements
  • Must have a master’s degree (or foreign equivalent) in Computer Vision, Robotics, or a directly related discipline and one (1) year of experience in Machine Learning or Data Science.
  • Must have any experience with or knowledge of each of the following: (i) reconstructing 3D scenes using monocular videos, meshes, point clouds, Neural Radiance Fields, and Gaussian Splats; (ii) reconstructing rigid and articulated hand-held objects from videos, including inferring the time-varying hand configurations and relative poses of the objects; (iii) using generative computer vision, including diffusion models to guide reconstruction, or addressing occlusion and limited viewpoint variations in videos via data driven priors;

    (iv) optimizing attention-based models for perception used in autonomous navigation systems; (v) using Neural Architectural Search (NAS) to find better perception backbone architectures with higher accuracies and lower latencies; and (vi) cloud-based training in AWS, Google cloud, or Vetex AI and optimized data loading for cloud based distributed training for deep learning workloads (e.g. Pytorch data loader, or sharding) with hardware-in-loop.
  • Experience can be concurrent.
#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary