×
Register Here to Apply for Jobs or Post Jobs. X

Senior Researcher – Vision-Language Models; VLM

Job in Markham, Ontario, Canada
Listing for: Huawei Technologies Canada Co., Ltd.
Full Time position
Listed on 2025-12-28
Job specializations:
  • IT/Tech
    Machine Learning/ ML Engineer, Data Scientist, AI Engineer, Computer Science
Job Description & How to Apply Below
Position: Senior Researcher – Vision-Language Models (VLM)

Job description

Huawei Canada has an immediate permanent opening for a Senior Researcher.


About the team:
The Human-Machine Interaction Lab unites global talents to redefine the relationship between humans and technology. Focused on innovation and user-centered design, the lab strives to advance human-computer interaction research. Our team includes researchers, engineers, and designers collaborating across disciplines to develop novel interactive systems, sensing technologies, wearable and IoT systems, human factors, computer vision, and multimodal interfaces. Through high-impact products and cutting-edge research, we aim to enhance user experiences and interactions with technology.


About the job:

  • Design, develop, train, evaluate, and optimize advanced Computer Vision, and Machine Learning models, and Vision-Language models (e.g., transformers, multimodal encoders, diffusion models), emphasizing on-device performance and efficiency

  • Prototype and optimize SOTA architectures for tasks such as image understanding, visual search, object detection, segmentation, multimodal grounding, etc.

  • Implement Computer Vision and Machine Learning algorithms from scratch or leverage existing libraries and frameworks (e.g., Tensor Flow, PyTorch, scikit-learn, Keras)

  • Explore and apply techniques such as quantization, pruning, distillation, LoRA adapters to meet mobile/embedded constraints

  • Choose appropriate algorithms and techniques based on problem requirements, data characteristics, and business needs

  • Manage and process large multimodal datasets (images, videos, text)

  • Build and maintain data pipelines for model training and inference

  • Deploy Machine Learning models to production environments and maintain model retraining and versioning strategies

  • Job requirements

    About the ideal candidate:

  • Ph.D. or Master's degree in Computer Science or a related field with a focus on Computer Vision and Machine Learning

  • Minimum 3 years of research and development experience in Vision-Language or multimodal AI, with a strong portfolio of applied projects or publications

  • Proficiency in Computer Vision and Machine Learning frameworks (e.g., Tensor Flow, PyTorch), and modern CV toolchains (OpenCV, MMDetection, Detectron2, etc.)

  • Familiarity with transformers, diffusion models, contrastive learning (e.g., CLIP, ALIGN), and prompt/adaptor-based fine-tuning techniques is an asset

  • On-device model deployment experience is an asset

  • Experience contributing to relevant open-source projects is an asset

  • Experience building commercial agent, conversational AI, or interactive VLM systems is an asset

  • Position Requirements
    10+ Years work experience
    Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
    To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
     
     
     
    Search for further Jobs Here:
    (Try combinations for better Results! Or enter less keywords for broader Results)
    Location
    Increase/decrease your Search Radius (miles)

    Job Posting Language
    Employment Category
    Education (minimum level)
    Filters
    Education Level
    Experience Level (years)
    Posted in last:
    Salary