×
Register Here to Apply for Jobs or Post Jobs. X

Senior Machine Learning Engineer

Job in Austin, Travis County, Texas, 78716, USA
Listing for: webAI
Full Time position
Listed on 2026-02-16
Job specializations:
  • IT/Tech
    AI Engineer, Machine Learning/ ML Engineer
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below

About Us: webAI is pioneering the future of artificial intelligence by establishing the first distributed AI infrastructure dedicated to personalized AI. We recognize the evolving demands of a data-driven society for scalability and flexibility, and we firmly believe that the future of AI lies in distributed processing at the edge, bringing computation closer to the source of data generation. Our mission is to build a future where a company's valuable data and intellectual property remain entirely private, enabling the deployment of large-scale AI models directly on standard consumer hardware without compromising the information embedded within those models.

We are developing an end-to-end platform that is secure, scalable, and fully under the control of our users, empowering enterprises with AI that understands their unique business. We are a team driven by truth, ownership, tenacity, and humility
, and we seek individuals who resonate with these core values and are passionate about shaping the next generation of AI.

About the Role

We are seeking an Senior Machine Learning Engineer to support our Public Sector initiatives focused on building and optimizing production ready AI systems for secure and distributed environments. This role sits at the intersection of machine learning, systems engineering, and deployment optimization, bridging research and real world implementation.

You will be responsible for transforming prototype models into scalable, efficient, and reliable production systems that operate seamlessly across a spectrum of hardware from government cloud infrastructure to edge devices in restricted or disconnected environments.

Responsibilities
  • Productionize AI models from research prototypes into scalable, deployable systems used in real world applications.
  • Develop, fine tune, and optimize models using PyTorch, Tensor Flow, or Hugging Face Transformers, adapting both open and closed source models.
  • Implement model optimization techniques such as quantization, pruning, distillation, and hardware specific acceleration.
  • Engineer systems for dynamic model adaptation using low rank adaptation (LoRA), parameter efficient fine tuning (PEFT), and on device inference strategies.
  • Build and maintain Retrieval Augmented Generation (RAG) pipelines, including vector database integration for contextual retrieval.
  • Work with multi modal AI systems across computer vision, audio, and natural language domains.
  • Employ synthetic data generation and digital twinning techniques (GANs, diffusion models, or simulation based) to create robust datasets for edge cases.
  • Develop GPU accelerated and low level system code in C, C++, or Rust for performance critical operations.
  • Optimize model execution for distributed and resource constrained environments, ensuring reliability under variable connectivity conditions.
  • Collaborate cross functionally with Infrastructure, MLOps, and Security teams to deliver secure, compliant, and high performance AI solutions for government partners.
Qualifications
  • Active US Security clearance or eligibility and willingness to obtain a US Security clearance
  • 5+ years of experience in applied AI, ML engineering, or production AI systems.
  • Deep proficiency in PyTorch, Tensor Flow, or Hugging Face Transformers.
  • Proven experience deploying AI models across cloud, edge, and mobile hardware environments.
  • Expertise in model compression and optimization (quantization, pruning, distillation).
  • Strong understanding of GPU computing, CUDA, and performance profiling.
  • Experience building RAG pipelines and integrating vector databases (e.g., FAISS, Milvus, Pinecone).
  • Familiarity with multi modal models and synthetic data generation methods.
  • Low level programming experience in C, C++, or Rust with understanding of computer memory and concurrency.
  • Strong algorithmic and problem solving skills, especially in distributed or constrained compute environments.
Preferred Skills
  • Experience with edge AI, federated learning, or offline inference systems.
  • Familiarity with distributed training frameworks such as Deep Speed or Ray.
  • Understanding of AI governance and compliance frameworks relevant to public sector deployments.
  • Experience…
Position Requirements
10+ Years work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary