×
Register Here to Apply for Jobs or Post Jobs. X

Semantic Backend Engineer Remote

Remote / Online - Candidates ideally in
Town of Poland, Jamestown, Chautauqua County, New York, 14701, USA
Listing for: INFUSE
Contract, Remote/Work from Home position
Listed on 2026-01-01
Job specializations:
  • IT/Tech
    AI Engineer, Data Engineer
Job Description & How to Apply Below
Position: Semantic Backend Engineer (Contract, Remote)
Location: Town of Poland

INFUSE is committed to complying with applicable data privacy and security laws and regulations. For more information, please see our Privacy Policy
.

Applied ML Engineer – Semantic Ingestion Pipeline

We’re looking for an applied ML engineer to own the semantic ingestion pipeline, from raw PDFs to tagged, summarized, and embedded assets.

Why This Role Matters

Your models decide what gets found, how it’s tagged, and which content and companies stand out. You’ll help define what “relevance” and “freshness” mean for over a million resources and 50,000+ company pages and make sure INKHUB stays ahead of the curve.

Hiring Process
  • Review application against job requirements – humans evaluate without machine learning tools.
  • Invite to submit a video interview; may be followed by a test or short project.
  • Interview with hiring manager and/or interview team (face‑to‑face or Zoom).
  • Decision time – discuss offer live if both parties are excited.
What You’ll Do
  • Own the ETL pipeline from raw PDFs (S3‑ingested) to structured resources.
  • Finalize summarization and classification flow using open‑source models with GPT‑4o fallback.
  • Apply filtering logic (≤3 years old, ≤100 pages, etc.) to enforce resource quality.
  • Map each asset to specific topic taxonomy (10+ per topic across ~9,000 topics).
  • Generate dense embeddings using sentence‑transformers.
  • Load and query embeddings using Milvus or pgvector.
  • Implement “freshness” logic to identify and index new or updated content based on file diffing, crawl timestamp, or document hash.
  • Build a QA/eval harness: format compliance, recall@5, drift monitoring.
  • Expose /v1/semantic‑search via FastAPI, with filtering and rank fusion.
  • Collaborate closely with our Tech Lead on UX integration and snippet generation.
Your Toolbox
  • Python, PyTorch, sentence‑transformers, OpenAI APIs, or similar pretrained LLMs.
  • FastAPI, Milvus or pgvector, PyPDF/Tika, Airflow or Lambda for orchestration.
  • Docker, GPU scheduling, Athena/Redshift SQL.
You Might Be a Fit If
  • You’ve built ML pipelines that touched real users, not just notebooks.
  • You’ve worked on semantic search, embeddings, or large‑scale tagging.

Seniority level:
Mid‑Senior level.

Employment type:

Full‑time. Industries:
Construction, Software Development, IT Services, and IT Consulting.

Referrals increase your chances of interviewing at INFUSE by 2x.

#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary