Semantic Backend Engineer Remote
Remote / Online - Candidates ideally in
Town of Poland, Jamestown, Chautauqua County, New York, 14701, USA
Listed on 2026-01-01
Listing for:
INFUSE
Contract, Remote/Work from Home position
Job specializations:
- IT/Tech
AI Engineer, Data Engineer
Job Description & How to Apply Below
Location: Town of Poland
INFUSE is committed to complying with applicable data privacy and security laws and regulations. For more information, please see our Privacy Policy.
We’re looking for an applied ML engineer to own the semantic ingestion pipeline, from raw PDFs to tagged, summarized, and embedded assets.
Why This Role Matters
Your models decide what gets found, how it’s tagged, and which content and companies stand out. You’ll help define what “relevance” and “freshness” mean for over a million resources and 50,000+ company pages and make sure INKHUB stays ahead of the curve.
Hiring Process
- Review application against job requirements – humans evaluate without machine learning tools.
- Invite to submit a video interview; may be followed by a test or short project.
- Interview with hiring manager and/or interview team (face‑to‑face or Zoom).
- Decision time – discuss offer live if both parties are excited.
Responsibilities
- Own the ETL pipeline from raw PDFs (S3‑ingested) to structured resources.
- Finalize summarization and classification flow using open‑source models with GPT‑4o fallback.
- Apply filtering logic (≤3 years old, ≤100 pages, etc.) to enforce resource quality.
- Map each asset to specific topic taxonomy (10+ per topic across ~9,000 topics).
- Generate dense embeddings using sentence‑transformers.
- Load and query embeddings using Milvus or pgvector.
- Implement “freshness” logic to identify and index new or updated content based on file diffing, crawl timestamp, or document hash.
- Build a QA/eval harness: format compliance, recall@5, drift monitoring.
- Expose /v1/semantic‑search via FastAPI, with filtering and rank fusion.
- Collaborate closely with our Tech Lead on UX integration and snippet generation.
Tech Stack
- Python, PyTorch, sentence‑transformers, OpenAI APIs, or similar pretrained LLMs.
- FastAPI, Milvus or pgvector, PyPDF/Tika, Airflow or Lambda for orchestration.
- Docker, GPU scheduling, Athena/Redshift SQL.
Requirements
- You’ve built ML pipelines that touched real users, not just notebooks.
- You’ve worked on semantic search, embeddings, or large‑scale tagging.
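To make a few of the items listed above more concrete, here is a minimal sketch of hash-based freshness detection, recall@5, and rank fusion. It is illustrative only and not INFUSE's implementation: the function names, the `seen_hashes` mapping, and the choice of reciprocal rank fusion (with the commonly used constant `k=60`) are all assumptions, since the posting does not say which fusion method or storage layout is used.

```python
import hashlib


def document_hash(raw_bytes: bytes) -> str:
    """Return a SHA-256 hex digest of the raw file contents."""
    return hashlib.sha256(raw_bytes).hexdigest()


def needs_reindex(doc_id: str, raw_bytes: bytes, seen_hashes: dict[str, str]) -> bool:
    """True when the document is new or its content hash has changed.

    `seen_hashes` is a hypothetical mapping of doc id -> last indexed hash.
    """
    return seen_hashes.get(doc_id) != document_hash(raw_bytes)


def recall_at_k(retrieved: list[str], relevant: set[str], k: int = 5) -> float:
    """Fraction of the relevant items that appear in the top-k results."""
    if not relevant:
        return 0.0
    return len(set(retrieved[:k]) & relevant) / len(relevant)


def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Fuse ranked lists: each item scores the sum of 1 / (k + rank)."""
    scores: dict[str, float] = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; ties broken by doc id for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))
```

In a pipeline like the one described, `needs_reindex` would decide which PDFs to re-embed, `recall_at_k` would run inside the eval harness against a labeled query set, and `reciprocal_rank_fusion` could merge, for example, a vector-search ranking with a keyword ranking before results are returned.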
Seniority level:
Mid‑Senior level.
Employment type:
Full‑time.
Industries:
Construction, Software Development, IT Services, and IT Consulting.