More jobs:
Web Data Engineer
Remote / Online - Candidates ideally in
Overland Park, Johnson County, Kansas, 66213, USA
Listed on 2026-02-16
Overland Park, Johnson County, Kansas, 66213, USA
Listing for:
TriCom Technical Services
Remote/Work from Home
position Listed on 2026-02-16
Job specializations:
-
IT/Tech
Data Engineer, Data Security
Job Description & How to Apply Below
Overview
Our client is seeking a Web Data Engineer who is passionate about extracting structured insights from unstructured Web data. This engineer will design, build, and maintain scalable Web scraping pipelines to gather healthcare provider information from multiple online sources, using agent-based automation frameworks including Firecrawl and advanced Python scraping libraries.
Responsibilities- Develop, deploy, and maintain robust Web scraping pipelines for collecting healthcare provider data.
- Work with agentic frameworks (e.g., Firecrawl) to automate dynamic data extraction workflows.
- Use tools, including Selenium, to extract and parse structured/unstructured Web data.
- Ensure data accuracy, completeness, and freshness through validation, deduplication, and error-handling processes.
- Collaborate with data engineers to integrate scraped data into our existing data pipelines and storage systems.
- Monitor scraping performance and troubleshoot issues with site structure changes, blocking mechanisms, or throttling.
- Follow best practices for ethical and compliant data collection.
- 3+ years of professional experience in Python-based Web scraping or data engineering, preferably in a SaaS based environment.
- Strong proficiency with Python and libraries including Selenium, Beautiful Soup, or Playwright.
- Familiarity with agentic scraping frameworks (e.g., Firecrawl) or autonomous browser-based extraction systems.
- Experience handling large-scale scraping, asynchronous requests, and data normalization.
- Working knowledge of data storage formats and systems (e.g., JSON, Parquet, SQL, or Cloud databases).
- Strong problem-solving skills and ability to debug complex scraping workflows.
- Solid understanding of Web protocols, HTML structures, and REST APIs.
- Bachelor's degree in Data Science, Computer Science, Statistics, Mathematics, or a related quantitative field.
- Experience with Cloud-based data pipelines (Databricks).
- Knowledge of healthcare provider data or healthcare data standards.
- Familiarity with AI-driven or LLM-powered data collection frameworks.
This is a 6-month REMOTE Contract opportunity with our Overland Park, KS client. 100% Paid employee Medical/Dental Benefits, Paid time off, Paid Holidays, and 401(k) (with immediately-vested company match) available with Tri Com during the contract period. H-1B visa sponsorship is not available for this position. No third-parties, please.
#J-18808-LjbffrTo View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×