Senior Data Engineer: NLP & Large-Scale ETL Pipelines

Job in Herndon, Fairfax County, Virginia, 22070, USA
Listing for: Sabree-Software-Services
Full Time position
Listed on 2026-05-29
Job specializations:
  • IT/Tech
    Data Engineer
Salary/Wage Range or Industry Benchmark: 80,000 - 100,000 USD yearly
Job Description & How to Apply Below
Position: Senior Data Engineer: NLP & Large-Scale ETL Pipelines
Job Description

We are seeking a detail-oriented and experienced ETL Engineer to join our data engineering team. The ideal candidate will be responsible for designing, building, and maintaining data pipelines and integration processes that enable reliable, timely, and high-quality data delivery across the organization. This role requires strong technical capabilities, a deep understanding of data architecture, and the ability to collaborate effectively with cross-functional stakeholders.

The ideal candidate brings deep expertise in Natural Language Processing (NLP), large-scale data processing, and search/discovery systems, with the ability to lead technical teams and translate mission needs into scalable, production-ready capabilities that deliver operational impact.

Development will take place in an iterative fashion using Agile development methodology with input from all levels of stakeholders. The candidate must have the ability to communicate with project team members, user community, and leadership to assess changes and demonstrate iterative progress.

Key Responsibilities

Design, develop, and maintain ETL/ELT pipelines that support data warehouse, analytics, and application needs. Must be experienced with large data sets (hundreds of thousands of records; GB- and TB-scale data sets).

Extract, transform, and load data from various sources into centralized storage solutions.

Design and enhance search and discovery platforms across large volumes of structured and unstructured data

Perform data ingestion, ETL, and integration across enterprise and multi-source environments

Optimize ETL workflows for performance, scalability, and reliability.

Conduct data validation, profiling, and quality checks to ensure accuracy and completeness.

Troubleshoot and resolve data inconsistencies, pipeline failures, or performance bottlenecks.

Build and maintain cloud-native solutions (AWS) aligned to secure and resilient architecture patterns

Partner with mission operators, analysts, and senior stakeholders to define requirements and deliver mission-relevant analytics

Translate mission needs into technical designs, architectures, and implementation roadmaps, ensuring alignment to operational objectives

Deliver clear, compelling visualizations, dashboards, and executive-level briefings that communicate analytic insights and recommendations

Provide technical leadership and mentorship, including hands-on development, code review, and team development

Own delivery of analytic capabilities from concept through deployment, accreditation, and sustainment

Support system accreditation, data governance, and security architecture, ensuring data integrity and compliance within classified environments

Required Skills

Bachelor’s degree in Computer Science, Information Systems, Engineering, or related field (or equivalent experience).

Minimum 6-8 years working with the Linux operating system, including tuning the system for efficient parallel processing and understanding memory, storage, and data processing at scale

Minimum 6-8 years in object-oriented programming. Python is the preferred software development language.

Minimum 6-8 years of demonstrated experience with applications in the Commercial Cloud Services (C2S) environment or an Amazon Web Services cloud environment. Willing to consider substituting C2S if candidate has a minimum 4-6 years of cloud computing technology to include Azure, Oracle, Google, etc.

Minimum 4-6 years of demonstrated Extract, Transform, Load (ETL) experience with large structured and unstructured raw data sets.

Strong experience with ETL tools such as Informatica, Talend, SSIS, AWS Glue, or Azure Data Factory.

Proficiency in SQL, including complex queries and query optimization.

6-8 years of experience with the AWS platform, including an understanding of EC2 and RCS instance types

Strong understanding of data warehousing concepts, data modeling, and schema design.

Hands-on experience with scripting languages such as Python, Bash, or PowerShell.

Familiarity with relational and NoSQL databases.

Experience using version control systems such as Git.

Desired Skills

Experience working with big data technologies (e.g., Spark, Hadoop, Databricks).

Experience with transforme…
Position Requirements
10+ years of work experience