Data Engineer II Job Newton area,New Jersey USA,IT/Tech

At Thorlabs, we design and manufacture components, instruments, and systems that transform the world by identifying, enabling, and accelerating key photonics (i.e., light-based) technologies. Backed by a dedicated workforce of more than 3,000 employees worldwide, Thorlabs contributes to cutting-edge research and real-world innovation.

Whether you're early in your career or bringing years of experience, you’ll find opportunities to grow, take ownership, and make meaningful contributions from day one. We know every employee brings unique talents and perspectives that fuel our success, and we seek driven individuals who are excited to make an impact in a fun, fast-moving culture.

The Data Engineer builds, maintains, and improves reliable data pipelines and models that power analytics, reporting, and data-driven decision-making s position develops ETL/ELT processes, validates and optimizes SQL and compute logic, documents lineage and definitions, and collaborates with Data Analysts, Data Stewards, and business partners to deliver trusted and performant datasets. The role requires foundational understanding of Python/PySpark and basic AI/ML enablement (e.g., simple notebooks, scikit-learn examples, Azure ML notebooks) to support data preparation and experimentation.

Although the location of the position is in Newton, NJ, from time to time it may be required to undertake duties at other Thorlabs locations.

Essential Job Functions include the following, but are not limited to:

Develop and maintain ELT/ETL pipelines (Synapse Pipelines, Data Factory, SQL, notebooks) moving data from D365 into ADLS and Synapse; schedule, monitor, and troubleshoot jobs for reliability.
Write efficient SQL (joins, views, CTEs, window functions) and basic PySpark/Python to transform and model data for analytics and certified datasets.
Implement and monitor data quality checks; work with Data Stewards to triage issues and contribute to root‑cause analysis and remediation.
Apply foundational query and file‑format optimizations (e.g., partitioning) under guidance; escalate platform‑level tuning or performance concerns to the Lead.
Maintain metadata, lineage, and clear documentation for tables, transformations, business rules, and dataset logic.
Use Git and Azure Dev Ops for version control, code reviews, task management, and release coordination.
Support analysts and report authors during user acceptance testing; address defects and iterate improvements.
Contribute to basic AI/ML enablement by preparing features, managing datasets, running training or inference notebooks, and assisting with experiment tracking under guidance.

In addition to the essential functions and duties listed above, all positions are also responsible for:

Meeting company standards pertaining to quantity and quality of work performed on an ongoing basis, performing all work related tasks in a manner that is in compliance with all Company policies and procedures.
Adhering to Company policies, procedures, and directives regarding standards of workplace behavior in completing job duties and assignments.

The Company retains the right to change or assign other duties to this position.

Physical

Activities:

This is largely a sedentary role; however, some filing is required. This may include lifting files, opening cabinets, bending, or standing as necessar

Requirements

Experience:

5+ years of experience in data engineering, analytics engineering, or related internships/co‑ops.
Foundational SQL skills (joins, aggregations, CTEs; capable of reading and modifying queries).
Exposure to Azure data services such as Synapse (serverless/dedicated), ADLS, and Data Factory/Synapse Pipelines.
Basic experience with Python or PySpark for data preparation or orchestration.
Familiarity with BI/reporting datasets (Power BI preferred) and data lineage concepts.
Experience using Git/Azure Dev Ops for work items, code reviews, and basic CI/CD processes.
Strong written and verbal communication skills; organized, detail‑oriented, and comfortable working with deadlines.

Education:

Bachelor’s degree in Computer Science, Information Systems, Data/Computer Engineering, Analytics, or a related field.
Relevant…