Data Engineer
Listed on 2026-01-01
-
IT/Tech
Data Engineer
Base pay range
$/yr - $/yr
Direct message the job poster from RSM Solutions, Inc.
Data Integration Engineer – Onsite in Irvine, California.
I am Tom Welke, Partner & VP at RSM Solutions, Inc. I have been recruiting technical talent for more than 23 years and have been in the tech space since the 1990s. I write my own job descriptions without relying on AI or bots. I focus on clear, realistic expectations. Technical fit is the highest priority; social fit is also important.
The hiring manager is a longtime friend who values continuous learning – a key culture of this environment.
This role requires U.S. Citizenship or Green Card Holder status only. All other visa categories are not eligible.
Responsibilities- Design and implement batch and streaming pipelines in Apache Spark running on Kubernetes and Kubeflow Pipelines to hydrate feature stores and training datasets.
- Build high throughput ETL/ELT jobs with SSIS, SSAS, and T SQL against MS SQL Server, applying Data Vault style modeling patterns for auditability.
- Integrate source control, build, and release automation using Git Hub Actions and Azure Dev Ops for every pipeline component.
- Instrument pipelines with Prometheus exporters and visualize SLA, latency, and error budget metrics to enable proactive alerting.
- Create automated data quality and schema drift checks; surface anomalies to support a rapid incident response process.
- Use MLflow Tracking and Model Registry to version artifacts, parameters, and metrics for reproducible experiments and safe rollbacks.
- Work with data scientists to automate model retraining and deployment triggers within Kubeflow based on data freshness or concept drift signals.
- Develop Power Shell and .NET utilities to orchestrate job dependencies, manage secrets, and publish telemetry to Azure Monitor.
- Optimize Spark and SQL workloads through indexing, partitioning, and cluster sizing strategies, benchmarking performance in CI pipelines.
- Document lineage, ownership, and retention policies; ensure pipelines con‑form to PCI/SOX and internal data governance standards.
- At least 6 years of experience building data pipelines in Spark or equivalent.
- At least 2 years deploying workloads on Kubernetes/Kubeflow.
- At least 2 years of experience with MLflow or similar experiment‑tracking tools.
- At least 6 years of experience in T‑SQL, Python/Scala for Spark.
- At least 6 years of Power Shell/.NET scripting.
- At least 6 years of experience with Git Hub, Azure Dev Ops, Prometheus, Grafana, SSIS/SSAS.
- Certifications such as Kubernetes CKA/CKAD, Azure Data Engineer (DP‑203), or MLOps‑focused certifications (e.g., Kubeflow or MLflow) are desirable.
- Experience mentoring engineers on best practices in containerized data engineering and MLOps.
Mid–Senior level
Employment typeFull‑time
Job functionInformation Technology – Manufacturing and Retail Apparel & Fashion
Benefits- Medical insurance
- Vision insurance
- 401(k)
Referrals increase your chances of interviewing at RSM Solutions, Inc by 2x.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).