Cloudera Data Engineer
Madison, Dane County, Wisconsin, 53774, USA
Listed on 2026-02-18
-
Software Development
Data Engineer
Cloudera Data Engineer
We have an exciting opening for a full‑time Cloudera Data Engineer to join our innovative Software Development team in Madison, Wisconsin!
Join a team recognized as one of Madison Magazine's Best Places to Work, where innovation thrives, collaboration drives success, and your work makes a real‑world impact—because at Yahara, we don't just build data pipelines, we empower people and transform industries.
Important Notes about this Position* This position offers remote work flexibility but is only open to candidates who reside in or are willing to relocate to the greater Madison, WI area.
* We are unable to provide sponsorship at this time.
SummaryThe Cloudera Data Engineer designs and maintains enterprise‑scale data pipelines using the Cloudera Data Platform (CDP) and related big data technologies. The role focuses on building scalable ETL/ELT workflows, optimizing distributed compute, and enabling secure, high‑performance data services across multiple domains. Work is highly collaborative within cross‑functional Agile teams.
Our ApproachWe build data solutions grounded in strong engineering fundamentals—reliable architecture, quality controls, and scalable design. We use modern cloud platforms, integrating analytics and ML where valuable while prioritizing data integrity and governance.
What You'll Do- Design and maintain enterprise‑scale pipelines using CDP and big data tooling.
- Build scalable ETL/ELT workflows for structured and unstructured data.
- Develop distributed processing jobs using big data framework components.
- Design data storage solutions balancing performance and cost.
- Collaborate with analysts, scientists, and developers to deliver data solutions.
- Develop technical documentation for pipelines and architectures.
- 5–7 years in data engineering with big data or distributed systems.
- Experience with CDP, CDH, or similar enterprise big data platforms.
- Degree in CS, Data Science, Information Systems, or equivalent experience.
- Strong background in distributed data processing.
- Ability to obtain and maintain Public Trust clearance.
- Self‑starter with a passion for data engineering.
- Strong analytical and problem‑solving skills.
- Enthusiastic about big data technologies and performance optimization.
- Detail‑oriented with a commitment to accuracy and reliability.
- Ability to translate business requirements into effective solutions.
- Collaborative, able to recognize blockers and leverage team strengths.
- Experience with Agile development environments.
- Proven experience designing and implementing production pipelines.
- Experience in biohealth, laboratory, or scientific data environments is a plus.
- Familiarity with HIPAA, FDA, or GxP preferred but not required.
- Cloudera ecosystem experience: CDP, HDFS, Hive/Impala, Spark.
- Programming:
Python, Scala, or Java. - Advanced SQL and distributed compute (Spark, Map Reduce).
- Shell scripting and version control (Git).
- Data storage formats:
Parquet, Avro, ORC. - Workflow orchestration and scheduling.
- Cloud experience (Azure, AWS, or GCP) and understanding of hybrid patterns.
- 20+ days of PTO accruable in the first year!
- Comprehensive health insurance (Medical, Dental, Vision) with HMO and PPO options
- Health Savings Account (HSA) with annual employer contributions
- 401(k) with guaranteed company match (Traditional and Roth options)
- 100% company‑paid short‑term and long‑term disability
- 100% company‑paid life insurance with option to increase coverage
- 100% company‑paid identity theft protection
- On‑site gym with basketball court
- Hybrid/remote schedule with home office stipend
- Fresh fruit, healthy snacks, and beverages provided daily
- Bonus certification program (Microsoft, AWS, PMP, IIBA, etc.)
- Employee Assistance Program (counseling, legal, financial services)
- Monthly and Quarterly Recognition Awards with spot bonuses
- Company‑supported community outreach and volunteer opportunities
- Employee‑run committee involvement opportunities
- Collaborative culture founded on realized values and incredible people
If you need an accommodation as part of the employment process, please contact Human Resources via email at
Yahara Software LLC is an Equal Employment Opportunity/Affirmative Action Employer.
This is a full‑time, salaried position with competitive salary and benefits. Candidates must be eligible to work in the U.S. on a permanent basis and can work on‑site in our office located in Madison, Wisconsin.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).