Principal Data Engineer
Listed on 2026-06-26
IT/Tech
Data Engineering
Position Summary
Caremark LLC, a CVS Health company, is hiring for the Principal Data Engineer role in Hartford, CT.
Responsibilities include developing large-scale data structures and pipelines, organizing, collecting, and standardizing data to generate insights and address reporting needs. Collaborate with the data science team to transform data and integrate algorithms and models into automated processes. Build data marts and data models to support internal customers, integrate data from multiple sources while ensuring quality and accessibility, and analyze IT environments to identify critical capabilities and recommend solutions.
Build high‑performance data processing frameworks using cloud or on‑premise platforms. Design and implement efficient Extract/Load/Transform workflows, write ETL processes, and develop real-time and offline analytic tools. Design conformed, aggregated, and semantic data layers and manipulate large datasets with SQL, BTEQ, SAS, and related tools. Work on big-data platforms such as Hadoop (Azure or GCP preferred) and Spark. Apply knowledge of Hadoop architecture and HDFS commands, and optimize queries, to build data pipelines.
Utilize strong programming skills in Python, Java, or similar languages to build robust data pipelines and dynamic systems. Experiment with software tools, advise on new tools, support modeling/diagramming, and build design specifications. Collaborate with business solution strategists, support new data source onboarding through discovery, profiling, and mapping, and participate in proof‑of‑concept activities. Telecommuting is available. Multiple positions.
Requirements
Master’s degree (or foreign equivalent) in Computer Science, Information Systems, Data Science, Statistics, Mathematics, Analytics, or a related field and two years of experience in the offered or related occupation.
Must have at least two years of experience in each of the following:
- Cloud migration technologies: Azure, Amazon Web Services, or Google Cloud Platform
- Messaging platform: Kafka
- Containerization runtime platform
- Solution architecture, design, and end‑to‑end delivery of projects
- Domain support for healthcare or retail organization
- Proof of Value (PoV) and MVP using AI: Generative AI, AutoML, or virtual AI databases
- Guidance on Large Language Model selection and use of minimum viable products
- Conduct data quality assessments, define data governance processes, and MLOps
- Establish data architectures and best practices
Pay Range: $ per year to $ per year. The base salary will depend on experience, education, geography, and other factors. Eligible for bonus, commission, short‑term incentive, and equity award programs.
Benefits
Eligible employees may enroll in medical, dental, vision, 401(k) retirement savings, employee stock purchase plan, term life insurance, short‑term and long‑term disability benefits, well‑being programs, education assistance, free development courses, store discounts, and paid time off. Paid holidays are provided consistent with state law and company policies.
Equal‑Opportunity Statement
Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws.