Data Scientist Security Clearance Job, McLean area, Virginia, USA, IT/Tech

Position: Data Scientist with Security Clearance

REQUIRED:

Active TS/SCI with Full Scope Polygraph

LOCATION:

Full-time onsite in McLean, VA

We’re looking for a Data Scientist who can build and operate production‑grade data pipelines, deliver scalable ETL/ELT workflows, and solve complex analytical challenges in cloud environments. In this role, you’ll work with PySpark and distributed processing, design optimized SQL solutions, and develop secure, well‑engineered data applications using Python and modern orchestration tools.

Who We Are:

We offer advanced services in data science, data engineering, software engineering, AI solutions, cybersecurity, staff augmentation, and IT program management.

Passionate Integrity, Driven by Excellence
Ardent Principles offers a competitive salary range and a comprehensive, industry‑leading benefits package designed to support long‑term stability and employee well‑being. We provide more than a position—we offer a workplace committed to excellence, integrity, and mission‑focused impact. Our mission is to act as a bridge between satisfied clients and fulfilled employees, ensuring that your job and well‑being are our top priorities because your satisfaction leads to the success of our clients.

Join us as we continue building the future of secure, high‑impact solutions.

Final date to receive applications
June 25, 2026
Department
Data Analysis and Technology Services

Employment Type

Full Time
Location
McLean, VA
Workplace type
Onsite

Key Responsibilities

In this challenging yet rewarding role, you are an integral part of what brings our Company's mission to life. You must have the following required skills, certifications and demonstrated experience in and/or with:
* Building production data pipelines and ETL/ELT workflows at scale.
* Using Apache Spark and PySpark for distributed data processing.
* Advanced Python programming skills including data manipulation libraries (Pandas, NumPy) and data engineering best practices.
* Understanding data security, privacy, governance, and compliance principles.
* Workflow orchestration tools such as Step Functions and Airflow.
* Containerization such as Docker or Podman, and deploying data applications in cloud environments.
* AWS services (in particular S3, Lambda, and Step Functions).
* PostgreSQL and MySQL in production environments, including performance tuning and schema design.
* SQL and query optimization for complex analytical workloads.
* Version control (Git) and CI/CD practices for data pipelines.
* Working with stakeholders to understand data requirements, assess feasibility, and design appropriate solutions with minimal oversight.
* Strong problem-solving and debugging skills for data quality issues, pipeline failures, and performance bottlenecks.

Highly Desired Qualifications

Other skills and demonstrated experience that are highly desired, but not mandatory to perform the work, include:
* Data lakehouse architectures using Apache Iceberg.
* Configuring, deploying, and integrating data platform components: Apache Ranger (access control and data governance), Trino (distributed SQL query engine), data catalogs (Unity Catalog OSS, Apache Polaris, etc.), and Apache Superset (data visualization and dashboarding).
* Bash scripting for automation and data processing tasks.
* Infrastructure as Code (Terraform or CloudFormation) for data infrastructure.
* Tracking data lineage and associated tooling such as OpenLineage.
* Using Java.
* Data quality frameworks, testing methodologies, and validation strategies.
* Background with large-scale data migrations or platform modernization efforts.
* Integrating AI/ML services and models (translation, OCR, speech-to-text, NLP, language detection, topic modeling), LLMs, and RAG (retrieval-augmented generation) pipelines.
* Geospatial data processing (H3, PostGIS, or similar).
* Contributing to data engineering documentation, best practices, or design patterns.
* NoSQL databases (DynamoDB, etc.).
* Excellent written and verbal communication skills with both technical and non-technical audiences.
* Linux operating systems.
* Agile/Scrum development methodologies in a fast-paced, collaborative team environment.
*…