Senior Data Architect/Data Engineer (f/m/d) – Focus on NLP
Listed on 2026-02-20
IT/Tech
Data Engineer, AI Engineer, Data Science Manager, Data Analyst
Location: Germany
About us
With deep expertise in Data Engineering, Data Science, and Machine Learning, we help our clients unlock the full potential of their data. statworx is a leading consulting and development company for data and AI, based in Frankfurt am Main. We offer strategic consulting for medium‑sized businesses and global corporations. We develop innovative data and AI solutions across all business areas and corporate functions.
We empower people at all levels of expertise with our data and AI education formats.
In short: we support companies in all aspects of digital transformation – for over 10 years, in more than 1,000 data and AI projects, and for over 100 clients from almost all industries. Our AI Development department acts as a catalyst for data and AI transformation. We take a holistic approach that spans the entire journey, from assessing AI maturity to designing, developing, and scaling end‑to‑end data and AI solutions.
Tasks
- Combine classical data engineering with modern NLP approaches – particularly in the context of Large Language Models (LLMs), embeddings, knowledge graphs, Retrieval‑Augmented Generation (RAG), and text‑to‑SQL applications
- Design, develop, and operate modern data architectures that form the foundation for advanced NLP applications – from knowledge management systems and semantic search solutions to RAG use cases
- Work closely with our clients to understand their business requirements and data processes, and translate them into tailored, scalable data and AI solutions
- Implement scalable data pipelines and infrastructures to efficiently provide, transform, and version large volumes of structured and unstructured data
- Ensure data quality, security, and governance along the entire value chain, and establish best practices for handling sensitive data in AI projects
- Build and operate scalable data infrastructures in cloud environments, and automate deployments and monitoring systems to ensure reliability and availability
- Provide strategic advice to clients and internal teams on data architecture, technologies, tools, and best practices, acting as a trusted advisor
- Support and mentor junior colleagues, share your knowledge within the team, and contribute to the development of statworx’s data engineering community through workshops, blog posts, or internal talks
Your profile
- You hold a Master’s degree in (Business) Informatics, Computer Science, or a related field
- You have at least five years of relevant professional experience in data engineering or data architecture
- You have a strong understanding of modern data architectures (Data Lakes, Lakehouses, Data Warehouses) and are experienced in ETL/ELT processes and data modeling
- Ideally, you have experience building data infrastructures for NLP applications – especially in the context of LLMs, Retrieval‑Augmented Generation (RAG), semantic layers, and knowledge graphs
- Hands‑on experience with text‑to‑SQL systems or developing interfaces between natural language and databases is a plus
- You are experienced with cloud platforms (Azure, AWS, or GCP) and data platforms such as Databricks or Snowflake
- You are familiar with Infrastructure‑as‑Code (e.g., Terraform, Pulumi) and CI/CD workflows (e.g., GitHub Actions, GitLab CI, Azure DevOps)
- You have excellent programming skills in Python, SQL, and Bash/Shell, and you write clean, efficient, and maintainable code
- You understand the importance of data governance, security, and privacy (e.g., GDPR) and incorporate these principles into your architectural design
- You combine strong analytical thinking with the ability to translate business requirements into technical solutions and communicate effectively with stakeholders at all levels
- You are fluent in English (written and spoken) and have advanced German skills — or are willing to actively improve them
What we offer
- Data & AI consulting as our core business: Work on exciting projects with leading clients – from cutting‑edge NLP use cases to complex data science and machine learning solutions
- Depth and diversity: Engage with challenging, multifaceted problems and continuously expand your expertise in data science, machine learning, and AI
- Continuous…