Senior NLP Data Engineer
Listed on 2025-12-21
IT/Tech
Data Engineer, AI Engineer, Machine Learning/ ML Engineer, Data Scientist
We offer a flexible working policy that supports a healthy balance between personal and professional well‑being. This role requires in‑office presence on Tuesdays & Thursdays to collaborate, connect, and learn from peers — while also maintaining the flexibility for meaningful work‑life balance.
Being a Senior NLP Data Engineer at iManage Means
You're passionate about transforming unstructured text into meaningful insights that power AI and machine learning solutions. You thrive at the intersection of data engineering, AI, and natural language processing, building the pipelines and datasets that fuel generative AI applications, agentic systems, advanced model fine‑tuning, and other NLP‑driven capabilities across iManage.
As an NLP Data Engineer on the Applied AI team, you will design, build, and optimize large‑scale text data pipelines that power AI/ML and Generative AI solutions for our customers. You’ll work with knowledge engineering, applied AI, and product teams to prepare, enrich, and integrate document data. Your work will be essential to enabling intelligent, AI‑powered features across the iManage platform.
iM Responsible For
- Designing, developing, and maintaining scalable pipelines in Microsoft Azure to ingest and transform large volumes of text data from multiple sources
- Designing automated workflows for text normalization, deduplication, language identification, PII redaction and metadata enrichment
- Building automated data validation processes to ensure accuracy and consistency
- Supporting model fine‑tuning, semantic search and Gen AI evaluation tuning through dataset curation, prompt dataset preparation, labeling coordination, and text quality validation
- Partnering with the Applied AI team to gather data requirements and build data interfaces for developing and maintaining machine learning systems
- Maintaining data lineage and following data privacy, security and governance best practices
- Implementing data versioning and lineage tracking for machine learning experiments
- A Bachelor’s degree or higher in Computer Science, Data Engineering, Applied Mathematics, Computational Linguistics, or a related quantitative field.
- 4+ years of data engineering experience, with at least 2 years working with unstructured data in a business setting.
- Strong proficiency in Python, PySpark, and data manipulation for large unstructured text datasets.
- Strong understanding of NLP concepts such as tokenization, embeddings, and semantic search, and experience with standard text libraries such as spaCy, Hugging Face Datasets, and NLTK.
- Solid DataOps knowledge and experience orchestrating advanced NLP data pipelines using cloud‑based data infrastructure.
- Proficiency with Git and collaborative development frameworks.
- A passion for enabling AI capabilities through scalable, reliable data architecture.
- Problem solving, creativity, curiosity, and a collaborative mindset.
- Exposure to Microsoft Azure Services such as Fabric, ADLS, AI Foundry, Azure ML, MLflow
- Experience with knowledge graph implementation for NLP applications
- Experience working with data for the legal domain
- Experience designing architectures for large‑scale text corpora
Don't meet every qualification listed above? Studies show that women and people of color are less likely to apply to jobs unless they meet all qualifications. At iManage, we are committed to building a diverse and inclusive environment, and we encourage everyone to show up as their full authentic selves. We welcome those who come with a growth mindset and a hunger for learning, so if you are excited about this role but your past experience doesn't align perfectly with every qualification, we encourage you to apply anyway!
- Join a supportive, experienced team with an inclusive, encouraging, and vibrant culture.
- Have flexible work hours that allow me to balance my ‘me time’ with my work commitments.
- Collaborate in a modern open plan workspace, with a gaming area, free snacks, drinks and regular social events.
- Focus on impactful work, solving complex, real challenges utilizing the latest technologies and protocols.
- Own my career path with our internal development…