Engineering Manager, ML/Data Engineering - Content Trust
Listed on 2026-02-16
-
IT/Tech
Data Engineer, AI Engineer
Engineering Manager, ML/Data Engineering (Content Trust)
Role at Scribd, Inc.
About The CompanyAt Scribd Inc. (pronounced “scribbed”), our mission is to spark human curiosity. We create a world of stories and knowledge, democratize the exchange of ideas and information, and empower collective expertise through our four products:
Everand, Scribd, Slideshare, and Fable.
This posting reflects an approved, open position within the organization. We support a culture where employees can be real and be bold; where we debate and commit as we embrace plot twists; and where every employee is empowered to take action as we prioritize the customer.
Our flexible work benefit, Scribd Flex, allows employees to choose the daily work‑style that best suits their individual needs. Occasional in‑person attendance is required for all Scribd Inc. employees, regardless of location.
About the Team and RoleThe ML Data Engineering team is the backbone of Scribd’s commitment to a safe and trustworthy library. We build high‑throughput, ML‑driven data pipelines that process hundreds of millions of documents to detect, classify, and mitigate untrustworthy content.
As Manager of ML Data Engineering you will lead a specialized team of engineers responsible for building scalable ML‑based foundations that detect and mitigate harmful content. You’ll design infrastructure that enables ML models to reason across our entire corpus in batch and real‑time, ensuring safety classifiers and automated policy enforcement tools are performant, scalable, and resilient.
You Will- Lead and grow a high‑performing engineering team: manage, mentor, and recruit a world‑class team of data and ML engineers, fostering technical excellence, operational rigor, and deep empathy for the user safety mission.
- Architect scalable ML data pipelines: design and oversee distributed data processing systems capable of handling hundreds of millions of documents, supporting batch and real‑time inference for content moderation and risk detection.
- Build the "Trust" scores: develop and maintain foundational data layers—semantic embeddings, metadata extracts, and behavioral signals—to power our Content Trust ML models.
- Partner on AI/LLM Integration: work closely with Search & Discovery and Applied Research teams to integrate ML/LLM‑based reasoning into our trust pipelines, enabling more nuanced understanding of complex policy violations.
- Drive Operational Excellence: establish SLAs for infrastructure, ensuring automated enforcement systems are fast and explainable.
- Cross‑functional Leadership: collaborate with Product Managers (Content Trust), Legal/Policy teams, and Data Science to translate evolving regulatory requirements (e.g., DSA) into robust technical architectures.
- Leadership
Experience:
8+ years of total engineering experience, with 3+ years in a people‑management or technical lead role within a Data or ML Engineering organization. - Scale Expertise: proven track record of building and operating production‑grade data pipelines at massive scale (100M+ entities) using Spark, Flink, Kafka, or Airflow.
- ML Infrastructure Fluency: deep understanding of the ML lifecycle—feature engineering, model deployment (MLOps), and vector databases (e.g., Pinecone, Milvus, or Weaviate).
- Trust & Safety Context: prior experience building systems for content moderation, fraud detection, spam prevention, or digital rights management.
- Technical Breadth: strong proficiency in Python, Scala, or Go, and experience with cloud‑native infrastructure (AWS/GCP, Kubernetes, and Snowflake/Big Query).
- Strategic Communication: ability to explain complex architectural trade‑offs to non‑technical stakeholders in Legal, Policy, and Product.
- LLM Pipelines: experience building RAG pipelines or managing the data infra for fine‑tuning Large Language Models.
- UGC
Experience:
background working with large‑scale User Generated Content ecosystems and the unique challenges of unstructured document data. - Regulatory Knowledge: familiarity with the technical requirements of global safety regulations such as the Digital Services Act (DSA) or the UK Online Safety Act.
- Adversarial Mindset:…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).