×
Register Here to Apply for Jobs or Post Jobs. X

Engineering Manager, ML​/Data Engineering - Content Trust

Job in Dallas, Dallas County, Texas, 75215, USA
Listing for: Scribd, Inc.
Full Time position
Listed on 2026-02-16
Job specializations:
  • IT/Tech
    Data Engineer, AI Engineer
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below

Engineering Manager, ML/Data Engineering (Content Trust)

Role at Scribd, Inc.

About The Company

At Scribd Inc. (pronounced “scribbed”), our mission is to spark human curiosity. We create a world of stories and knowledge, democratize the exchange of ideas and information, and empower collective expertise through our four products:
Everand, Scribd, Slideshare, and Fable.

This posting reflects an approved, open position within the organization. We support a culture where employees can be real and be bold; where we debate and commit as we embrace plot twists; and where every employee is empowered to take action as we prioritize the customer.

Our flexible work benefit, Scribd Flex, allows employees to choose the daily work‑style that best suits their individual needs. Occasional in‑person attendance is required for all Scribd Inc. employees, regardless of location.

About the Team and Role

The ML Data Engineering team is the backbone of Scribd’s commitment to a safe and trustworthy library. We build high‑throughput, ML‑driven data pipelines that process hundreds of millions of documents to detect, classify, and mitigate untrustworthy content.

As Manager of ML Data Engineering you will lead a specialized team of engineers responsible for building scalable ML‑based foundations that detect and mitigate harmful content. You’ll design infrastructure that enables ML models to reason across our entire corpus in batch and real‑time, ensuring safety classifiers and automated policy enforcement tools are performant, scalable, and resilient.

You Will
  • Lead and grow a high‑performing engineering team: manage, mentor, and recruit a world‑class team of data and ML engineers, fostering technical excellence, operational rigor, and deep empathy for the user safety mission.
  • Architect scalable ML data pipelines: design and oversee distributed data processing systems capable of handling hundreds of millions of documents, supporting batch and real‑time inference for content moderation and risk detection.
  • Build the "Trust" scores: develop and maintain foundational data layers—semantic embeddings, metadata extracts, and behavioral signals—to power our Content Trust ML models.
  • Partner on AI/LLM Integration: work closely with Search & Discovery and Applied Research teams to integrate ML/LLM‑based reasoning into our trust pipelines, enabling more nuanced understanding of complex policy violations.
  • Drive Operational Excellence: establish SLAs for infrastructure, ensuring automated enforcement systems are fast and explainable.
  • Cross‑functional Leadership: collaborate with Product Managers (Content Trust), Legal/Policy teams, and Data Science to translate evolving regulatory requirements (e.g., DSA) into robust technical architectures.
You Have
  • Leadership

    Experience:

    8+ years of total engineering experience, with 3+ years in a people‑management or technical lead role within a Data or ML Engineering organization.
  • Scale Expertise: proven track record of building and operating production‑grade data pipelines at massive scale (100M+ entities) using Spark, Flink, Kafka, or Airflow.
  • ML Infrastructure Fluency: deep understanding of the ML lifecycle—feature engineering, model deployment (MLOps), and vector databases (e.g., Pinecone, Milvus, or Weaviate).
  • Trust & Safety Context: prior experience building systems for content moderation, fraud detection, spam prevention, or digital rights management.
  • Technical Breadth: strong proficiency in Python, Scala, or Go, and experience with cloud‑native infrastructure (AWS/GCP, Kubernetes, and Snowflake/Big Query).
  • Strategic Communication: ability to explain complex architectural trade‑offs to non‑technical stakeholders in Legal, Policy, and Product.
Ideally, you have (Bonus Points)
  • LLM Pipelines: experience building RAG pipelines or managing the data infra for fine‑tuning Large Language Models.
  • UGC

    Experience:

    background working with large‑scale User Generated Content ecosystems and the unique challenges of unstructured document data.
  • Regulatory Knowledge: familiarity with the technical requirements of global safety regulations such as the Digital Services Act (DSA) or the UK Online Safety Act.
  • Adversarial Mindset:…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary