Software Engineer, Data Infrastructure & Acquisition
Software Engineer, Data Infrastructure & Acquisition - Toronto, Canada
The mission of Speechify is to make sure that reading is never a barrier to learning.
Over 50 million people use Speechify’s text‑to‑speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more and remember more. Speechify’s text‑to‑speech reading products include its iOS, Android, Mac apps, Chrome extension and web app. Google recently named Speechify the Chrome Extension of the Year and Apple the 2025 Design Award winner for Inclusivity.
Today, nearly 200 people around the globe work on Speechify in a 100% distributed setting – no office. The team includes frontend and backend engineers, AI research scientists and others from Amazon, Microsoft, Google, Stanford, Stripe, Vercel, Bolt and many startup founders.
OverviewThe role focuses on the data side of Speechify’s AI team and is responsible for all aspects of data collection to support model training operations. We build high‑quality datasets at petabyte‑scale and low cost through tight integration of infrastructure, engineering and research work.
Key responsibilities include:
- Scrape and ingest new sources of audio data into our ingestion pipeline.
- Operate and extend the cloud infrastructure for our ingestion pipeline, currently running on GCP and managed with Terraform.
- Collaborate closely with scientists to move the cost, throughput and quality frontier, delivering richer data at larger scale and lower cost to power next‑generation models.
- Collaborate with the AI team and Speechify leadership to craft the dataset roadmap for the next‑generation consumer and enterprise products.
- BS/MS/PhD in Computer Science or a related field.
- 5+ years of industry experience in software development.
- Proficiency with bash/Python scripting in Linux environments.
- Experience with Docker and Infrastructure‑as‑Code concepts and at least one major Cloud provider (preferably GCP).
- Experience with web crawlers and large‑scale data processing workflows (a plus).
- Ability to handle multiple tasks and adapt to changing priorities.
- Strong written and verbal communication skills.
- A fast‑growing environment where you shape the company and product.
- An entrepreneurial‑mindset team that encourages risk, intuition and hustle.
- A hands‑off management approach that lets you focus.
- An opportunity to make a big impact in a transformative industry.
- Competitive salaries, a friendly and laid‑back atmosphere and a commitment to building an asynchronous culture.
- An chance to work on a life‑changing product used by millions.
- Build products that directly support people with learning differences such as dyslexia, ADD, low vision, autism and more.
- Work in one of the fastest‑growing sectors of tech – the intersection of AI and audio.
Tell us why you’re interested in this role and what your portfolio and Linked In look like.
Speechify is committed to a diverse and inclusive workplaceSpeechify does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age or other legally protected status.
#J-18808-LjbffrTo Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search: