×
Register Here to Apply for Jobs or Post Jobs. X

Platform Engineer – Reliability & Scale at LangChain – San Francisco, CA

Job in San Francisco, San Francisco County, California, 94199, USA
Listing for: Victrays
Full Time position
Listed on 2026-01-01
Job specializations:
  • Software Development
    Cloud Engineer - Software, DevOps
Salary/Wage Range or Industry Benchmark: 145000 - 195000 USD Yearly USD 145000.00 195000.00 YEAR
Job Description & How to Apply Below
Platform Engineer – Reliability & Scale at Lang Chain – San Francisco, CA

About Lang Chain

At Lang Chain, our mission is to make intelligent agents ubiquitous. We help developers build mission‑critical AI applications across the entire agent development lifecycle. Our open source frameworks — Lang Chain and Lang Graph — see over 70+ million downloads per month. Developers rely on Lang Chain for composable integrations and Lang Graph for controllable agent orchestration. Our commercial agent platform, consisting of Lang Smith and Lang Graph Platform, enables teams to build, test, run, and manage agents at scale across their organization.

Founded in 2023, Lang Chain powers top engineering teams at companies like Replit, Lovable, Clay, Klarna, Linked In, and more.

About the role

In person 5 days/week in San Francisco, CA or New York, NY

Join our platform engineering team as we scale Lang Smith and Lang Graph Platform products. You’ll architect and operate the critical systems that power our customers’ AI observability and Lang Graph app deployments, working directly with cutting‑edge technologies at the intersection of AI and distributed systems.

Responsibilities

• Scale critical systems:
Design and implement high throughput data‑intensive systems supporting our flagship SaaS products (Lang Smith and Lang Graph Platform)

• Drive reliability:
Build monitoring, alerting, and automated recovery systems that maintain high uptime

• Solve complex problems:
Debug performance bottlenecks, optimize database queries, and architect solutions for distributed system challenges

• Shape platform strategy:
Influence technical decisions around infrastructure, tooling, and operational practices as we grow from startup to enterprise scale

• Respond to incidents:
Participate in on‑call rotation with focus on post‑incident learning, automation and prevention

How to be successful in this role

Experience

5+ years building and operating production systems at scale

Infrastructure expertise

Deep knowledge of Kubernetes, containerized infrastructure, cloud platforms (e.g. GCP)

Database expertise

Production experience with OSS data stores (Postgre

SQL, Redis, Kafka)

Observability mastery

Hands‑on experience with observability stacks (Datadog, Prometheus/Grafana, Open Telemetry or similar)

Programming proficiency

Strong hands‑on software engineering skills (Python, Go, Rust)

Operational mindset

“You build it, you run it, you own it” philosophy with the focus on sustainable practices

Nice to Have

• Proficiency with analytical databases (e.g. Click House)

• Background in high‑growth startups

• Previous experience in AI/ML infrastructure

Competitive salary and equity stake for role and stage of company. Commensurate with experience.

Annual salary range: $145,000-$195,000 USD for Senior Engineers

#JLjbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary