Principal Software & Architecture Lead
Listed on 2026-06-03
-
Software Development
AI Engineer
About the Role
Teserac is building neuron™, a unified AI-native platform for data center observability, intelligence, and workflow automation. neuron™ processes real-time telemetry from thousands of sensors, meters, and control systems across heterogeneous data center environments, thereby giving infrastructure owners and operators the visibility to monitor, analyze, automate, and proactively manage power operations with full situational awareness. An embedded AI teammate serves as every operator's always-on co-pilot: detecting anomalies, correlating events, and surfacing recommendations 24/7.
As Principal Software & Architecture Lead, you will design and own the core system architecture that makes this possible where you will be building for scale, reliability, and the unique demands of critical infrastructure software. This is a hands-on technical leadership role. You will make foundational decisions about data pipelines, API design, AI/ML infrastructure, and system integrations while writing production code alongside the engineering team.
You will also define how AI coding agents are used safely and effectively, set the engineering quality bar, and help build a high-performance, high-ownership engineering culture.
We move fast. We avoid bureaucracy. We expect engineers to own outcomes, not tickets.
Why You Would Be a Good FitYou are a senior engineer who has built and scaled production systems at startups and you know the difference between architecture that looks good on a whiteboard and architecture that survives real customer load. You move fluidly between hands-on development, technical leadership, and customer-facing solution engineering. You are equally comfortable writing production code, defining system design, and aligning engineering decisions with business outcomes.
You thrive in fast-moving environments where ownership is expected and bureaucracy is minimal.
You likely:
- Have built and scaled distributed systems at startups, from early product through production load, and know the inflection points that break naive architectures.
- Are comfortable owning architecture end-to-end: ingestion → storage → compute → APIs → UI.
- Think in event-driven architectures, clean data models, and durable API contracts.
- Can translate complex customer requirements into clear architectural decisions and interface directly with enterprise technical stakeholders.
- Have production experience deploying AI/ML systems and defining how AI-assisted tools are used safely in daily workflows.
- Lead through technical clarity, writing ADRs, running design reviews, mentoring engineers, while staying hands-on with code.
- Thrive in fast-moving environments where ownership is expected and bureaucracy is minimal.
- Design and own the end-to-end system architecture for neuron™: data ingestion, processing, storage, API layer, AI/ML inference, and front-end delivery.
- Lead the architecture of the full-stack platform (React, Type Script, Django, Postgre
SQL), designing for multi-tenancy, scalability, and resilience. - Architect real-time data pipelines capable of ingesting high-frequency telemetry from BMS, EPMS, SCADA, Modbus, BACnet, and SNMP sources across multiple sites.
- Evolve real-time telemetry ingestion, Web Socket-driven workflows, and shape the future event-driven architecture (Kafka or similar).
- Define API contracts, data models, and integration patterns that enable seamless connectivity with third-party DCIM, BMS, and monitoring platforms.
- Design hybrid cloud + on-prem deployment strategies; evaluate and select technology stack components including databases, message brokers, and compute frameworks.
- Establish architectural standards for security, multi-tenancy, role-based access control, and audit logging appropriate for critical infrastructure environments.
- Lead technical design reviews, produce architecture decision records (ADRs), and maintain system documentation.
- Build systems targeting >99.9% uptime; improve observability, monitoring, and operational tooling.
- Drive incident response rigor and root cause improvement.
- Ensure secure and robust…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).