×
Register Here to Apply for Jobs or Post Jobs. X

Software Engineer, Cloud Infrastructure; Seniority Levels

Job in San Carlos, San Mateo County, California, 94071, USA
Listing for: AI Chopping Block, Inc.
Full Time position
Listed on 2026-06-05
Job specializations:
  • Software Development
    AI Engineer, Machine Learning/ ML Engineer
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below
Position: Software Engineer, Cloud Infrastructure (Multiple Seniority Levels)

About Beacon AI

We’re a fast-moving team of aviators, engineers, and operators building an AI platform to make flying safer, more efficient, and more capable. Backed by top investors, we’ve secured a dozen Department of Defense contracts and partnered with major airlines to deliver mission-critical systems. We operate without silos or heavy processes. Small, focused teams own what they build, ship quickly, and learn fast, pushing the boundaries of how humans and AI work together in aviation.

Role Overview

We are seeking skilled Cloud and ML Infrastructure Engineers to lead the buildout of our AWS foundation and our LLM platform. You will design, implement, and operate services that are scalable, reliable, and secure.

The broad scope means focus areas in LLM/ML Infra and IoT infra are strong bonus points. For ML Infra, build the stack that powers retrieval-augmented generation and application workflows built with frameworks like Lang Chain. Experience with IoT AWS services is a plus.

You will work closely with other engineers and product management. The ideal candidate is hands‑on, comfortable with ambiguity, and excited to build from first principles.

Key Responsibilities
  • Cloud Infrastructure Setup and Maintenance

    • Design, provision, and maintain AWS infrastructure using IaC tools such as AWS CDK or Terraform.

    • Build CI/CD and testing for apps, infra, and ML pipelines using Git Hub Actions, Code Build, and Code Pipeline.

    • Operate secure networking with VPCs, Private Link, and VPC endpoints. Manage IAM, KMS, Secrets Manager, and audit logging.

  • LLM Platform and Runtime

    • Stand up and operate model endpoints using AWS Bedrock and/or Sage Maker; evaluate when to use ECS/EKS, Lambda, or Batch for inference jobs.

    • Build and maintain application services that call LLMs through clean APIs, with streaming, batching, and backoff strategies.

    • Implement prompt and tool execution flows with Lang Chain or similar, including agent tools and function calling.

  • RAG Data Systems and Vector Search

    • Design chunking and embedding pipelines for documents, time series, and multimedia. Orchestrate with Step Functions or Airflow.

    • Operate vector search using Open Search Serverless, Aurora Postgre

      SQL with pgvector, or Pinecone. Tune recall, latency, and cost.

    • Build and maintain knowledge bases and data syncs from S3, Aurora, Dynamo

      DB, and external sources.

  • Evaluation, Observability, and Cost Governance

    • Create offline and online eval harnesses for prompts, retrievers, and chains. Track quality, latency, and regression risk.

    • Instrument model and app telemetry with Cloud Watch and Open Telemetry. Build token usage and cost dashboards with budgets and alerts.

    • Add guardrails, rate limits, fallbacks, and provider routing for resilience.

  • Safety, Privacy, and Compliance

    • Implement PII detection and redaction, access controls, content filters, and human‑in‑the‑loop review where needed.

    • Use Bedrock Guardrails or policy services to enforce safety standards. Maintain audit trails for regulated environments.

  • Data Pipeline Construction

    • Build ingestion and processing pipelines for structured, unstructured, and multimedia data. Ensure integrity, lineage, and cataloging with Glue and Lake Formation.

    • Optimize bulk data movement and storage in S3, Glacier, and tiered storage. Use Athena for ad‑hoc analysis.

  • IoT Deployment Management

    • Manage infrastructure that deploys to and communicates with edge devices. Support secure messaging, identity, and over‑the‑air updates.

  • Analytics and Application Support

    • Partner with product and application teams to integrate retrieval services, embeddings, and LLM chains into user‑facing features.

    • Provide expert troubleshooting for cloud and ML services with an emphasis on uptime and performance.

  • Performance Optimization

    • Tune retrieval quality, context window use, and caching with Redis or Bedrock Knowledge Bases.

    • Optimize inference with model selection, quantization where applicable, GPU/CPU instance choices, and autoscaling strategies.

What Will Make You Successful
  • End‑to‑End Ownership: Drives work from design through production, including on‑call and continuous improvement.

  • LLM Systems

    Experience:

    Shipped or operated LLM‑powered applications in…

Position Requirements
10+ Years work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary