×
Register Here to Apply for Jobs or Post Jobs. X

Supercomputing Engineer; Test

Job in San Jose, Santa Clara County, California, 95199, USA
Listing for: Etched
Full Time position
Listed on 2026-06-18
Job specializations:
  • Engineering
    Systems Engineer, Hardware Engineer
Salary/Wage Range or Industry Benchmark: 200000 - 250000 USD Yearly USD 200000.00 250000.00 YEAR
Job Description & How to Apply Below
Position: Supercomputing Engineer (Test)

Overview

About Etched

Etched is building the world’s first AI inference system purpose-built for transformers - delivering over 10x higher performance and dramatically lower cost and latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep & parallel chain-of-thought reasoning agents. Backed by hundreds of millions from top-tier investors and staffed by leading engineers, Etched is redefining the infrastructure layer for the fastest growing industry in history.

Job Summary:

We are seeking highly motivated and detail-oriented Supercomputing Engineer (Test) to join our team. This team plays a critical role in ensuring the reliability and stability of our highest-performance Inference server hardware and software. As a Software Engineer on this team, you will design, develop, and execute comprehensive burn-in test suites, analyze test results, and collaborate with hardware and software engineering teams at Etched and our ODM partners to identify and resolve potential issues.

You will be at the forefront of ensuring our server products meet the highest quality standards before they reach our customers.

Responsibilities
  • Test Development: Design, develop, and implement automated burn-in test suites using common scripting languages (Python, Go, Bash) and test frameworks across all aspects of System Operation including: boot sequences, root-of-trust, system management, workload deployment and performance.
  • Test Execution: Execute burn-in tests on server hardware, monitor system performance and health, and analyze test results.
  • Failure Analysis: Investigate and debug hardware and software failures identified during testing, providing detailed reports and mitigation plans.
  • Collaboration: Collaborate with internal and external hardware and software engineering teams to identify root causes of failures and implement corrective actions.
  • Test Infrastructure: Contribute to the development and maintenance of the burn-in testing infrastructure, including portable test environments and automation tools runable in any environment.
  • Documentation: Create and maintain comprehensive documentation for test plans, test cases, and test results.
  • Performance Analysis: Analyze system performance metrics to identify potential bottlenecks and areas for optimization.
  • Continuous Improvement: Participate in continuous improvement efforts to enhance the efficiency and effectiveness of the burn-in testing process.
Representative projects
  • Develop automated test suites to stress-testing of CPUs, memory, storage, and network subsystems under extreme workloads.
  • Design and implement fault injection tests to simulate hardware and software failures.
  • Create tools to monitor and analyze system performance metrics, such as CPU utilization, cross-socket memory performance and usage, and network latency.
  • Build and maintain a scalable burn-in testing environment capable of handling multiple server configurations.
  • Collaborate with hardware engineers to develop tests for new server features and components.
  • Contribute to the creation of dashboards that show the current state of burn in testing across the server farm.
You may be a good fit if you have
  • Proficiency in at least one scripting language (e.g., Python, Bash, Go).
  • Experience with software testing methodologies and tools.
  • Strong understanding of operating systems (Linux preferred) and server hardware architectures.
  • Ability to analyze complex technical problems and provide effective solutions.
  • Excellent communication and collaboration skills.
  • Ability to work independently and as part of a team.
  • Experience with version control systems (e.g., Git).
  • Experience with reading and interpreting hardware logs.
Strong candidates may also have
  • Experience with hardware burn-in testing or reliability testing.
  • Knowledge of server virtualization and cloud computing concepts.
  • Experience with performance testing and benchmarking tools.
  • Familiarity with hardware diagnostic tools and techniques.
  • Experience with containerization technologies (e.g., Docker, Kubernetes).
  • Experience with CI/CD pipelines.
  • Knowledge of low level hardware…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary