×
Register Here to Apply for Jobs or Post Jobs. X

Lead Test Engineer - Server & Storage Systems; NVMe, SATA, SSD, HDD Onsite

Job in Richardson, Dallas County, Texas, 75080, USA
Listing for: Confidential Company
Full Time position
Listed on 2025-12-20
Job specializations:
  • IT/Tech
    Systems Engineer, Hardware Engineer, Data Engineer, AI Engineer
  • Engineering
    Systems Engineer, Hardware Engineer, Data Engineer, AI Engineer
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below
Position: Lead Test Engineer - Server & Storage Systems (NVMe, SATA, SSD, HDD) Onsite

Base Pay Range

$/yr - $/yr

The Senior Lead Test (& Validation) Engineer – Storage & Server Infrastructure Systems will play a pivotal role in the design, development, and execution of comprehensive test strategies for AI data center's storage and server infrastructure (HW + FM + SW).

This leadership position requires deep expertise in enterprise storage systems, server architectures, networking, and a strong understanding of the unique performance and reliability demands of AI/ML workloads. The ideal candidate will be a hands‑on technical leader.

  • Define, develop, and implement comprehensive test plans and strategies for all storage and server hardware, firmware, and software components within the AI Data Center environment.
  • Lead the Test team in designing, executing, and analyzing complex test cases, including functional, performance, reliability, stress, and endurance testing.
  • Design and implement automated test frameworks and scripts using languages like Python, Go, or similar, to improve efficiency and coverage of testing.
  • Conduct in-depth performance analysis and bottleneck identification for storage systems (e.g., NVMe, SSD, HDD arrays, distributed storage, SAN/NAS) and server platforms (e.g., CPU, GPU, memory, PCIe, networking), and OpenBMC interfaces/features.
  • Debug issues related to BMC functionality and its interaction with server hardware.
  • Develop and maintain robust testbeds and infrastructure for continuous integration and validation.
  • Utilize open‑source and commercial test tools relevant to storage, server, and OpenBMC validation.
  • Collaborate closely with hardware design, software development, infrastructure, and AI/ML engineering teams to understand requirements and integrate testing throughout the product lifecycle.
  • Communicate test progress, results, and critical issues effectively to stakeholders, including executive leadership.
  • Develop specialized test methodologies to validate performance and reliability under heavy AI/ML workloads (e.g., large model training, inference at scale, data ingestion).
  • Understand and test the interactions between GPU
    -accelerated computing, high‑speed networking, and storage systems.
Requirements
  • BS with 8+ years of hands‑on hardware VALIDATION and platform TEST engineering experience with direct exposure to AI data center Server & Storage components including NVMe, SATA, SSDs, HDDs, DIMMS, and system‑level platforms used in large‑scale cloud environments.
  • Need someone that is firmly rooted in hardware and firmware validation.
  • Must have 2+ years of experience in a lead or senior technical role, leading test initiatives, assigning and guiding junior test engineers.
  • Must be very hands‑on with NVMe, SATA, SSDs, HDDs, DIMMS.
  • Great interpersonal skills & English communication skills, with the ability to collaborate effectively across diverse teams and with vendors and customers.
  • Strong in debugging server hardware (BMC, PCIe, networking).
  • Strong in AI/ML workload optimization (Tensor Flow, PyTorch) and their infrastructure requirements.
  • Strong Linux and Python/GO automation, and strong performance analysis of storage/server platforms.
  • Familiarity with OCP (Open Compute Project).
  • Certifications in relevant technologies (e.g., Net App, Dell EMC, HPE, NVIDIA). Distributed storage validation.
  • Contribute to platform firmware validation testing, BIOS bring up.
  • Must work onsite 4 days a week in Richardson, TX.
Seniority Level

Mid‑Senior Level

Employment Type

Full‑time

Job Function

Information Technology

Industries

Technology, Information and Media

Location

Richardson, TX (on‑site, 4 days a week)

#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary