This role is hands-on, investigative, and deeply technical. You’ll spend time in labs and datacenters validating servers, debugging platform issues, and making sure hardware survives contact with reality before it reaches fleet scale.
What you’ll do
- Bring up new server, storage, and rack hardware from prototype through production qualification.
- Debug failures across CPUs, memory, PCIe, networking, power delivery, and thermal systems.
- Work closely with firmware and board design teams to isolate issues and drive fixes into production.
- Build validation plans that catch problems before thousands of systems are deployed.
- Develop tooling and automation for repeatable hardware testing and diagnostics.
- Help shape future platform designs based on operational lessons from the lab and field.
Requirements
- Strong hardware debugging experience in servers, embedded systems, or infrastructure platforms.
- Comfort working across hardware, firmware, and operating system boundaries.
- Familiarity with lab equipment such as oscilloscopes, logic analyzers, serial consoles, and power instrumentation.
- Ability to methodically investigate complex failures and communicate findings clearly.
Nice to have
- Experience with PCIe, DDR memory systems, NVMe, or high-speed networking.
- Familiarity with BMCs, BIOS/UEFI, or low-level platform firmware.
- Experience working with ODMs or manufacturing partners.
- Scripting or automation experience in Python, Bash, or Go.
