Tallo logoTallo logo

HPC/AI Data Center Technician

Job

Luminhouse Ltd

Spring, TX (In Person)

$88,400 Salary, Full-Time

Posted 1 day ago (Updated 3 hours ago) • Actively hiring

Expires 6/18/2026

Apply for this opportunity

This job application is on an outside website. Be sure to review the job posting there to verify it's the same.

Review key factors to help you decide if the role fits your goals.
Pay Growth
?
out of 5
Not enough data
Not enough info to score pay or growth
Job Security
?
out of 5
Not enough data
Calculating job security score...
Total Score
57
out of 100
Average of individual scores

Were these scores useful?

Skill Insights

Compare your current skills to what this opportunity needs—we'll show you what you already have and what could strengthen your application.

Job Description

HPC/AI Data Center Technician Department:
Data Center Operations Reports to: Data Center Operations Manager /
Site Lead Location:
On-site (site-specific)
Job Type:
Full-time Schedule:
12-hour shifts, including nights, weekends, holidays, and on-call rotation
Pay:
$40.00 - $45.00 per hour, depending on experience and skills
Overtime:
Paid at 1.5× regular rate for hours worked beyond 40/week, in accordance with
FLSA Travel:
Up to 25-50% for multi-site support, as required About the Role We are hiring an experienced, hands-on Data Center Technician to support the day-to-day deployment and operation of
HPC / AI / GPU
infrastructure inside our facilities. This is a floor-level operations role, not a design or engineering position. You will spend most of your shift on the data center floor — racking equipment, running and terminating cables, troubleshooting hardware, swapping components, and supporting liquid-cooling and power infrastructure. We need someone with real, verifiable, multi-year hands-on experience who can show up on day one and work independently with minimal supervision. You will report to the site lead and serve as the on-site Tier 2 escalation point for network, cabling, and hardware issues. You will work closely with remote NetOps and customer engineering teams via ticketing systems. Key Responsibilities Hardware Deployment & Rack and Stack Unbox, rack, stack, cable, and power on GPU servers, storage arrays, network switches, routers, firewalls, CDUs, and PDUs per rack elevations, cable schedules, and BOMs Validate physical placement, serial numbers, and asset tags Configure basic OS and out-of-band management (BMC / iLO / iDRAC) Perform firmware updates and component replacements (CPU, memory, GPUs, NICs, HBAs, drives, power supplies) Structured Cabling & Fiber Infrastructure Install, terminate, label, and dress copper (Cat6 / 6A) and fiber (SM / MM) cabling Work with 100G / 400G / 800G transceivers, AOC / DAC, and
MPO / MTP
assemblies Use VFL, optical power meter, OTDR, and fiber scope for fault location, cleaning, and repair Verify LLDP neighbors, link status, and optical power levels Maintain
TIA-942 / BICSI
standards and cable management discipline Network Hardware Support Deploy and maintain Arista, Juniper, Cisco, and SONiC switches and routers Provide Tier 2 physical-layer troubleshooting and post-repair validation Assist Network Operations and automation teams with on-site diagnostic tasks Support Tier 1 technicians on network triage Power & Liquid Cooling Install, commission, and maintain Direct Liquid Cooling (DLC) systems, CDUs, heat exchangers, manifolds, piping, pumps, and valves Perform pressure testing, leak detection, water-quality monitoring, flow validation, and sensor calibration Support chilled-water / HVAC systems and
PDU / UPS / RPP
power infrastructure Read and interpret P&ID drawings and mechanical schematics Strictly follow OSHA, LOTO, and hot-work safety procedures
GPU / AI
Cluster Operations Support deployment, burn-in, and ongoing operations of high-density GPU clusters (H100 / H200 / B200 / GB200 and similar platforms) Troubleshoot
GPU, NIC
(InfiniBand / RoCE / NVLink), and interconnect hardware issues Execute node isolation and full-node replacement per customer SLA Break/Fix & Preventive Maintenance Respond to hardware alerts, diagnose root cause, replace components, execute RMA, and perform full-system swaps Execute scheduled preventive maintenance: cleaning, filter changes, health checks Participate in 7×24 on-call rotation, including nights, weekends, and emergency response Smart Hands & Remote Collaboration Act as on-site eyes and hands for remote engineering and customer teams Execute precise instructions via ServiceNow, Jira, or Zendesk Provide clear technical updates and escalation summaries Logistics, Inventory & Asset Management Manage receiving, inbound inspection, put-away, outbound fulfillment, cycle counts, and RMA returns Maintain accurate asset records, 5S warehouse standards, and ESD protection Documentation, Compliance & Safety Document all work in DCIM systems, runbooks, SOPs, and internal wikis Ensure full traceability through change-management processes Follow OSHA, ESD, PPE, and site security policies at all times Required Qualifications High School Diploma or GED 2+ years of hands-on data center operations experience 2+ years of server, storage, and network hardware installation experience 2+ years of structured and data-center cabling experience (copper + fiber) Working knowledge of Layer 2 / 3 networking concepts (OSI, TCP/IP, VLAN) Proficiency with Linux command-line basics Proficiency with ticketing systems (ServiceNow / Jira / Zendesk) Ability to read and interpret rack elevations, cable schedules, and mechanical drawings Clear English written and verbal communication Valid driver's license Legal authorization to work in the United States Preferred Qualifications Certifications CompTIA Network+ / Server+ CCNA (or equivalent JNCIA / Arista ACE) BICSI Installer 1 / 2 or
RCDD OSHA 10 / 30
Vendor certifications (Dell, Lenovo, HPE, NVIDIA, Supermicro) Technical Experience
HPC / GPU
cluster deployment in hyperscale or AI environments 100G / 400G / 800G optics and OTDR-based fiber troubleshooting Direct Liquid Cooling (DLC), CDU, and chilled-water system installation DCIM platforms (Nlyte, Sunbird, Device42) Arista / Juniper / Cisco / SONiC CLI InfiniBand / RoCE / NVLink diagnostics Other Mandarin Chinese language skills are a plus for sites supporting Chinese-speaking customers or teams Physical & Work Environment Requirements Ability to lift 50-70 lbs (23-32 kg) independently and repeatedly Comfortable working on ladders, in confined spaces, and for extended periods standing, bending, and kneeling Comfortable working in loud, cold, high-airflow data center environments Willing to work 12-hour shifts, nights, weekends, holidays, and on-call rotations Compensation & Benefits Item Details Hourly Rate $40.00 - $45.00 / hour Overtime 1.5× regular rate beyond 40 hrs/week (FLSA) Shift Differential Available for nights, weekends, and holidays Health Medical, dental, vision, FSA / HSA Retirement 401(k) with company match Time Off Paid vacation, holidays, and sick leave Other Long-term disability, EAP, training and certification reimbursement How to Apply Submit your resume directly through Indeed. Please make sure your resume clearly lists: Years of hands-on data center experience Specific equipment, vendors, and cable plant types you have worked with Any GPU cluster, liquid cooling, or high-speed optics experience Certifications held
Pay:
$40.00 - $45.00 per hour
Benefits:
401(k) Dental insurance Flexible schedule Health insurance Life insurance Paid time off Relocation assistance Retirement plan Vision insurance
Education:
Associate (Required)
Experience:
data center operations: 2 years (Required)
Language:
Mandarin (Preferred) Shift availability: Day Shift (Required) Night Shift (Required) Overnight Shift (Required)
Work Location:
In person

Similar remote jobs

Similar jobs in Spring, TX

Similar jobs in Texas