Tallo logoTallo logo

HPC Consultant

Job

Cynet Systems

Fremont, CA (In Person)

$113,360 Salary, Full-Time

Posted 3 days ago (Updated 13 hours ago) • Actively hiring

Expires 6/13/2026

Apply for this opportunity

This job application is on an outside website. Be sure to review the job posting there to verify it's the same.

Review key factors to help you decide if the role fits your goals.
Pay Growth
?
out of 5
Not enough data
Not enough info to score pay or growth
Job Security
?
out of 5
Not enough data
Calculating job security score...
Total Score
73
out of 100
Average of individual scores

Were these scores useful?

Skill Insights

Compare your current skills to what this opportunity needs—we'll show you what you already have and what could strengthen your application.

Job Description

We are looking for HPC Consultant for our client in Fremont, CA /
Tualatin, OR Job Title:
HPC Consultant Job Location:
Fremont, CA /
Tualatin, OR Job Type:
Contract Job Overview:
Pay Range:
$52hr - $57hr Responsible for designing, optimizing, and supporting high-performance computing (HPC) environments including cluster scheduling, storage performance, application optimization, and system tuning. The role involves improving workload efficiency, supporting HPC applications, and ensuring optimal performance across compute, storage, and network layers in large-scale production environments.
Requirement/Must Have:
Eight to twelve years of hands-on HPC engineering experience in production environments. Strong expertise in SLURM configuration, tuning, and troubleshooting. Strong knowledge of Linux operating systems. Experience with HPC storage systems and I/O performance analysis. Experience building, installing, and optimizing HPC applications and scientific software stacks. Experience with MPI, OpenMP, and HPC toolchains. Strong scripting skills in Bash and Python. Experience with performance analysis and debugging tools. Strong understanding of HPC system architecture and workload optimization.
Experience:
Experience designing and tuning HPC cluster scheduling policies including fair-share, backfill, and reservations. Experience in HPC storage benchmarking using tools such as IOR, FIO, MDTest, and IOzone. Experience analyzing I/O patterns and mapping workloads to storage architectures. Experience supporting application optimization using compilers and libraries. Experience in system-level performance tuning across compute, storage, and network layers. Experience supporting cluster upgrades, expansions, and hardware refresh activities.
Responsibilities:
Design, configure, tune, and optimize SLURM partitions, queues, QoS, and scheduling policies. Analyze job scheduling behavior, bottlenecks, and resource contention issues. Troubleshoot job failures and performance degradation in HPC environments. Implement scheduling policies such as fair-share, backfill, and reservations. Lead HPC storage benchmarking and performance validation activities. Analyze HPC workload I/O patterns and recommend storage architectures. Support storage procurement decisions including performance and sizing analysis. Collaborate with vendors and internal teams during proof-of-concept evaluations. Build, configure, and maintain HPC applications, compilers, and software stacks. Optimize application performance using MPI, OpenMP, and GPU acceleration where applicable. Manage environment modules and software management frameworks. Perform system-level tuning across compute, memory, network, and storage systems. Diagnose and resolve node-level issues involving CPU, GPU, interconnects, and OS configurations. Create runbooks, performance baselines, and troubleshooting documentation. Support cluster upgrades, expansions, and infrastructure lifecycle activities. Collaborate with researchers, application owners, and infrastructure teams. Translate workload requirements into optimized HPC configurations. Provide technical guidance and recommendations to stakeholders and leadership.
Should Have:
Experience with GPU-based HPC workloads (CUDA, ROCm). Exposure to cloud HPC environments (Azure, AWS, Google Cloud Platform). Experience with parallel file systems such as Lustre or IBM Spectrum Scale. Experience working with vendors for HPC hardware and storage evaluations.
Skills:
SLURM scheduling and cluster management. Linux system administration. HPC storage and I/O performance tuning. MPI and OpenMP programming models. HPC compilers and toolchains (GCC, Intel, NVIDIA HPC SDK). Performance analysis tools. Python and Bash scripting. Environment modules (Lmod). HPC system architecture and optimization. GPU computing (preferred).
Qualification And Education:
Bachelor s or Master s degree in Computer Science, Engineering, or related field preferred.

Similar remote jobs

Similar jobs in Fremont, CA

Similar jobs in California