Tallo logoTallo logo

Performance Architect

Job

Compunnel, Inc.

Milpitas, CA (In Person)

Full-Time

Posted 2 days ago (Updated 44 minutes ago) • Actively hiring

Expires 6/15/2026

Apply for this opportunity

This job application is on an outside website. Be sure to review the job posting there to verify it's the same.

Review key factors to help you decide if the role fits your goals.
Pay Growth
?
out of 5
Not enough data
Not enough info to score pay or growth
Job Security
?
out of 5
Not enough data
Calculating job security score...
Total Score
77
out of 100
Average of individual scores

Were these scores useful?

Skill Insights

Compare your current skills to what this opportunity needs—we'll show you what you already have and what could strengthen your application.

Job Description

Job Summary T he Performance Architect develops advanced AI storage solutions through innovative system architectures and complex simulation models for Client next-generation products. This role involves designing, programming, debugging, and modifying simulation models to evaluate architectural changes, while assessing performance, power, and endurance. The architect will collaborate with engineering teams to address complex challenges, drive innovation, and shape the future of data-centric architectures. Key Responsibilities B uild SystemC performance models for AI storage solutions, covering end-to-end components such as GPU/TPU/NPU/xPU, host interfaces, memory hierarchies, base die controllers, and packaging technologies. Improve
AI/ML ASIC
architecture performance through hardware/software co-optimization, post-silicon performance analysis, and strategic roadmap influence. Conduct workload analysis and characterization of ASICs and competitive AI/datacenter solutions to identify performance improvement opportunities. Collaborate with architecture teams to resolve performance issues and optimize datacenter technologies for efficiency and TCO. Model and optimize components of AI/ML accelerator ASICs, including PCIe/UCIe/CXL, NoC, DMA, firmware interactions, NAND, fabrics, and xPU. Perform performance modeling and optimization for large-scale LLM training/inference, including Dense and Mixture of Experts (MoE) architectures across multiple modalities. Develop and optimize parallelization strategies across tensor, pipeline, context, expert, and data parallel dimensions. Architect memory-efficient training systems using techniques such as structured pruning, quantization, continuous batching, speculative decoding, and KV cache optimization. Incorporate and extend state-of-the-art models (e.g., GPT-4, Deepseek-R1) and multi-modal architectures. Collaborate with internal and external stakeholders to disseminate results and iterate rap idly. Required Qualifications B achelor's, Master's, or Ph.D. in Computer/Electrical Engineering. 5+ years of experience in performance modeling, simulation, and analysis using SystemC. Strong background in computer/graphics architecture, ML, and LLMs. Hands-on experience with SystemC/TLM simulation, behavioral modeling, and performance analysis. Preferred Qualifications (if any) Exp erience with storage systems, protocols, and NAND flash. Deep expertise in optimizing large-scale ML systems and GPU architectures. Proven technical leadership in GPU performance and workload analysis. Knowledge of transformer architectures, attention mechanisms, and model parallelism techniques. Experience with GPU/TPU microarchitecture and distributed training systems. Proficiency in PyTorch, CUDA, TensorRT, OpenAI Triton, ONNX, and distributed frameworks (Ray, Megatron-LM). Familiarity with performance analysis tools (NSight Compute, nvprof, PyTorch Profiler). Background in IO subsystem microarchitecture and protocols (NVMe, PCIe, UCIe, CXL, NVLink). Experience with datacenter workload analysis, multi-core systems, and multi-threa d interactions. Certifications (if any) R elevant certifications in performance engineering, AI/ML, or hardware architecture (preferred but not required).

Similar remote jobs

Similar jobs in Milpitas, CA

Similar jobs in California