Staff Software Engineer, Gemini Evals, GenAI, DeepMind
Job
DeepMind
Mountain View, CA (In Person)
$253,500 Salary, Full-Time
Review key factors to help you decide if the role fits your goals.
Pay Growth
?
out of 5
Not enough data
Not enough info to score pay or growth
Job Security
?
out of 5
Not enough data
Calculating job security score...
Total Score
78
out of 100
Average of individual scores
Skill Insights
Compare your current skills to what this opportunity needs—we'll show you what you already have and what could strengthen your application.
Job Description
Staff Software Engineer, Gemini Evals, GenAI, DeepMind corporate_fare DeepMind place Mountain View, CA, USA ; New York, NY, USA info_outline
Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about . Responsibilities Design and optimize distributed evaluation execution engines capable of orchestrating large volumes of inference steps across TPU and Google compute unit (GCU) pools with high throughput and low latency. Build foundational abstractions to evaluate complex LLM agent loops, tool use, and automated LLM-as-a-judge rating systems. Design error classification, automated retry policies, and observability dashboards to maintain strict service level objective (SLOs) for evaluation pipeline success rates. Partner closely with GDM research scientists and Data Science teams to anticipate frontier model evaluation requirements and translate them into elegant infrastructure solutions. Mentor fellow engineers, set high standards for code quality (Python in Google3), and advocate testing and system design practices.
X Note:
By applying to this position you will have an opportunity to your preferred working location from the following: Mountain View, CA, USA; New York, NY, USA .Minimum qualifications:
Bachelor's degree in Computer Science, Electrical Engineering, or a related technical field or equivalent practical experience. 8 years of experience in software development.Preferred qualifications:
Experience in designing, building, and maintaining high-performance distributed systems or processing pipelines. Experience leading architectural migrations or cross-team infrastructure projects. Proficiency in Python. About the job Artificial intelligence will be one of humanity's most transformative inventions. At Google DeepMind, we are a pioneering AI lab with exceptional interdisciplinary teams focused on advancing AI development to solve complex global challenges and accelerate high-quality product innovation for billions of users. We use our technologies for widespread public benefit and scientific discovery, ensuring safety and ethics are always our highest priority. We are pushing the boundaries across multiple domains. Our global teams offer learning opportunities and varied career pathways for those driven to achieve exceptional results through collective effort. The US base salary range for this full-time position is $207,000-$300,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can more about the specific salary range for your preferred location during the hiring process.Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about . Responsibilities Design and optimize distributed evaluation execution engines capable of orchestrating large volumes of inference steps across TPU and Google compute unit (GCU) pools with high throughput and low latency. Build foundational abstractions to evaluate complex LLM agent loops, tool use, and automated LLM-as-a-judge rating systems. Design error classification, automated retry policies, and observability dashboards to maintain strict service level objective (SLOs) for evaluation pipeline success rates. Partner closely with GDM research scientists and Data Science teams to anticipate frontier model evaluation requirements and translate them into elegant infrastructure solutions. Mentor fellow engineers, set high standards for code quality (Python in Google3), and advocate testing and system design practices.
Similar jobs in Mountain View, CA
DeepMind
Mountain View, CA
Posted1 day ago
Updated1 hour ago
YouTube
Mountain View, CA
Posted1 day ago
Updated1 hour ago
Similar jobs in California
Alameda County Office of Education
Hayward, CA
Posted14 hours ago
Updated1 hour ago