Site Reliability Engineer Position Available In Miami-Dade, Florida
Tallo's Job Summary: The Site Reliability Engineer position at IBM involves contributing to HashiCorp's offerings, focusing on automating and securing multi-cloud and hybrid environments. The role includes managing infrastructure lifecycle, enhancing cloud solutions, and ensuring efficiency and scalability. Responsibilities include evaluating infrastructure technologies, building platform tooling, and collaborating with engineering teams. Candidates need proficiency in Go, experience with cloud infrastructure, and familiarity with microservices architectures. The job is remote within the US. IBM is an equal opportunity employer.
Job Description
- Introduction
- A career in IBM Software means you’ll be part of a team that transforms our customers’ challenges into industry-leading solutions.
We are an infinitely curious team, always seeking new possibilities, and dedicated to creating the world’s leading AI-powered, cloud-native software solutions. Our renowned legacy creates endless global opportunities for our network of IBMers. We are a team of deep product experts, ensuring exceptional client experiences, with a focus on delivery, excellence, and obsession over customer outcomes.
This position involves contributing to HashiCorp’s offerings, now part of IBM, which empower organizations to automate and secure multi-cloud and hybrid environments. You will join a team managing the lifecycle of infrastructure and security, enhancing IBM’s cloud solutions to ensure enterprises achieve efficiency, security, and scalability in their cloud journey.
- Your role and responsibilities
- About the team
The Infrastructure Orchestration team is a core part of HashiCorp’s internal platform infrastructure group.
At the intersection of site reliability engineering, software development, and infrastructure, this team is responsible for building the software that deploys and orchestrates infrastructure underpinning the HashiCorp Cloud Platform. We are working on the next-generation infrastructure platform for internal and external services, developing common tooling and workflows that are low friction and enable teams to get services built and deployed quickly and securely.
We work closely with our sister infrastructure teams, release engineering, developer productivity, site reliability engineering teams, and other internal groups consuming our infrastructure platform. As our group expands, we’re seeking a mid-level software engineer to join our infrastructure team.
Our infrastructure is hosted on AWS (EC2, S3, RDS, ECS) with backing data stores like PostgreSQL. We leverage the HashiStack suite (Terraform, Consul, Nomad, Vault, Packer) and in-house tooling written in Go. We ensure that all infrastructure components we offer to internal teams can be deployed consistently, reliably, and managed in a secure and compliant manner.
If this sounds interesting, we’d love to meet you! We have a large footprint and a quickly growing user base, with many interesting problems and opportunities for growth and development.
What you’ll do (responsibilities)
- Contribute to the research and evaluation of infrastructure technologies to support our Engineering teams, including drafting RFCs and collaborating with senior engineers on technical decisions
- Build, deploy, and support new platform tooling
- Work with teammates to improve and evolve our software engineering practices
- Create tools for automating deployment, monitoring, and operations of the platform
- Improve reliability and performance of internal infrastructure by maintaining, debugging, and optimizing platform components
- Enhance engineering productivity by building tools that automate operations
- Participate in on-call rotations to support the health of our infrastructure and respond to incidents
- Collaborate with the team, contributing to Engineering RFCs that shape the evolution of our internal platform
- Review technical contributions for quality and consistency, and partner with stakeholders and teams to resolve issues and propose technical or architectural changes
This job can be performed from anywhere in the US.
- Required technical and professional expertise
- Proficiency with Go or another modern programming language
- Experience operating AWS, Azure, or Google Cloud infrastructure
- Familiarity with microservices architectures, ideally including experience with microservices operated and actively developed at a global scale
- Comfortable and enthusiastic about adopting the HashiCorp way of building systems, using an infrastructure-as-code (IaC) approach, and taking advantage of immutable infrastructure
- A solid understanding of platform engineering
- Preferred technical and professional experience
- Experience using source management tools like Git
- Have a willingness to learn new technologies and methodologies
- Understand the difference between shipping a project that’s done versus a project that is perfect
- Have a customer-centric attitude and willingness to enthusiastically support the engineering teams to help HashiCorp continue to deliver great products and services
- Familiarity with durable workflow technologies, such as Temporal or Cadence
- Professional experience with configuration management tools such as Ansible, Chef, Puppet, or Salt to manage Linux hosts
- Familiar with infrastructure management and operations lifecycle concepts
- Experience building and supporting the production infrastructure for a large-scale SaaS application
- Prior exposure to building and operating a large-scale cloud-based infrastructure
IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer.
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.