Job Description
Manager, Cloud Engineering We are looking for a Manager of Cloud Engineering to join our engineering team at HDAI. You will lead the Cloud Engineering team responsible for the infrastructure that powers the HealthVision™ platform. This team owns the AWS environment that supports our data ingestion pipelines, AI/ML inference workloads, and clinical-facing applications. High availability, security, and cost efficiency of our cloud infrastructure are essential to delivering AI models that clinicians trust with patient care. This role comes at a key turning point in our platform development: we are scaling to onboard new health system partners, deepening our AI capabilities, and maturing our infrastructure practices to meet growing compliance requirements (HIPAA, SOC 2, HITRUST). You will be hands-on technically while also growing and mentoring a team of cloud engineers. Strong infrastructure and DevOps skills are a must—your designs and decisions must meet a high bar of reliability and security while achieving delivery timelines effectively. At HDAI, we nurture a culture of continually improving, and one of the best assets of our engineers is a love for learning. We hope that you too share this goal. A successful candidate will demonstrate strong attention to detail, clear communication, and effective follow-through. We are excited to have you as a part of this team!
Responsibilities:
1. Lead, mentor, and develop a team of cloud engineers, setting clear expectations, conducting regular one-on-ones, and creating growth opportunities for each team member. 2. Own the design, implementation, and operational health of HDAI's AWS cloud infrastructure, including compute (ECS, Lambda), networking, storage (S3), messaging (SQS), and monitoring. 3. Drive infrastructure-as-code practices using Terraform, ensuring environments are reproducible, version-controlled, and peer-reviewed. 4. Partner with Data Engineering, ML Operations, API Services, and Frontend Engineering teams to ensure infrastructure meets the performance, scalability, and reliability needs of the HealthVision™ platform. 5. Own cloud cost management, proactively identifying opportunities to optimize spend and treating cost discipline as an engineering problem. 6. Ensure that compliance requirements (HIPAA, SOC 2, HITRUST) and security best practices are embedded into infrastructure design and operations, not bolted on after the fact. 7. Build and maintain CI/CD pipelines, container orchestration, and deployment automation to support rapid and reliable software delivery across engineering teams. 8. Establish and evolve operational practices including monitoring, alerting, incident response, on-call rotations, and post-incident reviews. 9. Participate in project planning, execution, and delivery, ensuring infrastructure projects are completed on time and within scope, with risk minimization strategies. 10. Contribute to a culture of continuous learning, growth, and innovation within the Cloud Engineering team and across the broader engineering organization. Qualifications:
1. Bachelor's Degree in Computer Science, Engineering, or equivalent. 2. 5+ years of working experience in cloud engineering, DevOps, or infrastructure engineering, with at least 1 year in a people management or team lead role. 3. Deep hands-on experience with AWS services including ECS, Lambda, S3, SQS, VPC, IAM, CloudWatch, and related services. Experience with other cloud platforms (Azure, GCP) is a plus. 4. Strong proficiency with Infrastructure as Code, particularly Terraform. Experience with configuration management and environment provisioning at scale. 5. Proficient with containerization technologies (Docker, ECS/Fargate) and modern deployment strategies. 6. Solid experience building and maintaining CI/CD pipelines and DevOps tooling to support continuous delivery. 7. Familiarity with Linux systems administration, networking fundamentals, and security best practices in cloud environments. 8. Experience operating infrastructure in a regulated environment. Understanding of HIPAA compliance requirements in practice is strongly preferred; familiarity with SOC 2 and HITRUST is a plus. 9. Applicants must possess a strong ability to diagnose and resolve complex infrastructure and operational issues through effective troubleshooting and root-cause analysis. 10. Excellent interpersonal skills with demonstrated ability to effectively communicate with internal and external teams; ability to develop trust, cooperation, and mutual respect. 11. You exemplify strong accountability and ensure the quality of your work and your team. Pay:
$165,445.00 - $195,765.00 per year Benefits:
401(k) Dental insurance Employee assistance program Flexible schedule Health insurance Health savings account Life insurance Paid time off Referral program Retirement plan Tuition reimbursement Vision insurance Work Location:
Hybrid remote in Dedham, MA 02026