Job Description
Why UKG:
At UKG, the work you do matters. The code you ship, the decisions you make, and the care you show a customer all add up to real impact. Today, tens of millions of workers start and end their days with our workforce operating platform. Helping people get paid, grow in their careers, and shape the future of their industries. That's what we do. We never stop learning. We never stop challenging the norm. We push for better, and we celebrate the wins along the way. Here, you'll get flexibility that's real, benefits you can count on, and a team that succeeds together. Because at UKG, your work matters, and so do you.
About the Team:
We are seeking an experienced DBRE Manager to lead our Database Reliability Engineering team. This role is responsible for ensuring the availability, scalability, performance, and security of critical database systems across multiple platforms and technologies. You will drive operational excellence, lead incident management, and partner closely with engineering, product, and infrastructure teams to deliver highly reliable database services.
About the Role:
- Lead and mentor a team of Database Reliability Engineers supporting multiple database technologies (e.g. SQL Server, MySQL, PostgreSQL, NoSQL platforms).
- Own end-to-end database reliability, including uptime, performance, scalability, and disaster recovery.
- Drive proactive monitoring, alerting, and automation to prevent incidents and reduce toil.
- Oversee incident response, root cause analysis (RCA), and post-incident reviews.
- Establish and enforce best practices for database operations, security, backups, and recovery.
- Collaborate with SRE, DevOps, and application teams to improve system resilience and performance.
- Manage database access controls, compliance, and governance standards.
- Lead capacity planning and performance tuning initiatives.
- Champion infrastructure as code (IaC) and automation strategies.
- Track and report key reliability metrics (SLAs, SLOs, SLIs).
- Support cloud migration and modernization efforts (GCP, AWS, Azure).
- Ensure adherence to applicable security and regulatory requirements (e.g. SOC 2, HIPAA).