SOFTWARE ENGINEERING DIRECTOR I, Production Support Operations
Job
Truist
Richmond, VA (In Person)
Full-Time
Review key factors to help you decide if the role fits your goals.
Pay Growth
?
out of 5
Not enough data
Not enough info to score pay or growth
Job Security
?
out of 5
Not enough data
Calculating job security score...
Total Score
100
out of 100
Average of individual scores
Skill Insights
Compare your current skills to what this opportunity needs—we'll show you what you already have and what could strengthen your application.
Job Description
- The position is described below. If you want to apply, click the button at the top or bottom of this page. After you click and complete your application, you'll be invited to create a profile, which will let you see your application status and any communications. If you already have a profile with us, you can log in to check status.
- Need Help? (https://pp-cdn.phenompeople.com/CareerConnectResources/prod/TBJTBFUS/documents/Career\_site\_FAQ-1758133253710.pdf) _If you have a disability and need assistance with the application, you can request a reasonable accommodation. Send an email to_ Accessibility (careers@truist.com?subject=Accommodation%20request) _(accommodation requests only; other inquiries won't receive a response)._
Regular or Temporary:
- Regular
Language Fluency:
- English (Required)
Work Shift:
- 1st shift (United States of America)
- Please review the following job description:
- The Director of Production Support leads teams responsible for ensuring the stability, resilience, and operational excellence of critical technology platforms supporting core lines of business.
ESSENTIAL DUTIES AND RESPONSIBILITIES
- Following is a summary of the essential functions for this job. Other duties may be performed, both major and minor, which are not mentioned below. Specific activities may change from time to time.
- Production Support Leadership & Accountability
- Own end-to-end production support operations
- for multiple mission-critical applications supporting key lines of business, ensuring availability, stability, and performance meet defined SLAs and SLOs. Provide accountable, visible leadership for
- 24x7 operational support
- , including on-call models, escalation paths, and incident response effectiveness. Act as the senior escalation point for
- major incidents
- , ensuring swift recovery, accurate root cause analysis, and durable remediation.
- Incident & Problem Management
- Lead cross-functional incident recovery efforts in partnership with Incident Management, engineering teams, infrastructure, and business stakeholders. Ensure
- timely root cause analysis (RCA)
- , post-incident reviews, and corrective actions that prevent recurrence. Establish and mature a
- production knowledge base
- , documenting known issues, recovery procedures, and architectural insights.
- Engineering-First & SRE Practices
- Drive adoption of
- Site Reliability Engineering (SRE)
- and lean engineering principles, including: Reduction of toil through automation Engineering-based reliability metrics (error budgets, SLIs/SLOs) Proactive resilience and failure prevention practices Champion automation of repetitive and manual operational tasks, including incident detection, response, validation, and recovery where feasible. Promote a culture of
- preventative engineering
- , partnering with development teams to improve system reliability upstream.
- Monitoring, Observability & AI Enablement
- Implement and continuously improve
- real-time monitoring, alerting, and observability
- across applications and infrastructure. Measure and optimize the effectiveness of monitoring and alerting to eliminate noise and accelerate mean-time-to-detect and mean-time-to-recover. Leverage
- AI and advanced analytics
- to correlate telemetry data (logs, metrics, traces) and proactively identify emerging risks and root causes. Champion the safe and responsible use of AI within production operations by adhering to enterprise guardrails and protecting sensitive data and system integrity.
- Operational Readiness & Change Enablement
- 14. Oversee operational readiness across releases, disaster recovery and failover testing and certificate and dependency lifecycle management. Ensure production support is actively embedded in
- change planning
- , minimizing risk from releases and infrastructure changes.
- People, Vendor & Financial Management
- Lead one or more Agile teams (Scrum, Kanban), including onshore and offshore engineers, fostering high performance and accountability. Manage workforce vendors and partners, setting expectations, reviewing performance, and ensuring delivery quality. Own budget and staffing plan aligned to application criticality, operational risk, and business growth objectives.
- Risk Management & Governance
- Act as the first line of defense in production operations by proactively identifying and mitigating technology, operational, and resiliency risks.
- Strategy, Influence & Continuous Improvement
- Serve as a trusted advisor to senior Technology and Business leaders, communicating operational health, risk posture, and improvement roadmaps. Lead or contribute significantly to
- large-scale initiatives
- , platform transformations, or regulatory-driven efforts. Continuously assess organizational maturity and lead initiatives to improve reliability, efficiency, and talent capability.
- Management Responsibilities
- + Full people management accountability, including: + Hiring and succession planning + Coaching and performance management + Compensation input and talent development + Disciplinary action and terminations as necessary
- Agile & Operating Model Expectations
- + Act as an
- Agile and DevOps champion
- , embedding production support within fast-moving delivery models. + Balance "
- keep-the-lights-on
- " operational excellence with continuous engineering improvement. + Drive measurable outcomes such as improved uptime, reduced incident volume, faster recovery, and improved customer experience.
QUALIFICATIONS
Required Qualifications:
- The requirements listed below are representative of the knowledge, skill and/or ability required.
Preferred Qualifications:
- 1.
OTHER JOB REQUIREMENTS / WORKING CONDITIONS
- Visual / Audio / Speaking
- Able to access and interpret client information received from the computer and able to hear and speak with individuals in person and on the phone.
- Manual Dexterity / Keyboarding
- Able to work standard office equipment, including PC keyboard and mouse, copy/fax machines, and printers.
- Availability
- Able to work all hours scheduled, including overtime as directed by manager/supervisor and required by business need.
- Travel
- Up to 50%
- General Description of Available Benefits for
Eligible Employees of Truist Financial Corporation:
- All regular teammates (not temporary or contingent workers) working 20 hours or more per week are eligible for benefits, though eligibility for specific benefits may be determined by the division of Truist offering the position.
- _Truist is an Equal Opportunity Employer that does not discriminate on the basis of race, gender, color, religion, citizenship or national origin, age, sexual orientation, gender identity, disability, veteran status, or other classification protected by law. Truist is a Drug Free Workplace._
- EEO is the Law (https://www.
Similar remote jobs
UnitedHealth Group
Fort Wayne, IN
Posted2 days ago
Updated4 hours ago
Similar jobs in Richmond, VA
V.L.S. Systems, Inc
Richmond, VA
Posted2 days ago
Updated4 hours ago
Similar jobs in Virginia
DNI Delaware Nation Industries
Alexandria, VA
Posted2 days ago
Updated4 hours ago
Virginia Zoological Society
Norfolk, VA
Posted2 days ago
Updated4 hours ago