Technical Program Management Director, DET Site Reliability

Salesforce

Salesforce

IT
Dublin, Ireland
Posted on Oct 16, 2025

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.

Job Category

Program & Project Management

Job Details

About Salesforce

Salesforce is the #1 AI CRM, where humans with agents drive customer success together. Here, ambition meets action. Tech meets trust. And innovation isn’t a buzzword — it’s a way of life. The world of work as we know it is changing and we're looking for Trailblazers who are passionate about bettering business and the world through AI, driving innovation, and keeping Salesforce's core values at the heart of it all.

Ready to level-up your career at the company leading workforce transformation in the agentic era? You’re in the right place! Agentforce is the future of AI, and you are the future of Salesforce.

The Digital Enterprise Technology (DET) Site Reliability team is seeking an experienced Technical Program Manager to serve as deputy to the Sr. Director of Site Reliability. This role drives reliability programs for our internal enterprise systems covering compute, storage, networking, identity, and access management.

You'll shape and execute the reliability strategy while keeping day-to-day operations running smoothly. This means partnering with leadership to define frameworks and then making them work across engineering, product, and infrastructure teams. You'll drive SLO programs, lead production readiness initiatives, and serve as an escalation point when major incidents happen. This requires both strategic thinking and hands-on execution.

Responsibilities

Program Leadership

  • Partner with leadership to define and implement Service Level Objectives (SLOs) and reliability frameworks for enterprise systems.

  • Drive adoption of production readiness practices throughout engineering and product organizations.

  • Build relationships with stakeholders across multiple business units to identify gaps and opportunities.

  • Lead recurring planning for reliability initiatives aligned with organizational goals.

  • Foster a culture of continuous learning within reliability engineering.

Incident Response

  • Serve as an escalation point for major incidents affecting DET systems.

  • Act as incident commander for critical production issues when severity reaches the highest levels.

  • Enforce incident response procedures across all teams.

  • Coordinate with incident managers worldwide on high priority situations.

  • Keep executives informed with timely updates during active incidents.

Delivery Management

  • Lead the planning and execution of programs using Agile, Scrum, and SAFe principles across multiple teams.

  • Manage risks, issues, and cross-team dependencies before they become problems.

  • Track and report on key performance indicators (KPIs).

  • Communicate program status, risks, and progress to stakeholders and leadership.

  • Run governance structures that enable effective executive decision making.

Operational Excellence

  • Implement automation, tooling, and best practices for reliability engineering.

  • Facilitate technical discussions to surface critical tradeoffs and dependencies.

  • Support incident postmortem and root cause analysis processes.

  • Champion continuous improvement in production readiness and SRE practices.

  • Oversee problem management and continuous improvement processes.

Required Experience

Education & Experience

  • Bachelor's degree in Computer Science, Engineering, or related field (or equivalent practical experience).

  • 8+ years of experience in software engineering organizations.

  • 5+ years of experience as a Technical Program Manager or comparable role.

  • Demonstrated track record implementing and executing SRE or production readiness programs.

  • Experience managing geographically distributed teams.

Technical Expertise

  • Strong understanding of cloud infrastructure, distributed systems, and large-scale production systems.

  • Experience with operational health monitoring, incident management, and incident response processes.

  • Familiarity with cloud deployments, migrations, and data center technologies.

  • Knowledge of infrastructure lifecycle management and ITIL practices.

  • Ability to articulate system-level tradeoffs, identify risks, and probe critical paths.

  • Proven experience leading incident response for complex distributed systems.

  • Strong background managing high pressure situations requiring quick decisions and clear communication.

Professional Skills

  • Experience leading ITSM/Infrastructure projects and programs across multiple teams.

  • Solid command of project lifecycle and project management methodologies (PMI Framework, SAFe).

  • Experience using project planning and PLM tools like Jira, Smartsheet, Asana, or Linear.

  • Strong communication abilities to convey complex technical concepts to diverse audiences.

  • Experience with documentation and work tracking tools such as Jira, Confluence, and project management platforms.

  • Demonstrated success influencing without formal authority across engineering teams.

  • Ability to thrive in ambiguous environments while maintaining operational excellence standards.

Unleash Your Potential

When you join Salesforce, you’ll be limitless in all areas of your life. Our benefits and resources support you to find balance and be your best, and our AI agents accelerate your impact so you can do your best. Together, we’ll bring the power of Agentforce to organizations of all sizes and deliver amazing experiences that customers love. Apply today to not only shape the future — but to redefine what’s possible — for yourself, for AI, and the world.

Accommodations

If you require assistance due to a disability applying for open positions please submit a request via this Accommodations Request Form.

Posting Statement

Salesforce is an equal opportunity employer and maintains a policy of non-discrimination with all employees and applicants for employment. What does that mean exactly? It means that at Salesforce, we believe in equality for all. And we believe we can lead the path to equality in part by creating a workplace that’s inclusive, and free from discrimination. Know your rights: workplace discrimination is illegal. Any employee or potential employee will be assessed on the basis of merit, competence and qualifications – without regard to race, religion, color, national origin, sex, sexual orientation, gender expression or identity, transgender status, age, disability, veteran or marital status, political viewpoint, or other classifications protected by law. This policy applies to current and prospective employees, no matter where they are in their Salesforce employment journey. It also applies to recruiting, hiring, job assignment, compensation, promotion, benefits, training, assessment of job performance, discipline, termination, and everything in between. Recruiting, hiring, and promotion decisions at Salesforce are fair and based on merit. The same goes for compensation, benefits, promotions, transfers, reduction in workforce, recall, training, and education.