Director, Cloud Platform Operations
Tungsten Automation
This job is no longer accepting applications
See open jobs at Tungsten Automation.See open jobs similar to "Director, Cloud Platform Operations" Omega Venture Partners.Director, Cloud Platform Operations
Tracking Code
Job Location
Job Level
Category
Position Type
This role is accountable for the availability, performance, automation, and operational efficiency of our production SaaS environments hosted in Azure, AWS, and colocation data centers, including ensuring service continuity, compliance, and integration with the broader cloud platform strategy.
Operating at the intersection of engineering excellence and production stability, you will lead a globally distributed team delivering on our commitments for uptime, security, scalability, and service observability. You are expected to be deeply fluent in core SRE principles, including service level indicators (SLIs), service level objectives (SLOs), and error budgets, and capable of embedding these concepts into daily operations to drive reliability and accountability across teams. You will collaborate closely with the Cloud Architecture team, Cloud Production Services, Cloud Application Operations, GRC, Cloud Security, and R&D to strengthen our platform's maturity, reduce toil, scale infrastructure reliability, and ensure secure-by-design practices are integrated into operations across 16+ SaaS services. The role also requires strong working knowledge of compliance and security frameworks such as ISO 27001, SOC 2, NIST, and the Cloud Security Alliance (CSA), and close collaboration with the GRC team to maintain security posture, certifications, and audit readiness.
This role demands a relentless focus on cross-functional alignment, structured ways of working, and operational clarity. The ideal candidate is obsessive about documentation—ensuring that every infrastructure component, monitoring decision, response protocol, and operational dependency is clearly described, reviewed, and maintained. Runbooks, architecture descriptions, and well-defined observability standards must be embedded into the operational culture. What we monitor, why we monitor, and how we distinguish normal from abnormal should be unambiguous and accessible to all stakeholders.
Equally, the role requires an uncompromising commitment to automation. Manual effort should be scrutinized and systematically eliminated wherever possible. Reducing toil and increasing efficiency through automation is not a side objective—it is a core leadership responsibility.
This role is in EU and plays a pivotal role in anchoring our global delivery model through regional leadership, engineering rigor, and strong alignment with U.S. and APAC-based teams.
Key Responsibilities
• Lead global Cloud Operations and SRE practices, ensuring modern operational methods and engineering discipline.
• Ensure 24/7/365 monitoring and service availability of production SaaS platforms, driving root cause elimination and continuous improvement.
• Own and drive adoption of SRE principles (SLOs, SLIs, error budgets, incident retrospectives) to ensure service reliability and resilience.
• Drive adherence and adoption to predefined Infrastructure as Code (IaC) modules, containerization, and observability implementations across cloud services.
• Lead the shift toward automation-first operations, eliminating manual toil and reducing mean time to resolve (MTTR).
• Champion comprehensive documentation practices, including infrastructure design, monitoring rationale, incident response runbooks, and operational standards.
• Oversee cost management and capacity planning activities to support performance, scalability, and budget optimization.
• Provide oversight of colocation operations, ensuring alignment with platform-level standards and integration with cloud operations.
• Partner with Cloud Architecture and Cloud Security teams to align operational implementation with platform blueprints and innovation initiatives.
• Provide leadership during compliance audits (SOC2 Type II, ISO 27001), coordinating with the Cloud Security team for security-specific aspects.
• Lead, mentor, and scale a high-performing team of engineers and managers, fostering a culture of ownership, autonomy, and continuous learning.
• Represent Cloud Platform Operations in cross-functional planning, risk management, and strategic delivery efforts.
While the job description describes what is anticipated as the requirements of the position, the job requirements are subject to change based upon any changing needs and requirements of the business.
Required Experience
• 8+ years of experience in cloud operations or SRE; at least 3 years in a senior leadership or director role.
• Deep hands-on experience with public cloud environments (Azure and AWS), especially operating at scale in production.
• Proven leadership in deploying and operating SaaS services in high-availability, secure, and regulated environments.
• Experience overseeing hybrid infrastructure or colocation environments is a plus.
• Strong understanding of modern operational frameworks including SRE and ITIL v4.
• Expertise in observability tooling (e.g., Prometheus, Grafana, logz.io or similar)
• Knowledgeable in Infrastructure as Code (e.g., Terraform, ARM, CloudFormation).
• Proficiency in compliance and security frameworks including ISO 27001, SOC 2, NIST, and Cloud Security Alliance (CSA).
• Experienced in working in the Atlassian suite, Jira and Confluence.
• Demonstrated experience working closely with GRC teams to support security posture, attestations, and external certifications.
• Exceptional leadership, communication, and stakeholder management skills; experience managing globally distributed teams.
• Strong strategic mindset with the ability to execute tactically under pressure.
Preferred Qualifications:
• Experience leading operational teams through periods of hypergrowth, migration, or re-platforming.
• Familiarity with containerization (Docker, Kubernetes), cloud-native architectures, SRE and platform engineering practices.
• Working experience with compliance automation, policy-as-code, and operational risk frameworks.
• Background in multi-tenant SaaS and service-level management across customer verticals.
Tungsten Automation is an Equal Opportunity Employer
This job is no longer accepting applications
See open jobs at Tungsten Automation.See open jobs similar to "Director, Cloud Platform Operations" Omega Venture Partners.