Principal/Architect - Availability Engineering & SRE

Salesforce

IT
Multiple locations
Posted on Dec 18, 2024

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.

Job Category

Software Engineering

Job Details

About Salesforce

We’re Salesforce, the Customer Company, inspiring the future of business with AI + Data + CRM. Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And we empower you to be a Trailblazer, too: driving your performance and career growth, charting new paths, and improving the state of the world. If you believe in business as the greatest platform for change and in companies doing well and doing good, you’ve come to the right place.

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Salesforce services have the reliability, capacity, performance and availability to meet our customers' needs, and that they improve at the rate our customers expect.

Our software development focuses on enabling service owners to operate their services safely at scale, whether through paved path integrations onto observability frameworks, optimizing existing systems, designing infrastructure or eliminating work through AI/ML investments or traditional automation. On the SRE team, you’ll have the opportunity to manage the complex challenges of scale which are unique to Salesforce, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

The SRE practice at Salesforce is evolving, and this role will shape the technical strategy for SRE and influence the strategy for the Availability Cloud as a whole. You will embed with product-owning teams, define the availability roadmap and deliver directly against it. Most importantly, you will mature the SRE practice, mentoring and actively developing the engineers around you. Your success is measured by how far you scale the impact and delivery of your community.

Responsibilities

  • Spearhead and enable a culture of Service Ownership. Define healthy service ownership practices and work with embedded teams to develop their knowledge and ownership practice.

  • Engage in and improve the whole lifecycle of services—from inception and design, through to deployment, operation and refinement.

  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.

  • Develop full paved-path observability platform integrations and the automation needed to maintain service, system and product health.

  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for and delivering changes that improve reliability and velocity.

  • Practice sustainable incident response and blameless post mortems. Uphold the quality and high standards of post mortems as part of the Architect community at Salesforce.

  • Develop and grow the engineering talent around you.

Minimum Requirements

  • 15+ years of software development and engineering experience, 5+ years in a technical leadership role

  • Hands-on experience designing, building and operating large scale distributed systems, identifying shortcomings and optimization opportunities, and making data driven cost performance tradeoffs to influence design decisions

  • Demonstrated experience of leading initiatives spanning multiple teams and leveraging deep domain expertise to influence tech roadmap planning and execution

  • Demonstrated ability to effectively collaborate across multiple teams and stakeholders to drive business outcomes

  • Experience mentoring and investing in the development of engineers and peers

  • Ability to reverse engineer solutions through independent code and architecture review, and to envision, define and then contribute to the delivery of availability-improvement refactoring projects

  • Mastery of object-oriented delivery in one or more languages such as Java, Golang, Apex or Python

  • Deep experience working with core web technologies: HTTP, JSON, REST, XML

  • Proficiency with databases including Oracle or other relational and/or NoSQL solutions

  • Experience owning and operating multiple instances of a critical service

  • Experience running critical infrastructure services: monitoring, alerting, logging, tracing and reporting

  • Subject matter expertise in service ownership best practices, SLO/SLI/SLA definition and driving proactive operational awareness, plus experience with incident and problem management

  • Thorough knowledge of Agile development methodology, with experience in both Test-Driven and Behavior-Driven Development practices

Preferred Qualifications

  • Experience in fault modeling and tolerance, chaos engineering, performance and load testing

  • Java, Golang, Python, C++, C

  • Experience in: Kubernetes, Istio, Public Cloud (AWS or other)

Accommodations

If you require assistance due to a disability when applying for open positions, please submit a request via this Accommodations Request Form.

Posting Statement

At Salesforce we believe that the business of business is to improve the state of our world. Each of us has a responsibility to drive Equality in our communities and workplaces. We are committed to creating a workforce that reflects society through inclusive programs and initiatives such as equal pay, employee resource groups, inclusive benefits, and more. Learn more about Equality at www.equality.com and explore our company benefits at www.salesforcebenefits.com.

Salesforce is an Equal Employment Opportunity and Affirmative Action Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, or disability status. Salesforce does not accept unsolicited headhunter and agency resumes. Salesforce will not pay any third-party agency or company that does not have a signed agreement with Salesforce.

Salesforce welcomes all.

Pursuant to the San Francisco Fair Chance Ordinance and the Los Angeles Fair Chance Initiative for Hiring, Salesforce will consider for employment qualified applicants with arrest and conviction records.

For Washington-based roles, the base salary hiring range for this position is $204,400 to $341,900.

For California-based roles, the base salary hiring range for this position is $223,000 to $372,900.

Compensation offered will be determined by factors such as location, level, job-related knowledge, skills, and experience. Certain roles may be eligible for incentive compensation, equity and benefits. More details about our company benefits can be found at the following link: https://www.salesforcebenefits.com.