Staff Software Engineer, Capacity Engineering

pinterest

Staff Software Engineer, Capacity Engineering
Location: San Francisco / Remote
Department: Engineering
Job Type: Regular


Company Overview:

Pinterest is a platform where millions of users come every day to find inspiration, explore new possibilities, and plan for the future. Our mission is to help users discover ideas that inspire them and create a life they love. As a Staff Software Engineer, you will be pivotal in optimizing and managing the largest-scale cloud-native infrastructures, directly impacting Pinterest’s operations and efficiency.


Position Overview:

Pinterest is seeking a Staff Software Engineer to join the Capacity Engineering team, focused on optimizing and managing the Machine Learning (ML) infrastructure. You will play a key role in scaling and improving the efficiency of Pinterest’s ML infrastructure, contributing to strategic priorities and collaborating across teams to deliver robust, secure, and efficient ML foundations.


Key Responsibilities:

  • Manage and optimize ML hardware capacity, ensuring it powers the models running at Pinterest.
  • Enhance the efficiency of Pinterest’s ML Infrastructure at scale.
  • Develop profiling and optimization capabilities for ML infrastructure across Pinterest.
  • Collaborate with ML Platform, Infrastructure Engineering, and SRE teams to ensure high availability, resiliency, and security of ML foundations.

Qualifications:

  • Experience: Strong understanding of GPU architectures, Pytorch, and other parts of the ML software stack (Scheduling, Data, Storage).
  • Technical Expertise: Hands-on experience with Kubernetes and large, cloud-native multi-tenant platforms.
  • Software Proficiency: Proficiency in languages such as Java, Python, and C++.
  • ML Knowledge: Deep understanding of ML models, kernels, and optimization techniques.
  • Scalability: Experience in building and managing highly available distributed applications at scale.
  • Cloud Expertise: Experience with AWS or similar cloud environments.
  • Capacity & Performance: In-depth knowledge of infrastructure capacity and performance optimization.
  • Education: Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience.

Additional Information:

  • Work Model: Hybrid (in-office collaboration 1-2 times per quarter).
  • Relocation: This position is not eligible for relocation assistance.
  • Salary Range: $170,371—$350,763 USD (based on location, prior experience, and skills).

Pinterest’s Commitment to Inclusion:

Pinterest is an equal opportunity employer committed to diversity and inclusion. All qualified applicants will be considered without regard to race, color, gender, sexual orientation, disability, or other protected status.


To apply for this job please visit www.pinterestcareers.com.

Job Overview
Job Location