Uncubed

Technical Program Manager, Reliability

Lyft, San Francisco, CA

Lyft is your friend with a car, whenever you need one


Lyft connects people to transportation to change the way we live and get around our communities. Lyft’s engineering team is growing rapidly, and we are looking for Technical Program Managers to help us scale. Come be part of a new team at Lyft focused at enabling and empowering engineering teams to deliver at scale.

Technical Program Managers at Lyft drive cross-functional initiatives, leveraging strong leadership, planning, communication, and collaboration skills. Our TPMs are technically strong with software engineering or systems engineering experience. They are problem solvers who make things happen around them by setting clear goals and inspiring teams to deliver. TPMs at Lyft are both strategic and tactical and do what it takes to successfully deliver key programs that have a material impact on the business.

We are seeking a technically strong Technical Program Manager to own our end-to-end reliability program. As a transportation company, reliability is critical in ensuring we deliver a trusted experience, every time. We’re looking for someone with a passion for infrastructure, building reliability and quality in, and developing processes and a learning culture around incident management and post mortems.

Responsibilities

  • Identify and resolve systemic issues impacting Lyft engineering and aggressively take action to resolve
  • Establish on-call best practices and education with regards to incident response and internal communication
  • Establish and lead weekly operational review(s)
  • Own incident management, post mortem process, and follow through
  • Own production readiness review process and communication
  • Partner with Perf Frameworks and RSWE (Reliability SWE) on readiness for major events Lead regular incident dry runs and disaster recovery testing
  • Define and train a rotating set of incident commanders
  • Establish and manage the incident review team
  • Lead iterative delivery of strategic cross-functional initiatives from concept to ship, through focus, transparency, communication, visibility, and accountability  
  • Leverage deep technical expertise with large-scale, distributed 24x7 production systems to build comprehensive plans, to identify risks, and to ensure smooth project launches with a goal of delighting our passengers and drivers
  • Partner with engineering, product, and business leadership to build highly collaborative teams and to enhance communication across teams and stakeholders

Experience and Skills

  • 5+ years as a TPM, engineering, technical operations, or product leader
  • Experience leading cross-functional teams to deliver complex projects iteratively with multiple dependencies and constraints, in a highly dynamic and agile environment
  • Proven ability to operate effectively and autonomously across multiple teams in situations of extreme ambiguity, with only high level direction
  • Experience building roadmaps, release plans, project plans with a thorough understanding of dependency management
  • Able to communicate highly technical problems and solutions at all levels from engineer to partner to C-level executives
  • Able to influence, negotiate and inspire others in a matrixed environment
  • Excellent organization, planning skills, and attention to detail
  • Experience delivering projects in large-scale, distributed production systems and 24x7 production operations
  • At least two years of software engineering or systems engineering experience
  • Bonus points if you have experience in running site reliability programs that have focused on simulated incidents or disaster recovery testing!

About Lyft

Wherever you’re headed, count on Lyft for rides in minutes. The Lyft app matches you with local drivers at the tap of a button. Just request and go.

Ride by ride, we’re changing the way our world works.

Want to learn more about Lyft? Visit https://www.lyft.com/