Senior Systems Reliability Engineer - Studio & Corporate Infrastructure

Netflix, Los Angeles, California

Leading subscription service for watching TV episodes and movies

Los Angeles, CA; Los Gatos, CA; New York, NY

The Studio & Corporate Infrastructure team builds, delivers and operates compute, storage, and network infrastructure globally for the evolving needs of our studio and corporate technical teams. As Netflix grows globally, we continue to look for the best and brightest talent to scale with our growth. Our team is looking for a Senior Systems Reliability Engineer to contribute to the buildout of our edge compute and storage infrastructure and work with internal partners, creatives, production teams and external vendors around the world to deliver amazing experiences for our technical teams. We are looking for an experienced Senior Systems Reliability Engineer who brings a broad set of technical skills and achievements, a development- and automation-focused mindset to solving problems, and an impressive history of unique career and life experiences to bring diverse views to our team as we expand our infrastructure in support of our offices, studios, and locations around the globe.

Be sure to review our culture page and long-term view to learn more about the unique Netflix culture and the opportunity to be part of our team.

Job Description

  • Work with team members to design, deploy and operate the compute, storage and network platforms globally
  • Collaborate with business and product development teams to implement the infrastructure required to support their vision and strategy
  • Provide support for our planning and deployment teams to enable stability, predictability and scale in our continued growth
  • Work with internal and external partners, facility operators and our hardware vendors to design and develop a roadmap for the evolution of our infrastructure platform strategy
  • Collaborate with members of the Platform Engineering team to implement and support far-reaching strategic efforts, provide constructive feedback, and foster a collaborative environment
  • Work cross-functionally with internal teams and vendors to manage our growth around the globe, with a strong focus on maintaining a high level of availability for our users
  • Actively engage with internal teams to develop frameworks and tools to drive extensive automation of the infrastructure environment

Professional Qualifications

  • At least 7 years of experience in systems engineering; demonstrable technical experience in new platform development, orchestration, product ownership, and iterative design and deployment
  • Experience designing and deploying large scale systems, multi-vendor platforms and globally distributed infrastructure
  • Strong knowledge of system design; high performance computing; file, block, and storage technologies; integration of compute, storage, and network technologies to deliver cohesive infrastructure solutions
  • Experience in developing partner relationships, including a deep understanding of how compute, storage, and networks work together to form a high-performance platform
  • Able to write functional and maintainable code, with hands-on experience in Python and Java in particular; able to articulate and demonstrate how to use software development techniques and methods to deliver new platforms and solutions
  • High level of understanding of full-stack automation, with examples of projects you have executed; our scale is going to require a lot of it, so as we grow we aim to reduce manual intervention and work with both internal and open-source tools to automate day-to-day activities

Organizational Qualifications

  • Self-organize, collaborate and manage efforts with peers and teams across responsibility areas, languages, geography and time zones; act as an informed captain and make data-backed decisions on behalf of Netflix
  • Be a self-starter, curious and not afraid to ask questions and challenge the way things are done today; farm for dissent and seek alternative views to both problems and your solutions
  • See a problem or opportunity, take ownership and act on it independently; develop methods and habits to be an informed and motivated product owner
  • Inform, educate and develop team members, peers and customers to grow expertise, develop alignment and encourage collaboration; being able to present your ideas concisely and in an informative manner to both large and small audiences is an important skill for your role as a technical leader
  • Know when to take ownership and when to escalate issues; our teams cover a global footprint, so it is critical to have the self, situational and organizational awareness to know when you should take the lead on an issue and when you should bring in reinforcements to help tackle the problem at hand
  • Develop technical documentation, diagrams and templates and be rigorous in maintaining accuracy and relevancy of developed materials 

Additional Requirements

  • Escalation on-call rotation
  • Occasional travel (quarterly offsites, conferences)
  • Experience working in media and studio environments is very much desired

About Netflix

Netflix is the world’s leading Internet television network with over 100 million members in over 190 countries enjoying more than 125 million hours of TV shows and movies per day, including original series, documentaries and feature films. Members can watch as much as they want, anytime, anywhere, on nearly any Internet-connected screen. Members can play, pause and resume watching, all without commercials or commitments.

Want to learn more about Netflix? Visit Netflix's website.