Senior Distributed Systems Engineer - Real-Time Insights

Netflix, Los Gatos, California

Leading subscription service for watching TV episodes and movies

About the Opportunity
Netflix is enjoyed by over 100 million members around the world who, in aggregate, watch more than 125 million hours of TV shows and movies every day.  The product experience is delivered on 1000+ different device types calling into a complex distributed system of services running in the cloud.  One result of all of this usage is data.  Lots of data.
We are looking for a seasoned distributed systems engineer to help us develop, extend, and scale Mantis, our mission-critical stream processing engine that provides rich, real-time operational insights and other key metrics.  Mantis is used extensively by a breadth of Netflix engineering teams to help us detect operational and functional failures throughout the Netflix architecture that could be affecting the customer experience.  We also use Mantis to trigger alerts and remediations for such failures.  With respect to “lots of data”, Mantis currently processes over one trillion events per day, every day.
About the Role
As a Distributed Systems Engineer on the Mantis team you will have the freedom to design, build and extend the Mantis platform as well as building out analytics capabilities through streaming machine learning components. You'll also be responsible for operating Mantis as a critical service for other Edge teams and Netflix as a whole.
About the Team
The team is part of the Edge Engineering organization where most of the tier-1 critical services reside within Netflix. We're a small, high impact team operating infrastructure which runs thousands of streaming jobs for a breadth of teams across Netflix while also providing real-time insights for the entire engineering organization and real-time alerting on key company-wide metrics.
About You
You should have experience in designing, implementing, and operating large scale, high throughput, low latency distributed systems.  Ideally, you have a background in platform and infrastructure development.  Exposure to and/or experience in stream processing engines, such as Spark Streaming, Flink, Storm, would be great too.  Mantis is built on top of non-blocking technologies such as Netty and RxJava, so exposure to these or comparable technologies would be nice to have as well.  We would also expect either experience and/or eagerness to get more involved in machine learning and analytics.
At Netflix, we like to move fast, bias towards action, and rely on great people to make great judgment calls.  You should have passion for, and curiosity in, distributed systems.  You should also love solving real problems and making an impact for the company, your partners, and your teammates.  Things also change fast and often, so you need to be able to function well with ambiguity and be comfortable making decisions under uncertainty.  For more on the Netflix culture, read our culture document.

More About Mantis and Related Technologies

Hiring Manager information:

About Netflix

Netflix is the world’s leading Internet television network with over 100 million members in over 190 countries enjoying more than 125 million hours of TV shows and movies per day, including original series, documentaries and feature films. Members can watch as much as they want, anytime, anywhere, on nearly any Internet-connected screen. Members can play, pause and resume watching, all without commercials or commitments.

Want to learn more about Netflix? Visit Netflix's website.