Site Reliability Engineering Manager
Reddit, New York
Reddit is an American social news aggregation, web content rating, and discussion website.
Reddit is a network of more than 100,000 communities where people can dive into anything through experiences built around their interests, hobbies and passions. Reddit users submit, vote and comment on content, stories and discussions about the topics they care about the most. From pets to parenting, there’s a community for everybody on Reddit, and with more than 50 million daily active users, it is home to the most open and authentic conversations on the internet. For more information, visit redditinc.com.
Reddit is building a top-tier SRE organization and is looking for engineering managers to help shape and grow it from its existing core.
This is a high-impact role where you will drive technical roadmaps, operations philosophy, architecture review, and execution for one of the largest sites in the world. The ideal candidate understands the value of an engineering- and metrics-centered approach to reliable service support, roots out toil wherever it may live, and knows that “Hope is not a strategy.”
What You’ll Do
- Build, hire and lead a high-calibre team of Site Reliability Engineers to act as a source of focused expertise, and a force multiplier for Reddit’s product engineering.
- Support multiple Reddit product teams with expertise and engineering development to optimize availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning.
- Lead by example, care for the team, and establish credibility with the quality of the team's technical execution.
- Drive a virtuous cycle of improvement with blame-free postmortems.
- Coach and mentor engineers to support the distribution of best practices across Reddit as a whole.
What We Look For
- 2+ years of experience managing a team of software engineers and/or SREs.
- 5+ years of experience developing cloud and internet-scale systems.
- Strong track record of managing a team including hiring, onboarding, and professional development.
- Experience with problem solving, and with analyzing and troubleshooting systems.
- Software development experience in one or more of: Python, Java, Go, C++, Rust, etc.
- Strong preference is given for deep experience with any of:
  - Cloud infrastructure (AWS, GCE)
  - Metrics, monitoring, and alerting systems
  - CI/CD automation
- BS degree in Computer Science or a similar technical field of study, or equivalent practical experience.
Reddit is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, please contact us at [email protected].
Founded by Steve Huffman and Alexis Ohanian in 2005, Reddit is an online community where users submit, vote, and comment on content, news, and discussions. Nicknamed "the front page of the internet," Reddit is one of the top ten sites in the United States (source: Alexa), with hundreds of millions of users each month on desktop, mobile web, and our official Android/iOS apps.