Uncubed
   

Site Reliability Engineering Manager

Reddit, San Francisco, CA

Reddit is an American social news aggregation, web content rating, and discussion website.


Our mission is to bring community and belonging to everyone in the world. Reddit is a community of communities where people can dive into anything through experiences built around their interests, hobbies, and passions. With more than 50 million people visiting 100,000+ communities daily, it is home to the most open and authentic conversations on the internet. From pets to parenting, skincare to stocks, there’s a community for everybody on Reddit. For more information, visit redditinc.com

Reddit is building a top tier SRE organization, and is looking for engineering managers to help shape and grow it from its existing core.  

This is a high impact role where you will drive technical roadmaps, operations philosophy, architecture review, and execution for one of the largest sites in the world.   The ideal candidate understands the value of an engineering and metric centered approach to reliable service support, roots out toil wherever it may live, and knows that “Hope is not a strategy.”

What You’ll Do

  • Build, hire and lead a high-calibre team of Site Reliability Engineers to act as a source of focused expertise, and a force multiplier for Reddit’s product engineering.
  • Support multiple Reddit product teams with expertise and engineering development to optimize availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning.
  • Lead by example, care for the team, and establish credibility with the quality of the team's technical execution. 
  • Drive a cycle of virtuous improvement with blame-free postmortems.  
  • Coach and mentor engineers to support the distribution of best practices across Reddit as a whole.

What We Look For

  • 2+ years experience of managing a team of software engineers and/or SREs.
  • 5+ years of experience developing cloud and internet-scale systems.
  • Strong track record of managing a team including hiring, onboarding, and professional development.
  • Experience problem solving and analyzing and troubleshooting systems
  • Software development experience in one or more of: Python, Java, Go, C++, Rust, etc.
  • Strong preference is given for deep experience with any of:
    • Cloud infrastructure (AWS, GCE)
    • Kubernetes 
    • Metrics, monitoring, and alerting systems
    • CI/CD automation
  • BS degree in Computer Science, similar technical field of study or equivalent practical experience

#LI-remote, #LI-JS3

Reddit is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, please contact us at [email protected].

About Reddit

Founded by Steve Huffman and Alexis Ohanian in 2005, Reddit is an online community where users submit, vote, and comment on content, news, and discussions. Nicknamed "the front page of the internet," Reddit is one of the top ten sites in the United States (source: Alexa), with hundreds of millions of users each month on desktop, mobile web, and our official Android/iOS apps. 

Want to learn more about Reddit? Visit Reddit's website.