Uncubed
   

Senior Site Reliability Engineer

Reddit, London

Reddit is an American social news aggregation, web content rating, and discussion website.


Reddit is a network of more than 100,000 communities where people can dive into anything through experiences built around their interests, hobbies and passions. Reddit users submit, vote and comment on content, stories and discussions about the topics they care about the most. From pets to parenting, there’s a community for everybody on Reddit and with more than 50 million daily active people, it is home to the most open and authentic conversations on the internet. For more information, visit redditinc.com.

*Remote - UK*

Reddit is poised to rapidly innovate and grow like no other time in its history. This is a unique opportunity to leave your mark on one of the most influential and trafficked corners of the internet.

As a Site Reliability Engineer on Reddit’s Infrastructure team, you’ll use your knowledge of operating distributed systems to improve the consistency, reliability, and performance of our growing ecosystem of services. You’ll also use your development experience to contribute to the internal Infrastructure Product that all of Reddit Engineering uses to develop, deploy, and operate their services.

Join us and help build the future of Reddit!

Responsibilities

  • Advise: Work with engineering teams in designing and developing systems that are resilient and highly performant at tremendous scale
  • Amplify: Contribute to the development our internal Infrastructure Product, which is used by Reddit engineering teams to build, deploy, and operate their services
  • Automate: Build tools and systems to support the operation of our infrastructure and services
  • Diagnose: Draw on your knowledge of distributed systems to identify and fix network, system, and service-level issues
  • Optimise: Observe and improve performance, reduce cost, and improve the experience for millions of users

Qualifications

  • 5+ years of experience in Software Engineering, Site Reliability Engineering, or a Development focused DevOps role.
  • Proficiency in one or more of the following: Go, Python, C, C++, Java, Perl, Rust
  • Experience with Kubernetes and Cloud systems
  • Experience with the development and operation of high-traffic backend systems
  • A demonstrated ability to debug, fix, and optimise code
  • Troubleshooting skills that span applications, networking (TCP/IP), and systems
  • Strong working knowledge of Linux
  • Excellent communication and collaborative skills

Nice-to-haves

  • While not required, familiarity with any of these is a big plus!
  • Experience working in an environment that applies Infrastructure-as-code principles
  • Exposure to a Configuration Management System (Puppet, Chef, Salt, etc)
  • Experience with Infrastructure-as-code processes (via Terraform, CloudFormation, etc)
  • Docker or Kubernetes in a production setting
  • Working knowledge of Amazon Web Services or Google Cloud Platform



Reddit is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, please contact us at [email protected].

About Reddit

Founded by Steve Huffman and Alexis Ohanian in 2005, Reddit is an online community where users submit, vote, and comment on content, news, and discussions. Nicknamed "the front page of the internet," Reddit is one of the top ten sites in the United States (source: Alexa), with hundreds of millions of users each month on desktop, mobile web, and our official Android/iOS apps. 

Want to learn more about Reddit? Visit Reddit's website.