Uncubed

Site Reliability Engineer (SRE)

Yelp, San Francisco, CA

Connecting people with great local businesses


Our Site Reliability Engineers are the primary interface between our developers and our production operations. No matter how many times we get searched, scraped, scanned, spammed, pinged, paged or queried, keep calm - and keep the site running smoothly.  We work in both the dev and systems worlds, instrumenting key parts of core architecture while supporting developers as they try to do the same.  

We're looking for a renaissance coder - someone who lives, breathes and dreams automation and troubleshooting. Of course, you'll work with a variety of programming languages, but get ready to dive into distributed systems, strapped with tools such as strace and tcpdump, to go solve problems!  
You'll implement monitoring and alerting systems to support site stability and performance.

You'll proactively scale our infrastructure to meet ever-increasing demand.  You'll make sure that when something goes bump in the night, someone hears it. And you'll play a key role in keeping Yelp fast, stable and growing.

What You Will Do:

  • Work closely with developers in supporting new features and services.
  • Monitor site stability and performance.
  • Scale infrastructure to meet demand.
  • Troubleshoot site issues.
  • Develop custom tools as necessary.
  • Document system design and procedures.
  • Participate in light on-call rotation.

We Are Looking For:

  • Mastery of Linux or Unix.
  • Command of your favorite modern programming language: Python, Ruby, Java, C++, etc.
  • Proficiency with configuration management tools like puppet, chef, ansible, etc.
  • Solid understanding of fundamental networking technologies.
  • Knowledge of best practices related to security, performance, and disaster recovery.
  • Experience with web server configuration, monitoring, trending, network design, high availability.
  • Excellent communication skills.
  • A sense of humor!

Pluses:

  • Past experience with MySQL, PostgreSQL, or replicated other databases (high availability, scale-out replication).
  • Advanced knowledge of network design, management of Juniper network equipment, or BGP.
  • Experience at a large-scale consumer internet site.
  • Ubuntu distribution familiarity.
  • Deep understanding of the Python runtime and ecosystem.
Interested in applying? Sweet! Share with us why you want to work at Yelp, and don't forget to mention any side projects, open source contributions, and personal websites/blogs.

Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

*LI-MS1

About Yelp

Yelp connects people with great local businesses. Our users have contributed approximately 127 million cumulative reviews of almost every type of local business, from restaurants, boutiques and salons to dentists, mechanics, plumbers and more. These reviews are written by people using Yelp to share their everyday local business experiences, giving voice to consumers and bringing “word of mouth” online.

Want to learn more about Yelp? Visit http://www.yelp.com