Millions of people across the world come to Pinterest to find new ideas every day. It’s where they get inspiration, dream about new possibilities and plan for what matters most. Our mission is to help those people find their inspiration and create a life they love. In your role, you’ll be challenged to take on work that upholds this mission and pushes Pinterest forward. You’ll grow as a person and leader in your field, all the while helping Pinners make their lives better in the positive corner of the internet.
Pinterest is looking for an experienced site reliability engineer to build and run our large-scale distributed systems. As an SRE on the Big Data Query Platform team, you will design and build the applications and infrastructure that handle billions of monthly page views and petabytes of data as Pinterest continues to grow.
What You’ll Do:
Develop and operate across a large-scale data and storage technology stack
Develop software that improves the operability of large-scale distributed systems handling petabytes of data
Manage capacity and performance to help scale our infrastructure on both public and private clouds around the world
What We’re Looking For:
Knowledge of Linux/Unix/BSD internals and experience working with open source software (e.g. MySQL, Hadoop, Envoy, HAProxy, and Nginx)
Experience with technologies such as Kubernetes, TensorFlow, Elasticsearch, ZooKeeper, HBase, Hadoop, Memcache and Kafka with a focus on reliability, automation, operability and performance
2+ years of experience with programming languages (Python, Golang, Ruby)
Experience with infrastructure as code is a plus (e.g. Terraform, Puppet, Chef, Ansible, Salt, Fabric, Docker)
Bonus points if experienced with deploying web apps to cloud infrastructure (AWS), working with distributed, service-oriented architecture, and large-scale machine learning infrastructure.