Systems Engineer - Kubernetes Platform

Cloudflare, Remote US

Help us build a better Internet

About Us

At Cloudflare, we have our eyes set on an ambitious goal: to help build a better Internet. Today the company runs one of the world’s largest networks that powers approximately 25 million Internet properties, for customers ranging from individual bloggers to SMBs to Fortune 500 companies. Cloudflare protects and accelerates any Internet application online without adding hardware, installing software, or changing a line of code. Internet properties powered by Cloudflare all have web traffic routed through its intelligent global network, which gets smarter with every request. As a result, they see significant improvement in performance and a decrease in spam and other attacks. Cloudflare was named to Entrepreneur Magazine’s Top Company Cultures list and ranked among the World’s Most Innovative Companies by Fast Company. 

We realize people do not fit into neat boxes. We are looking for curious and empathetic individuals who are committed to developing themselves and learning new skills, and we are ready to help you do that. We cannot complete our mission without building a diverse and inclusive team. We hire the best people based on an evaluation of their potential and support them throughout their time at Cloudflare. Come join us! 

Many of Cloudflare’s critical internal services run on Kubernetes. These services include those responsible for Cloudflare’s control plane and APIs, data analytics and other internal tools used to manage our global network. These Kubernetes platforms are purpose-built from the ground up and run on bare metal Linux in different regions around the world. The scale we work at involves tens of thousands of containers and terabits per second of network traffic. The team takes pride in knowing this platform helps run parts of the global Internet.

As an engineer on the Kubernetes platform team, you'll be building the tools to help engineers deploy and operate the services that make Cloudflare work. Our mission is to provide a reliable, yet flexible, platform to help product teams release new software efficiently and safely. The role includes both software engineering and DevOps operational responsibilities. 

What You'll Do

  • Improve Kubernetes, Ceph and Prometheus automation, configuration management and other tooling 
  • Design scalable and resilient systems that can keep up with company growth
  • Improve the efficiency of managing resources such as CPU, bandwidth and storage
  • Harden the platform against security threats and resource contention issues
  • Improve our GitOps systems and practices
  • Work with app teams to understand their potential challenges and help them choose the best way to architect their systems on Kubernetes
  • Contribute back to the open source community
    • Some of our favorite open source projects include: Prometheus, Rook.io, Kubevirt, Contour, Envoy, Consul, cdk8s, Vault, Ceph, Cloudprober, Etcd, Calico, Terraform
  • Help respond to and prevent incidents impacting core platforms

What You'll Need

  • Experience managing production Kubernetes or similar orchestration platforms
  • Recent experience with configuration management frameworks such as SaltStack or Ansible
  • Knowledge of how container runtimes work inside of Linux (isolation, storage, and networking)
  • Ability to work with codebases in Bash, TypeScript and Go
  • A firm grasp of IP networking including routing and iptables
  • Excellent debugging skills in a Linux environment
  • Source control experience including branching, merging and rebasing
  • The ability to break down complex problems into smaller pieces, provide options, talk through trade-offs and drive the effort to solve the problem

Bonus Points

  • Experience operating Kubernetes on-premises at scale in capacities including SRE, systems design or architecture
  • Providing guidance and building platforms across multiple zones and regions as a foundation for other teams to build distributed, highly available applications
  • Operational experience with Etcd, Prometheus, Ceph, Rook, SaltStack, Vault, Calico, and other common CNIs such as Cilium

What Makes Cloudflare Special?

We’re not just a highly ambitious, large-scale technology company. We’re a highly ambitious, large-scale technology company with a soul. Fundamental to our mission to help build a better Internet is protecting the free and open Internet.

Project Galileo: We equip politically and artistically important organizations and journalists with powerful tools to defend themselves against attacks that would otherwise censor their work. This is technology already used by Cloudflare’s enterprise customers, provided at no cost.

Athenian Project: We created Athenian Project to ensure that state and local governments have the highest level of protection and reliability for free, so that their constituents have access to election information and voter registration.

Path Forward Partnership: Since 2016, we have partnered with Path Forward, a nonprofit organization, to create 16-week positions for mid-career professionals who want to get back to the workplace after taking time off to care for a child, parent, or loved one.

We also set out to help fix the foundation of the Internet by building a faster, more secure and privacy-centric public DNS resolver. It is available publicly for everyone to use, and it is the first consumer-focused service Cloudflare has ever released. Here’s the deal: we never, ever store client IP addresses. We will continue to abide by our privacy commitment and ensure that no user data is sold to advertisers or used to target consumers.

Sound like something you’d like to be a part of? We’d love to hear from you!

This position may require access to information protected under U.S. export control laws, including the U.S. Export Administration Regulations. Please note that any offer of employment may be conditioned on your authorization to receive software or technology controlled under these U.S. export laws without sponsorship for an export license.

Cloudflare is proud to be an equal opportunity employer. We are committed to providing equal employment opportunity for all people and place great value on both diversity and inclusiveness. All qualified applicants will be considered for employment without regard to their, or any other person's, perceived or actual race, color, religion, sex, gender, gender identity, gender expression, sexual orientation, national origin, ancestry, citizenship, age, physical or mental disability, medical condition, family care status, or any other basis protected by law. We are an AA/Veterans/Disabled Employer.

Cloudflare provides reasonable accommodations to qualified individuals with disabilities.  Please tell us if you require a reasonable accommodation to apply for a job. Examples of reasonable accommodations include, but are not limited to, changing the application process, providing documents in an alternate format, using a sign language interpreter, or using specialized equipment.  If you require a reasonable accommodation to apply for a job, please contact us via e-mail at [email protected] or via mail at 101 Townsend St. San Francisco, CA 94107.

About Cloudflare

Cloudflare is the simplest way to make websites faster, safer and smarter. Millions of websites have signed up for our service, including large enterprises, major consumer destinations, and government agencies. With offices in San Francisco and London, Cloudflare operates a highly-available global network that has security measures built into every layer and regularly clocks in lightning-fast speeds.

We're on a mission to build a better web, and we need smart, talented people to join our team. Our team works at the forefront of leading technologies, including nginx and the Go and Lua programming languages. We're a strong supporter of the open source community and regularly share our technology learnings at https://blog.cloudflare.com.