Sr. Site Reliability Engineer - Distributed Caching

at Roblox

San Mateo, CA

Every day, tens of millions of people from around the world come to Roblox to play, learn, work, and socialize in immersive digital experiences created by the community.

Our vision is to build a platform that enables shared experiences among billions of users. This is what’s known as the metaverse: a persistent space where anyone can do just about anything they can imagine, from anywhere in the world and on any device. The breadth of opportunities, and the evolving demands of this first-of-its-kind platform, ensure that your avenues for growth are always expanding and flexible.

Join us and you’ll usher in a new category of human interaction while solving exceptional challenges that you won’t find anywhere else.

As a Sr. Site Reliability Engineer - Distributed Caching, you’ll support Roblox’s global platform by designing, maintaining, and operating our large-scale caching infrastructure while contributing to our internal Infrastructure-as-a-Service offerings. You will work with a cross-functional team of engineers while having real ownership and impact.

You Have:

  • Experience designing & operating large-scale distributed systems handling millions of real-time requests per second 
  • Experience with systems configuration management and familiarity with automation tools such as Chef, Ansible, and Terraform
  • Experience deploying on top of container orchestrators like Kubernetes or Nomad and service discovery systems like Consul
  • Experience with and understanding of networking at L2-L4, including the TCP and UDP protocols
  • Experience with Linux systems and shells, daemons, and processes
  • Experience with programming languages, like Python or Go
  • Experience with telemetry stacks like TICK, and with data stores such as Memcached, Redis, or similar, is a bonus
  • BS degree (or equivalent professional experience) in Computer Science, with at least 5 years of hands-on experience

You Will:

  • Build tools to operate, monitor, maintain and scale our Redis & Memcached footprints
  • Have a leading role in designing & implementing our internal IaaS offerings on top of a container orchestrator platform

You'll Love:

  • Excellent medical, dental, and vision coverage
  • A rewarding 401k program
  • Flexible vacation policy
  • Free catered lunches five times a week and several fully stocked kitchens with unlimited snacks
  • Onsite fitness center and fitness program credit
  • Annual Caltrain Go Pass
  • A Roblox Admin badge for your avatar