Intermediate Site Reliability Engineer, Tenant Scale: Tenant Services

GitLab

📍 Remote, Americas; Remote, EMEA 💼 Fully Remote 📅 Posted: 2026-01-10 📊 Source: Greenhouse (GitLab)

Job Description

GitLab is an open-core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute to and co-create the software that powers our world. When everyone can contribute, consumers become contributors, significantly accelerating human progress. Our platform unites teams and organizations, breaking down barriers and redefining what's possible in software development. Thanks to products like Duo Enterprise and Duo Agent Platform, customers get AI benefits at every stage of the SDLC. 

The same principles built into our products are reflected in how our team works: we embrace AI as a core productivity multiplier, with all team members expected to incorporate AI into their daily workflows to drive efficiency, innovation, and impact. GitLab is where careers accelerate, innovation flourishes, and every voice is valued. Our high-performance culture is driven by our values and continuous knowledge exchange, enabling our team members to reach their full potential while collaborating with industry leaders to solve complex problems. Co-create the future with us as we build technology that transforms how the world develops software.

An overview of this role

As a Site Reliability Engineer (SRE) at GitLab, you keep GitLab.com and other production systems running smoothly for millions of users by combining pragmatic operations with strong software engineering practices. You focus on the systems layer (operating systems, storage, networking), edge services, and Kubernetes workloads, designing and operating highly scalable, reliable, and secure infrastructure that supports one of the largest single-tenancy open source SaaS sites on the Internet. You’ll work across the Infrastructure organization to automate away toil, improve availability and performance, and respond to incidents during your local daytime hours as part of a globally distributed on-call rotation. In t...

Skills & Technologies

Design, EMEA, Engineering, Fully Remote, Infrastructure, Kubernetes, Lead, Operations, Platform, Product