Canada or United States Remote (Global) USD 124,300 - 266,400 Yearly

GitLab is hiring a Site Reliability Engineer

About the Role

Beacon Biosignals is looking for a Site Reliability Engineer to ensure our user-facing services and production systems are reliable, scalable, and efficient. In the Environment Automation specialization, you will operate and automate hundreds of GitLab environments from provisioning to day-to-day maintenance.

What You'll Do

  • Design and implement automation to provision and manage hundreds of isolated GitLab environments using Terraform, Ansible, and Kubernetes.
  • Troubleshoot issues across Kubernetes clusters, cloud services, and GitLab applications to identify root causes of failed deployments, crash loops, and scheduling conflicts.
  • Replace manual workflows with infrastructure-as-code solutions for automated version upgrades, configuration rollouts, and provisioning pipelines.
  • Build observability systems using tools like Prometheus, ELK, and Grafana to detect bottlenecks, predict usage trends, and optimize resource consumption.
  • Lead incident response and postmortem efforts, applying technical depth to resolve issues and establish operational standards.
  • Influence architectural decisions around automation, scalability, and operational excellence. Partner with engineering teams to improve automation, platform resilience, and production-readiness.

What We're Looking For

  • Proven ability to operate and troubleshoot production workloads across multiple tenants or environments.
  • Strong hands-on experience with Terraform, including workspace strategies, state management, and scalable automation patterns.
  • Skilled at diagnosing deployment failures, interpreting pod logs, and debugging scheduling issues and rollback scenarios in Kubernetes production environments.
  • Ability to read and debug code in Go and/or Ruby.
  • Experience supporting infrastructure for many customers or environments simultaneously.
  • Able to reason through complex systems and operational challenges. Brings on-call experience.
  • Proven ability to work across teams and with internal or external customers to solve technical problems.
  • Comfortable using GitLab as a daily tool for infrastructure automation, collaboration, and operational workflows.

Nice to Have

  • Experience with Ansible and templating tools like Jsonnet.

Technical Stack

  • Terraform, Ansible, Kubernetes, Helm Charts, omnibus-gitlab
  • GCP, AWS
  • Prometheus, ELK, Grafana
  • Go, Ruby, Jsonnet

Team & Environment

Part of the Dedicated team, focused on delivering a fully managed, single-tenant GitLab experience through the GitLab Dedicated platform.

Benefits & Compensation

  • Compensation: $124,300—$266,400 USD
  • Benefits to support your health, finances, and well-being
  • Flexible Paid Time Off
  • Team Member Resource Groups
  • Equity Compensation & Employee Stock Purchase Plan
  • Growth and Development Fund
  • Parental leave
  • Home office support

Work Mode

This role is open to candidates in the United States and operates in a global work mode.

GitLab is proud to be an equal opportunity workplace and is an affirmative action employer.

Required Skills
TerraformKubernetesGoRubyAnsibleHelmGCPAWSPrometheusELKGrafanaGitLabInfrastructure as Code
Your first international client?

Don't lose them over invoicing

Clients ghost freelancers with unprofessional invoicing. Glopay gives you a real EU company partnership so they take you seriously from invoice #1.

Instant EU company partnership
Invoice builder with your branding
Automated payment reminders
Real-time payment tracking
Get EU company now
Ready in 24 hours
About company
GitLab
GitLab is the intelligent orchestration platform for DevSecOps, enabling organizations to increase developer productivity, improve operational efficiency, reduce security and compliance risk, and accelerate digital transformation. The platform is used by over 50 million registered users and more than 50% of the Fortune 100.
All jobs at GitLab Visit website
Job Details
Department Information Technology
Category infrastructure
Posted 2 months ago