CLOSED FOR APPLICATIONS

Site Reliability Engineer / Senior Site Reliability Engineer, Environment Reliability

Closing: Nov 13, 2022

This position has expired

Published: Nov 11, 2022

Job Requirements

You may be a fit for this role if you:

  • Have experience in running and operating production workloads.
  • Have strong programming skills - preferably with Ruby and/or Go.
  • Have a strong background in Infrastructure as Code technologies and the libraries powering GitLab, such as Terraform and Ansible. Experience with data templating tools such as Jsonnet is a bonus.
  • Are able to reason about large systems - how they work, how they can be operated at scale, and their edge cases, failure modes and behaviors.
  • Share our values, and work in accordance with those values.


Responsibilities

  • Build: Automating every operational task is a core requirement for Environment Automation SRE. Examples include package updates, configuration changes across all customer platforms without interruptions, and tools for automatic provisioning of customer-facing services.
  • Respond: Respond to user emergencies, platform alerts and support requests.
  • Maintain: Develop an effective early-warning system and tooling that allows for reliable and quick maintenance tasks, such as library upgrades and version migrations.
  • Plan: Develop monitoring and alerting systems that predict capacity needs based on customer usage patterns. Plan for new service rollouts and the expansion of existing services, and prepare advice for customers on optimising their resource consumption.
  • Collaborate: Work with other engineering stakeholders on resolving larger architectural bottlenecks, contributing a large-scale operational point of view. Work in close collaboration with software development teams to shape the future roadmap and establish strong operational readiness across teams.

