Site Reliability Engineer

Description

Bugsnag is growing! We're looking for a Site Reliability Engineer to join our small, talented team in Bath, UK. Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. We use our engineering skills and knowledge to build tools and processes to keep Bugsnag online 24/7, making our systems self-heal wherever possible to limit the impact of being on-call.

About us

Bugsnag is the trusted software stability “command center” for over 5,000 engineering teams worldwide, including Airbnb, Slack, Pinterest, Lyft, Yelp, and Pandora. We process over 1 billion crash reports daily from 85,000 applications, and empower our customers to make data-driven decisions on when to focus on building new features, or when to fix bugs. Founded in 2013 by James Smith and Simon Maynard, Bugsnag is backed by leading investors Matrix Partners, Benchmark Capital, and GV. 

  • We value ownership, personal development, and transparency. We believe in empowering our employees to take ownership of their work while being 100% transparent through instantaneous feedback. 
  • The latest tech! We believe in finding the best technology to power our business.
  • We're located in Downtown San Francisco (one block from BART and MUNI at Montgomery Street) and at the edge of the city centre in Bath.
  • Competitive salary/benefits - lunch on us every day, pension, annual conference attendance, and your choice of Apple hardware.
  • We offer all new Bath hires a trip out to San Francisco to meet the team. 

Responsibilities

  • Ensure that all the microservices and databases at Bugsnag are observable, introducing monitoring in the right places to help us have visibility into what is happening in our production systems.
  • Influence the design of new and existing microservices to ensure they will be observable, stable and performant.
  • Maintain our databases and help to keep them performant by making changes to their configuration and deployment, as well as looking into the system's usage of them.
  • Automate tasks to keep the Bugsnag systems resilient with minimal manual intervention.
  • Validate our infrastructure to ensure it is highly available and recoverable.
  • Ensure we have best practice security in place to protect our infrastructure and customer data.

About you

  • You have 5+ years of engineering experience.
  • You have experience in one or more of the following languages: C#, Java, Python, Go, JavaScript or Ruby.
  • You have experience with multiple datastores (we use MongoDB, Redis, and Elasticsearch).
  • You are comfortable working with and maintaining a Linux or other *nix flavour system.
  • You are quick to learn new skills and solve problems.
  • You have the ability to take ownership of all stages of a project from architecture/design through implementation to delivery.
  • You have experience working within a team and communicate well with others.
  • You are willing to go the extra mile to make other developers more efficient.

Bonus points for

  • Experience with Docker/Kubernetes.
  • Experience with Terraform/Packer/Vagrant.
  • Experience with Chef/Puppet/Ansible.
  • Experience with MongoDB/Redis/Elasticsearch.
  • Experience with RabbitMQ/Kafka.
  • Experience with microservices in production.

At Bugsnag we truly believe that employing individuals with various experiences, perspectives, and backgrounds will lead to a finer environment for our employees, our products, and our community. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements.

Want to apply later?

Type your email address below to receive a reminder

Apply to Job

ErrorRequired field
ErrorRequired field
ErrorRequired field
ErrorRequired field
Error
insert_drive_file
insert_drive_file
Error
ErrorRequired field