Site Reliability Engineer - Observability

London, England, GB
Company: Yelp
Category: Architecture and Engineering Occupations
Published on 2021-06-14 18:03:47

Yelp is looking for an experienced Site Reliability Engineer to guide our engineering teams toward a reliable and efficient future. 

The Production Observability team collaborates with groups all across Yelp Engineering to improve visibility into the health of business-critical services and infrastructure, with the goal of reducing the amount of time it takes to diagnose issues in production.

We do this by combining metrics, traces and logs to build a platform that helps teams understand how their systems behave in production, not only when they’re broken. 

Yelp engineering culture is driven by our values: we’re a cooperative team that values individual authenticity and encourages “unboring” solutions to problems.

New hires are expected to deploy working code in their first week, and their impact only grows from there with the support of their manager, mentor and team. At the end of the day, we are all about helping our users, growing as engineers, and having fun in a collaborative environment.

Where You Come In:

  • Design, build and deploy software systems that run 24/7 at scale.
  • Drive best practices by coordinating across many teams and teaching other engineers how to investigate problems.
  • Build tooling to identify hot spots and regressions across the infrastructure that put Yelp products at risk.
  • Dive deep into our large service-oriented architecture to make it transparent, measurable and tunable.
  • Optimize our workflows and products systematically through automation.
  • Participate in light on-call rotations - we have geographically distributed SRE teams for follow-the-sun support, which means no 2:00 AM pages!

What it Takes to Succeed:

  • Experience as a software engineer, with an interest in observability and DevOps.
  • Familiarity with performance analysis tools (e.g. tracers, profilers, debuggers, visualization tools).
  • Fluency in Python, C, C++, Java, or a similar language.
  • Experience building and supporting large-scale distributed systems that back a consumer app or website.
  • Experience exploring datasets and turning performance metrics into easily understood data visualizations.
  • Familiarity with real-user and/or synthetic performance monitoring.

What You'll Get:

  • Full responsibility for projects from day one, an awesome team, and a dynamic work environment
  • Competitive salary with equity in the company, a pension scheme, and an optional employee stock purchase program
  • 25 days paid holiday initially, rising to 29 with service
  • Private health insurance, including dental and vision
  • Flexible working hours and meeting-free Wednesdays
  • Regular 3-day Hackathons and weekly learning groups, always with interesting topics
  • Opportunities to participate in events and conferences throughout Europe and the US
  • Public transportation season ticket loan and £58 per month toward any exercise of your choice
  • Central location, a fully stocked kitchen, adjustable sitting/standing desks, quarterly offsites, locally roasted coffee, happy hours, and more! 

Yelp values diversity. We’re proud to be an equal opportunity employer and consider qualified applicants without regard to Age, Disability, Gender Reassignment, Marriage or Civil Partnership, Pregnancy and Maternity, Race, Religion or Belief, Sex.
