PostHog

Site Reliability Engineer (Remote US)

Remote US
Kubernetes Terraform SQL AWS Kafka PostgreSQL Redis
Description

Help us to increase the number of successful products in the world!

About PostHog

PostHog helps engineers build better products. We are a single platform to analyze, test, observe, and deploy new features. We give engineers product analytics, session recording, feature flags, A/B testing, event pipelines, SQL access, and a data warehouse… and there’s plenty more to come.

PostHog was created as an open-source project during Y Combinator's W20 cohort and had the most successful B2B software launch on Hacker News since 2012 - with a product that was just 4 weeks old. Since then, more than 50,000 companies have installed the platform. We've had huge success with our paid upgrades, raised $27m from some of the world's top investors, and have shown strong product-led growth - 97% driven by word of mouth.

Despite the 📉 tech market, we're default alive and doing better than ever! We average 10% monthly revenue growth and are on track for $10m ARR in early 2024. While others are focused on layoffs and struggling to grow into huge valuations, we're focusing on building an awesome product for end users, hiring a handful of exceptional team members, and seeing fantastic growth as a result.

What we value

  • We are open source - building a huge community around a free-for-life product is key to PostHog's strategy.

  • We aim to become the most transparent company, ever. In order to enable teams to make great decisions, we share as much information as we can. In our public handbook everyone can read about our roadmap, how we pay (or even let go of) people, what our strategy is, and who we have raised money from. We also have regular team-wide feedback sessions, where we share honest feedback with each other.

  • Working autonomously and maximizing impact - we don’t tell anyone what to do. Everyone chooses what to work on next based on what is going to have the biggest impact on our customers.

  • Solve big problems - we haven't built our defining feature yet. We are all about acting fast, innovating, and iterating.

Who we’re looking for

We’re looking for a security-focused Site Reliability Engineer to join our Infrastructure team in scaling the foundations of the highly available and flexible cloud platform that PostHog runs on. You will be part of the team responsible for maintaining our AWS/Kubernetes-based infrastructure and making sure it scales to the next 10x milestone.

This isn't someone who walks around telling people to change their passwords regularly. You see security and compliance as a feature of the platform rather than a checkbox to be filled, developing novel solutions that keep engineers moving fast, yet safe.

What you’ll be doing

  • Improving our constantly evolving cloud infrastructure to support new products and ideas

  • Solving security and compliance issues with technical solutions that don't hinder the pace of product development

  • Working with tools such as Envoy, ArgoCD, Karpenter or anything else that enables us to reliably and safely deploy changes

  • Working closely with Product and Pipeline teams to provide guidance and build solutions that allow self-service of essential infrastructure and monitoring tools

Example issues

Almost everything at PostHog is built in public - this isn't as true for infrastructure work, as it often involves sensitive content. Nonetheless, here are some example headlines of recent work:

  • Secure all internal services with Tailscale

  • Enable Canary deploys for a gradual rollout of services

  • Migrate to Kafka S3 tiered storage

  • Configure PostHog to deploy mono-repo services only when they individually change

Requirements

  • Experience managing large-scale cloud infrastructures (AWS in particular)

  • Experience with a range of database technologies such as Postgres, Kafka, Redis, ClickHouse, S3, etc.

  • Deep knowledge of Kubernetes, and associated tooling such as Helm

  • Motivation to work with other engineering teams to understand their goals and raise the bar of what can be solved by infrastructure

  • Infrastructure as Code with tools like Terraform is your default way of working

Nice to have

  • Experience working with SOC2, HIPAA or other regulatory frameworks

  • Experience scaling and working with ClickHouse

Benefits

What we offer in return:

🛫 Regular team off-sites (we went to Iceland in March) with carbon offsetting for work travel with Project Wren


We believe people from diverse backgrounds, with different identities and experiences, make our product and our company better. That’s why we dedicated a page in our handbook to diversity and inclusion. No matter your background, we'd love to hear from you!


Also, if you have a disability, please let us know if there's any way we can make the interview process better for you - we're happy to accommodate!
