Company Description
Etsy is the global marketplace for unique and creative goods. We build, power, and evolve the tools and technologies that connect millions of entrepreneurs with millions of buyers around the world. As an Etsy Inc. employee, whether a team member of Etsy, Reverb, or Depop, you will tackle unique, meaningful, and large-scale problems alongside passionate coworkers, all the while making a rewarding impact and Keeping Commerce Human.
What’s the role?
We are looking for an engineer with a strong background (or interest) in building observability platforms for humans.
As the Software Engineer II, SRE (for Observability) you will be building and supporting our telemetry stack which consists of Prometheus, Grafana, BigQuery, StatsD, Vector, Google Cloud Logging, Honeycomb. We are having an ambitious roadmap to improve visibility of services and infrastructure running across VMs and Containers . As a team, we are a strategic advantage for Etsy by building the future of observability.
You will be playing an instrumental role in crafting the future architecture of how we run our systems in the cloud while being part of a dynamic international team.
This is a full-time position reporting to the Senior Manager, Core Infra. In addition to salary, you will also be eligible for an equity package, an annual performance bonus, and our competitive benefits that support you and your family as part of your total rewards package at Etsy.
This role requires your presence in Etsy’s Dublin office once or twice per week depending on your proximity to the office. Candidates living within commutable distance of our Dublin office, may be the first to be considered. Learn more details about our work modes and workplace safety policies here.
What’s this team like at Etsy?
The team is committed to craft and maintain critical systems that power our 91.2M active buyers and 6.2M active sellers, with a goal of 99.95% uptime.
This team’s mission is to build tools and services to collect, store and analyze telemetry data. We optimize for time to detect, diagnose and resolve issues.
We work closely with developers on best practices for instrumentation because we understand that preventing issues is a balance over many tradeoffs. We are a force multiplier for the entire engineering organization.
You’ll be creating and maintaining an observability suite of tools that enable our engineers to deploy 100+ times every week and gather deep insights on the infrastructure’s reliability. We believe the future of Observability is no longer disconnected across 3 pillars of metrics, logs, and tracing, but is a single braid of interconnected data.
What does the day-to-day look like?
Working with an extensive amount of telemetry (Petabytes of logs, millions of metrics, and traces) that is growing rapidly.
Design systems that operate reliably enabling our customers to hit 99.95% availability
Build best-in-class developer tooling by keeping Observability human
Understanding and solving real business needs at a large scale by applying your software engineering and analytical problem-solving skills.
Work with the Engineering Team at large to gain a deep understanding of Etsy Engineering, i.e. security, product engineering, risk, database infrastructure teams, and understand up/downstream impacts
Design, develop and implement highly scalable and maintainable systems by contributing at all levels of our infrastructure stack using technologies like Chef, Terraform, Kubernetes, GoLang, Prometheus, Envoy, OTel
Of course, this is just a sample of the kinds of work this role will require! You should assume that your role will encompass other tasks, too, and that your job duties and responsibilities may change from time to time at Etsy's discretion, or otherwise applicable with local law.
Qualities that will help you thrive in this role are:
You are an experienced software engineer, where the last 2 years are in Observability systems/developer tooling roles, preferably in a cloud environment.
Prior experience in executing multiple large, successful projects within a team environment. Each of these projects may have taken many months or longer to see through.
Prior experience in either implementing or actively using metrics, logging and tracing systems in a professional setup is essential.
Wears every 9 of uptime as a badge of honor
Proficiency in one of the programming languages like PHP, Python, Go is essential.
Experience with cloud architectures (GCP, AWS), and exposure to Kubernetes or other container orchestration frameworks is nice to have
Hands-on experience with Infrastructure As Code tooling like Terraform and configuration management tooling like Chef/Ansible is essential.
Have a “leave it better than you found it” mentality, and are willing to work with and improve on code you did not originally write.
Additional Information
What's Next
If you're interested in joining the team at Etsy, please share your resume with us and feel free to include a cover letter if you'd like. As we hope you've seen already, Etsy is a place that values individuality and variety. We don't want you to be like everyone else -- we want you to be like you! So tell us what you're all about.
Our Promise
At Etsy, we believe that a diverse, equitable and inclusive workplace furthers relevance, resilience, and longevity. We encourage people from all backgrounds, ages, abilities, and experiences to apply. Etsy is proud to be an equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. If, due to a disability, you need an accommodation during any part of the interview process, please let your recruiter know. While Etsy supports visa sponsorship, sponsorship opportunities may be limited to certain roles and skills.
Other Jobs from Etsy
Software Engineer II, Web Inventory Buyer Experience
Senior iOS Engineer I, Buyer Ads Experience
Senior Product Manager, Mobile App Foundations
Senior Product Manager, Mobile Performance and Reliability
Similar Jobs
Senior Software Engineer - Ansible
Senior Platform Engineer
Senior Site Reliability Engineer (Turkey)
Senior Site Reliability Engineer (LATAM)
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say