Kentik

NOC Site Reliability Engineer - Europe

Remote Europe
Ruby GCP React PostgreSQL Puppet Chef Bash Python Kafka gRPC Git AWS API Azure Redis MySQL Microservices Ansible Go Node.js
Description

Who we are

Kentik is the network observability company. Our platform is a must-have for the network front line, whether digital business, corporate IT, or service provider. Network professionals turn to the Kentik Network Observability Cloud to plan, run, and fix any network, relying on our infinite granularity, AI-driven insights, and insanely fast search.

Kentik makes sense of network, cloud, host, and container flow, Internet routing, performance tests, and network metrics. We show network pros what they need to know about their network performance, health, and security to make their business-critical services shine. Networks power the world’s most valuable companies, and those companies trust Kentik. Market leaders like IBM, Box, and Zoom rely on Kentik for network observability. Visit us at kentik.com and follow us at @kentikinc.

What we do

Kentik is looking for an entry to mid-level Site Reliability Engineer to join our Technical Operations team. This team is primarily responsible for monitoring and maintaining the infrastructure and services that power the Kentik Network Observability Platform. We’re looking for enthusiastic learners who will work with and learn from engineering teams across the company as you coordinate issue response and resolution.
 
Kentik is a fully remote, global team and company across many countries and time zones. We operate a well-organized, well-instrumented platform, and offer enormous opportunities for employee growth, including mentorship from senior SREs and developers. 
 
*This is a remote role. Due to our on-call schedule (follow-the-sun), working hours in European time zones is a requirement for this position. 

What you'll do

  • Being part of a real-time, scalable, microservices-based infrastructure, running on open source software, across multiple locations and all major cloud vendors.
  • You will be part of a follow-the-sun on call incident rotation. This involves incident triage, compiling postmortems, RCAs and providing input at all stages.
  • Deep-diving into diverse topics, from NetFlow and IP routing, to database replication strategies or HTTP optimization.
  • Contribute code, code reviews and tools or patches to all kinds of existing code.
  • Collaborate with team members in an asynchronous, remote environment using tools such as email, Google Docs, Slack, Zoom, Git, and more.
  • Provide valuable feedback on team goals, projects, and processes. We believe in continuously improving our team.
  • Write design documents or collaborate on colleagues’ docs to introduce new features or changes into our infrastructure.
  • Managing vendor communications for hardware and scheduling data center technician visits.

What you'll bring

Studies have shown that some candidates tend to apply to jobs only if they meet 100% of the qualifications. We encourage you to apply if you meet most of the criteria - even if you don’t match all of the qualifications, your skills and experience could be valuable in this role!

  • 2+ years of experience in Systems Administration, Datacenter/IT and/or SRE related projects
  • Fluent in English
  • Experience working with *nix system command line (e.g. ssh, grep, awk)
  • Understanding of how HTTP works (TLS, headers, proxying)
  • Any experience with or desire to learn about microservices and containerization
  • Networking experience: Terms such as routes and iptables sound familiar
  • A passion for documenting code, processes, and infrastructure in runbooks and wikis
  • Experience working with a configuration management (infrastructure as code) platform such as: Ansible, Puppet, Chef, SaltStack or CFEngine
  • A preference to automate your way out of tedious and repetitive tasks - humans are terrible at repetition
  • Some familiarity with coding in Bash, Python, Ruby, or Go
  • Experience with public cloud (AWS, GCP, Azure, etc.) architectures and technologies

Our tech stack

  • Our core data engine and platform are primarily written in Go
  • We use Node.js + Express for application serving, and React as our primary UI framework
  • We also use some JS and Python for tooling/scripting
  • In addition to our own database, we use Postgres, Kafka, Mysql, and Redis
  • Internal and public APIs expose both rest/json and gRPC endpoints
  • Haproxy, Envoy for API traffic routing and balancing
  • Github for source control, PRs, issues
  • Jenkins for automated builds

What we offer

Kentik is a fully remote company that operates globally. We seek professionals that will help us thrive as an organization, and in turn, to broaden and enhance your career. We’re very thorough in the interview process to understand your skills and how they will relate to your successful growth here at Kentik. Our compensation philosophy encompasses a fair program for all in order to attract, engage and retain talented individuals who will drive our business and wow our customers.

In addition to a great career opportunity, Kentik offers stellar benefits for our employees, which include:

  • 100% of premiums are paid by company for health, vision and dental coverage for you and your dependents
  • Additionally, an annual Health Reimbursement Account (HRA) of $3,000 for an individual or $4,500 for a family
  • Paid family & medical leave 
  • Open PTO, a quarterly Wellness Day, and a minimum of 10 paid holidays
  • 401(k) retirement account
  • Home office reimbursement 
  • Stock options

Note: Benefits are as listed for all US full-time employees. For compensation, international applicants will be treated equitably in relation to the laws applicable within the countries in which we operate.

 

Come work with us

The true meaning of Kentik is visibility. We’re committed to making sure everyone feels empowered to use their voice, has a sense of belonging, and is represented at Kentik. 

We don’t look for individuals who fit the culture, but those who will continue to add to the culture. 

We encourage everyone to apply, especially those individuals who are underrepresented in the industry: people of color, LGBTQI+ community, women, individuals with disabilities (both seen and unseen), veterans, and people of any age or family status. 

Come as you are!

You will be working at a fast-growing, well-funded startup alongside industry thought leaders and network aficionados as we build the future of observability and set the high bar for how network operations and digital businesses should run. With a competitive salary and amazing benefits on top of the meaningful and challenging projects you’ll take on, we’re sure you’ll enjoy joining the Kentik team.

#li-remote

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

50,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 241 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

Cancel anytime / Money-back guarantee

Wall of love from fellow engineers