Dremio

Senior Site Reliability Engineer

Lisbon, Portugal
Go Git AWS GCP Azure Kubernetes Java Python
This job is closed! Check out or
Description

Be Part of Building the Future

Dremio is The Easy and Open Data Lakehouse, providing self-service analytics with data warehouse functionality and data lake flexibility across all of your data. Dremio increases agility with a revolutionary data-as-code approach that adopts Git concepts to enable data experimentation, version control, and governance. In addition, Dremio breaks down data silos by simplifying ingestion into the lakehouse, and also allowing queries directly on databases and data warehouses. All of this is available through a fully managed service that not only eliminates the need to maintain infrastructure and software, but also automatically optimizes the data in the lakehouse to maximize performance for every workload.

Founded in 2015, Dremio is headquartered in Santa Clara, CA. Investors include Cisco Investments, Insight Partners, Lightspeed Venture Partners, Norwest Venture Partners, Redpoint Ventures, and Sapphire Ventures. For more information, visit www.dremio.com. Connect with Dremio on GitHubLinkedInTwitter, and Facebook.

If you, like us, say “bring it on” to exciting challenges that really do change the world, we have endless opportunities where you can make your mark.

About the role

We're looking for a Senior SRE, who will be involved in exciting technical challenges by analyzing, troubleshooting, and designing vital services, platforms, and infrastructure while always thinking about reliability, scalability, resilience, security, and performance. We believe that our role is to be continually learning: improving our understanding of and ability to safely operate our service and provide the best possible experience for our users. Psychological safety and a blameless culture are critical components of our SRE culture.

What you’ll be doing

  • Evangelize and advocate for reliability practices across our organization
  • Collaborate with other Engineering teams to support services before they go live through activities such as system design consulting, developing software platforms and frameworks, monitoring/alerting, capacity planning and production readiness reviews
  • Ability to debug and optimize code and automate routine tasks: reduce toil!
  • Analyze and optimize our core product by developing and implementing reliability and performance practices
  • Scale systems sustainably through automation and evolve systems by pushing for changes that improve reliability and velocity
  • Be on-call for production services
  • Practice sustainable incident response and blameless retrospectives

What we’re looking for

  • 5 years of relevant experience in the following areas: SRE, DevOps, Cloud Operations, Systems Engineering, or Software Engineering
  • Excellent command of cloud services on AWS/GCP/Azure, Kubernetes and CI/CD pipelines
  • Experience with monitoring/alerting Prometheus, Thanos, Victoria Metrics, Grafana, vmrules)
  • Have moderate-advanced experience in Java, C, C, Python, Go or other object-oriented programming languages
  • Interested in designing, analyzing and troubleshooting large-scale distributed systems
  • Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive
  • Great ability to debug and optimize code and automate routine tasks
  • Solid background in software development and architecting resilient and reliable applications
  • Good communicator and comfortable working with other engineers across the organization

Bonus points if you have

  • Experience being on-call for an internet facing production system
  • Expertise in k8s, helm, yaml, GitOps, ArgoCD, Distributed Tracing Lightstep, Honeycomb, OpenTelemetry), k8s resource management (e.g. kubecost)

#LI-JW1

What we value 

At Dremio, we hold ourselves to high standards when it comes to People, Thinking, and Action. Our Gnarlies (that's what we call our employees) communicate with clarity, drive accountability, and are respectful towards each other. We confront brutal facts and focus on results while operating with a sense of urgency and building a "flywheel". People who like to jump in and drive momentum will thrive in our #GnarlyLife.

Dremio is an equal opportunity employer supporting workforce diversity. We do not discriminate on the basis of race, religion, color, national origin, gender identity, sexual orientation, age, marital status, protected veteran status, disability status, or any other unlawful factor.

Dremio is committed to providing any necessary accommodations for individuals with disabilities within our application and interview process. To request accommodation due to a disability, please inform your recruiter.

Dremio has policies in place to protect the personal information that employees and applicants disclose to us. Please click here to review the privacy notice. 

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

50,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 223 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

Cancel anytime / Money-back guarantee

Wall of love from fellow engineers