Tamr

Site Reliability Engineer

Cambridge, MA
Docker Kubernetes Spark Python Azure Terraform AWS GCP Ansible
Description
At Tamr we envision a world where people in organizations have accurate, up-to-date, mastered data to deliver impactful business outcomes. We assert that the best way to have such data is via a human-guided machine-learning system that improves over time. Our agile data mastering platform provides the core set of tools enabling companies to leverage their existing domain expertise and data to be used as fuel for decision-making.

We are currently looking for a Site Reliability Engineer to join our SRE team as we continue to evolve and expand the Tamr data mastering platform and tool suite. With our growing customer base and increasing demand for cloud and hybrid-cloud offerings, we are growing our SRE team to support the development and delivery of new products, deployment to cloud environments, and incorporation of third-party technologies and tools to enable product engineering.

You will play a key role in designing and delivering solutions that will make Tamr SaaS offering scalable, featureful, resilient, and secure while providing guidance and mentorship to your team. You'll design and operate automation software to provision, upgrade, monitor, and heal Tamr SaaS deployed on various public cloud platforms such as Google Cloud Compute, Amazon Web Services, and Microsoft Azure.

As an SRE, you will participate in a global uninterrupted rotation and help lead incident management, root cause analysis, continuous improvement activities, and managing engineering efforts against a service-level agreement (SLA) and error budget. 

This position reports to the head of SRE.

As a member of the SRE team some of the projects you will be working on are:
-Manage Tamr SaaS in development and production hosted on public cloud platforms.
-Respond to incidents, facilitate post-mortems and ensure closure of follow-up actions items.
-Develop and drive real-time observability solutions that provide visibility into system health.
-Partner with development teams to improve services through rigorous testing and release procedures.
-Participate in system design consulting, platform management, and capacity planning.
-Balance feature development speed and reliability with well-defined service level objectives.
-Create and maintain self-provisioning infrastructure using tools like Ansible, Terraform, and Docker.
-Improving robustness by automation of workflows, process improvements, CI/CD pipelines, and integrating modern toolsets.
-Participate in a 24x7 on-call rotation.

You might be a good fit if you have 3 or more of the following:
-1+ years of experience in DevOps/SRE/Systems Administration with some experience with Linux/Unix systems administration.
-1+ years of experience with cloud-based provisioning, monitoring, and troubleshooting (preferably AWS or GCP).
-1+ year(s) of Docker and Kubernetes or OpenShift experience.
-Familiarity with infrastructure automation tools like Terraform and Ansible.
-Experience with one or more scripting languages such as Python
-Minimum Bachelor's degree in Computer Science or equivalent.

Technologies we use:
-Multi-cloud (GCP/AWS/Azure)
-Git, GitOps, Terraform, Ansible
-Kubernetes, Helm, Istio, Docker
-Big Data Technologies (BigTable/HBase, Dataproc/Databricks/Spark)
-PostgreSQL, BigQuery, Snowflake, Synapse
-Java, Python, Scala
-Spinnaker, Jenkins

Additional Information 
This position is based at our office in Cambridge MA. Tamr does sponsor employees requiring a visa. 

Tamr provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, disability, genetic information, marital status, amnesty, or status as a covered veteran in accordance with applicable federal, state and local laws.
Tamr
Tamr
Analytics Data Integration Database Enterprise Software

0 applies

3 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

πŸ₯³πŸ₯³πŸ₯³ 401 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. πŸ› οΈ
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. πŸš€
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. πŸ“…

What Fellow Engineers Say