The Hashgraph Group

Systems and Monitoring Engineer (Remote)

Remote India
Prometheus Grafana PagerDuty Terraform Ansible AWS GCP Splunk Datadog Linux Docker Kubernetes Python Go Bash API CI/CD Machine Learning Deep Learning Streaming
Description

Systems & Monitoring Engineer

Department: AMS

Employment Type: Permanent employee

Schedule: Full-time

Seniority: Experienced

Location: India, Remote

The Hashgraph Group (THG) is a global organization headquartered in Switzerland, and is a part of the Hedera Hashgraph (“Hedera”) ecosystem.  
 
Hedera is a revolutionary proof-of-stake public Distributed Ledger Technology (DLT) network that is fast emerging as the gold standard in DLT for enterprise-grade solutions and decentralized applications (dApps). Hedera is governed by a council of the world’s leading organizations - which include Google, Boeing, IBM, Dell, Deutsche Telekom, LG, Abrdn, London School of Economics,  to name a few. 
 
THG works closely with enterprises, startups, governments, and academic and training institutions around the world to deliver financing, custom-design solutions, and professional training and innovation programs, aimed at accelerating the development and utilization of the Hedera Hashgraph network. 

About the Role:

Are you passionate about next-gen observability, automation, and operational excellence? As our Systems & Monitoring Engineer, you’ll architect and own the monitoring stack for our Hedera-based ecosystem, blending classic NOC best practices with the unique challenges of DLT and Web3. You’ll be the technical backbone ensuring uptime, resilience, and regulatory compliance for our global support teams. 

What You’ll Do 

1) Web3 Observability

  • Design, deploy, and maintain monitoring solutions (Prometheus, Grafana) for DLT-specific metrics (consensus finality, node health, on-chain activity). 

  • Build custom exporters and dashboards for real-time, actionable insights. 

  • Distinguish between infrastructure and protocol health to ensure meaningful alerts. 

2) Incident Response & Compliance

  • Integrate and manage PagerDuty for rapid, automated incident response. 

  • Implement DORA-compliant processes, including automated “kill switches” and regular disaster recovery drills. 

  • Maintain clear, actionable runbooks for support teams. 

3) Automation & Infrastructure as Code

  • Deploy and manage Mirror Nodes and RPC relays using Terraform/Ansible across AWS/GCP. 

  • Build CI/CD pipelines for support tooling and state proof verification. 

  • Automate critical response actions for rapid threat mitigation. 

4) NOC Leadership

  • Serve as the L3 escalation point for complex incidents (“ghost transactions,” API anomalies). 

  • Perform root cause analysis using logs (Splunk, Datadog) and collaborate with cross-functional teams. 

What You Bring 

  • 4+ years in DevOps, SRE, or NOC roles (with 1–2 years in Web3/Blockchain environments). 

  • Deep expertise in Prometheus/Grafana, Linux, Docker/Kubernetes, and scripting (Python, Go, Bash). 

  • Proven experience with cloud platforms (AWS/GCP) and IaC tools (Terraform). 

  • Strong understanding of Hedera Hashgraph or EVM-based chains, and ability to interpret ledger APIs. 

  • Familiarity with ITIL/ITSM, DORA, SOC2, or ISO 27001 frameworks.

What we offer
  • A unique opportunity to be a part of the world’s leading DLT ecosystem
  • Significant career growth potential in a fast growing sector
  • Working with colleagues and on projects across the globe
  • Open and direct communication, flat structures
  • Flexible working hours
  • Competitive salary package
The Hashgraph Group
The Hashgraph Group

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say