Two95 International

Site Reliability Engineer

Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia George Town, Penang, Malaysia
Python Go C++ Java SQL AWS GCP Azure Terraform Ansible Docker Kubernetes Prometheus Grafana ELK Splunk OpenTelemetry
Description

Site Reliability Engineer (SRE) - Ads / Monetization Platform

Location: Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia, George Town, Penang, Malaysia, Johor Bahru, Johor, Malaysia

Department: information Technology

Workplace: on_site

Employment Type: full

Description

Role Summary

As a Site Reliability Engineer (SRE), you will build and operate highly available, globally distributed advertising/monetization services. You will improve reliability, scalability, and operability through automation, observability, incident management, and sound engineering practices.

Key Responsibilities

  • Own reliability across the service lifecycle: design reviews, capacity planning, launch, deployment, operations, and continuous improvement.
  • Build and operate highly available services across multiple regions/data centers; improve resilience, latency, and scalability.
  • Develop automation and tooling to reduce toil (deployment, remediation, runbooks, self-healing) using scripting and software engineering best practices.
  • Define and implement SLOs/SLIs/SLAs; create dashboards and alerting to track service health (availability, latency, errors, saturation).
  • Lead sustainable incident response: triage, mitigation, root-cause analysis (RCA), and blameless postmortems with actionable follow-ups.
  • Collaborate with software engineering, security, and compliance stakeholders to meet data governance and regulatory requirements.

Requirements

Must-have Qualifications

  • 3+ years of experience in SRE, DevOps, systems engineering, or production operations for large-scale services.
  • Strong coding skills in one language: Python or Go or C++ (Java acceptable).
  • Solid Linux/Unix fundamentals: processes, memory/CPU, filesystems, permissions, and troubleshooting.
  • Networking fundamentals in cloud environments: TCP/IP, DNS, HTTP/HTTPS, load balancing, basic security concepts.
  • SQL proficiency and experience with data workflows/ETL is a plus for ads/analytics-related systems.
  • Strong communication, ownership mindset, and ability to work effectively across global teams.

Preferred Qualifications

  • Experience supporting advertising, recommendation, or high-traffic consumer internet platforms.
  • Hands-on experience with cloud platforms (AWS/GCP/Azure) and infrastructure-as-code (Terraform/Ansible).
  • Experience with containers and orchestration (Docker, Kubernetes).
  • Observability experience with tools such as Prometheus, Grafana, ELK/Splunk, OpenTelemetry.
  • Experience operating large data systems (streaming, distributed storage/compute) and performance tuning.

Two95 International
Two95 International

0 applies

0 views

There are more than 50,000 engineering jobs:

Subscribe to membership and unlock all jobs

Engineering Jobs

60,000+ jobs from 4,500+ well-funded companies

Updated Daily

New jobs are added every day as companies post them

Refined Search

Use filters like skill, location, etc to narrow results

Become a member

🥳🥳🥳 452 happy customers and counting...

Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.

To try it out

For active job seekers

For those who are passive looking

Cancel anytime

Frequently Asked Questions

  • We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
  • We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
  • We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
  • We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
  • Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
  • Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
  • Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅

What Fellow Engineers Say