Platform Engineering Associate – D&A Product Reliability & Operations (GCP / DevOps/ SRE)
Location: Mumbai, India
Time Type: Full time
Job Description
Job Description
Are You Ready to Make It Happen at Mondelēz International?
Join our Mission to Lead the Future of Snacking. Make It Uniquely Yours.
Platform Engineering Associate – D&A Product Reliability & Operations (GCP / DevOps / SRE)
Experience: 4+ years
Purpose: Support, operate, and continuously improve Data & Analytics (D&A) products running on Google Cloud (GCP) by applying a platform-as-a-product mindset to run-state ownership. Drive reliability, security, observability, and cost-aware operations through hands-on engineering (Terraform, CI/CD, monitoring, automation) and disciplined incident management.
Core Responsibilities (Product Ownership + Run-State):
•
Own day-to-day platform operational health for a defined portfolio of D&A products (pipelines, data products, analytics apps/dashboards, ML workloads). Manage the run backlog: intake, triage, prioritization, resolution, and prevention.
•
Establish and maintain runbooks, operational readiness checklists, SLAs/SLOs where applicable, and clear support documentation to enable self-service and reduce tickets.
•
Apply SRE practices: define/track SLIs/SLOs (availability, data freshness/latency, job success rate, quality signals), participate in on-call/incident response, lead structured triage, and drive post-incident corrective actions to closure.
•
Build and maintain observability: dashboards and actionable alerts across logs/metrics/traces and D&A signals (pipeline failures, SLA misses, anomalies). Reduce alert noise and improve MTTR through better instrumentation and automation.
•
Triage and remediate security vulnerabilities across product runtimes (images, libraries, pipelines, IaC). Embed security checks and compliance controls into CI/CD and support audit evidence needs.
•
Support infrastructure and environment consistency via Terraform and GCP services (IAM, networking, Compute/GKE, Storage, Monitoring/Logging).
•
Integrate FinOps fundamentals into operations: enforce tagging/labeling, identify waste (idle/oversized resources, runaway jobs), and partner with FinOps/product owners to implement optimizations.
Required Skills: Terraform (PR-based governance), strong GCP fundamentals, Kubernetes/GKE familiarity preferred, CI/CD (GitHub Actions/Jenkins), observability (Cloud Monitoring/Logging and/or Prometheus/Grafana/Datadog), security tooling exposure (Dependabot/GitHub Advanced Security, SonarQube, Wiz/Tenable), Python/Bash automation, strong troubleshooting and stakeholder communication.
How you will contribute
You will ensure that delivered services are optimized to meet business demands and the service operations strategy, plan, measure, report and communicate service improvement initiatives, and serve as a consultant on issues and resolutions. You will also recommend actions that can be taken to optimize investments and benefits and to mitigate risks. This role will require you to identify suppliers, evaluate them, on-board new vendors, establish and run vendor governance; collaborate with management and follow-up on requisitions, purchase orders, invoices, and payments; work with project resources to provide design collateral and to configure software components so they are aligned with security policy and governance; and ensure adherence to development and configuration standards and processes.
What you will bring
A desire to drive your future and accelerate your career. You will bring experience and knowledge in:
- Working collaboratively with multiple vendors
- Leading complex projects - project management
- Stakeholder management and influencing skills
- Managing infrastructure services delivery, support and excellence
- Working in global IT function with regional or global responsibilities in an environment like Mondelēz International
- Working with IT outsourcing providers using frameworks such as the IT Infrastructure Library
- Working with internal and external teams and leading when necessary
More about this role
You will ensure that delivered services are optimized to meet business demands and the service operations strategy, plan, measure, report and communicate service improvement initiatives, and serve as a consultant on issues and resolutions. You will also recommend actions that can be taken to optimize investments and benefits and to mitigate risks. This role will require you to identify suppliers, evaluate them, on-board new vendors, establish and run vendor governance; collaborate with management and follow-up on requisitions, purchase orders, invoices, and payments; work with project resources to provide design collateral and to configure software components so they are aligned with security policy and governance; and ensure adherence to development and configuration standards and processes.
Core Responsibilities (Product Ownership + Run-State):
•
Own day-to-day platform operational health for a defined portfolio of D&A products (pipelines, data products, analytics apps/dashboards, ML workloads). Manage the run backlog: intake, triage, prioritization, resolution, and prevention.
•
Establish and maintain runbooks, operational readiness checklists, SLAs/SLOs where applicable, and clear support documentation to enable self-service and reduce tickets.
•
Apply SRE practices: define/track SLIs/SLOs (availability, data freshness/latency, job success rate, quality signals), participate in on-call/incident response, lead structured triage, and drive post-incident corrective actions to closure.
•
Build and maintain observability: dashboards and actionable alerts across logs/metrics/traces and D&A signals (pipeline failures, SLA misses, anomalies). Reduce alert noise and improve MTTR through better instrumentation and automation.
•
Triage and remediate security vulnerabilities across product runtimes (images, libraries, pipelines, IaC). Embed security checks and compliance controls into CI/CD and support audit evidence needs.
•
Support infrastructure and environment consistency via Terraform and GCP services (IAM, networking, Compute/GKE, Storage, Monitoring/Logging).
•
Integrate FinOps fundamentals into operations: enforce tagging/labeling, identify waste (idle/oversized resources, runaway jobs), and partner with FinOps/product owners to implement optimizations.
Required Skills: Terraform (PR-based governance), strong GCP fundamentals, Kubernetes/GKE familiarity preferred, CI/CD (GitHub Actions/Jenkins), observability (Cloud Monitoring/Logging and/or Prometheus/Grafana/Datadog), security tooling exposure (Dependabot/GitHub Advanced Security, SonarQube, Wiz/Tenable), Python/Bash automation, strong troubleshooting and stakeholder communication.
Travel requirements:
Work schedule:
No Relocation support availableBusiness Unit Summary
Headquartered in Singapore, Mondelēz International’s Asia, Middle East and Africa (AMEA) region is comprised of six business units, has more than 21,000 employees and operates in more than 27 countries including Australia, China, Indonesia, Ghana, India, Japan, Malaysia, New Zealand, Nigeria, Philippines, Saudi Arabia, South Africa, Thailand, United Arab Emirates and Vietnam. Seventy-six nationalities work across a network of more than 35 manufacturing plants, three global research and development technical centers and in offices stretching from Auckland, New Zealand to Casablanca, Morocco. Mondelēz International in the AMEA region is the proud maker of global and local iconic brands such as Oreo and belVita biscuits, Kinh Do mooncakes, Cadbury, Cadbury Dairy Milk and Milka chocolate, Halls candy, Stride gum, Tang powdered beverage and Philadelphia cheese. We are also proud to be named a Top Employer in many of our markets.
Mondelēz International is an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation or preference, gender identity, national origin, disability status, protected veteran status, or any other characteristic protected by law.
Job Type
RegularSoftware & ApplicationsTechnology & DigitalThere are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
