Senior DevOps Engineer
Team: Core Engineering
Location: Pune
Workplace Type: onsite
About the Role:
- We are seeking a highly skilled Senior DevOps Engineer to lead the design, implementation, and continuous improvement of our cloud infrastructure, Kubernetes, CI/CD pipelines, observability systems, and reliability practices. This role is critical in ensuring platform stability, scalability, security, and operational excellence across production and non-production environments. You will work closely with Engineering, Security, and Product teams to build resilient, automated, and high-performing infrastructure systems.
Key Responsibilities:
- Infrastructure & Cloud Engineering: Design, implement, and manage scalable cloud infrastructure (AWS preferred)
- Lead infrastructure-as-code initiatives (Terraform / CloudFormation)
- Improve high availability, disaster recovery, and multi-region resilience
- Optimize cloud cost and resource utilization
- Kubernetes & Container Platform: Architect and manage production-grade Kubernetes clusters
- Improve cluster reliability, auto-scaling, and performance
- Implement workload monitoring, alerting, and SLO-based reliability standards
- Enforce namespace isolation and resource governance
- CI/CD & Automation: Design and optimize CI/CD pipelines (Jenkins, ArgoCD)
- Implement zero-downtime deployment strategies
- Automate environment provisioning (fully touchless builds with seed data)
- Improve deployment reliability and rollback mechanisms
- Observability & Reliability: Own monitoring, alerting, and logging strategy (Prometheus, Grafana, Datadog, etc.)
- Ensure 100% monitoring coverage for critical services
- Reduce Sev1/Sev2 incidents caused by infrastructure
- Create and maintain runbooks (COPs) for incident response
- Define SLOs, SLIs, and error budgets
- Security & Compliance: Implement IAM best practices and least privilege access
- Improve secrets management and credential rotation
- Partner with security team on audits and compliance controls
- Incident Management: Lead root cause analysis for major incidents
- Drive postmortems and preventive improvements
- Improve MTTR and overall operational maturity
Required Skills & Experience:
- 6+ years in DevOps / SRE / Cloud Engineering
- Strong experience with AWS (VPC, IAM, EC2, S3, RDS, EKS, etc.)
- Deep Kubernetes experience (production clusters)
- Strong understanding of networking and Linux systems
- Experience with Infrastructure as Code (Terraform preferred)
- Experience implementing monitoring & alerting systems (Datadog, Prometheus, Grafana)
- Strong scripting skills (Python / Bash)
- Experience managing production systems with high availability requirements
- Good understanding of databases such as Postgres and MySQL
- Strong written and verbal communication skills
- Ability to follow structured processes while being proactive in identifying improvements
- Analytical and problem-solving mindset
