Lead Site Reliability Engineer

Pune, India
Kubernetes Java Bash Python Terraform AWS


Your responsibilities

  • Automating the creation, deployment, testing, securing, and overall management of our infrastructure and services.  This requires an ability to understand key details about our services, the majority of which are written in Java.
  • Developing quality assurance methodologies for your code, including creating and validating your own unit tests.
  • Creating and using modern Continuous Integration/Continuous Deployment (CI/CD) pipelines and tooling, specifically using Cloud-native technologies, and designing the pipelines so that the typical engineer can use them at scale.
  • Taking responsibility for ensuring our offerings are secure and compliant with modern frameworks.
  • Resolving issues in our production environments, in most cases without involving other teams.
  • Mentoring junior engineers.
  • Serving in an on-call rotation.
  • Creating root cause analysis (RCA) documentation, and hosting and participating in meetings on such topics involving multiple stakeholders.
  • Designing and implementing monitoring, logging, and dashboarding platforms across Cloud providers and regions.

Your experience, skills, and capabilities should include:  

  • 8+ years of experience 
  • 7+ years of experience as an SRE, Platform Engineer, or in a similar role
  • 7+ years of experience managing mission-critical web applications at scale
  • Preferably 3+ years of experience with Bash, Python, Terraform, and Helm
  • 8+ years of substantial experience with AWS
  • Very deep experience with various Cloud-native monitoring, logging, and dashboarding platforms (including vendor-specific platforms like CloudWatch and CloudTrail, and third-party platforms like New Relic, FireHydrant, Datadog, PagerDuty, Prometheus, etc.)
  • A strong ability to work solely within an infrastructure-as-code (IaC) framework; in our case, this means knowing Terraform and/or CloudFormation intimately.
  • Strong experience with GitLab pipelines, AWS CodeBuild/CodeDeploy/CodePipeline, etc.
  • Deep understanding of Kubernetes, including but not limited to vendor implementations (e.g., AWS EKS)
  • Excellent verbal and written communication skills in English. Explaining and documenting are key functions of this role.
  • Experience working in a fast-paced startup environment.
  • B.Tech./M.Tech. in Computer Science and Engineering, MCA, M.Sc. in Computer Science, or equivalent

Big Data Cloud Security Compliance Cyber Security SaaS
