ESSENTIAL DUTIES AND RESPONSIBILITIES
- Collaborate with development teams to enhance the reliability and efficiency of microservices applications.
- Engage with product development (PD) teams by participating in design reviews and production readiness checks.
- Collaborate with engineering teams, providing product feedback and where necessary contribute code to the product
- Work closely with cross-functional teams to ensure seamless integration of new features and services.
- https://aws.amazon.com/blogs/apn/the-6-pillars-of-the-aws-well-architected-framework/
- Analyze data from observability and monitoring tools to improve operational metrics of microservices as well as the entire platform.
- Leverage end-to-end technical expertise gained by engagement with multiple PD teams and analyzing observability data to propose improvements in code and design to improve SLO and prevent incidents.
- Create system documentation and training materials to empower and educate our fellow team members
- Take a purist SRE approach to shared multi-tenant infrastructure for a resilient SaaS microservice-based containerized systems in addition to customer-centric application environments
- Oversee and automate the team’s growing presence in AWS
- Creatively build and develop tooling to aid in driving 24x7x365 follow-the-sun operations of critical production systems
- Build and maintain observability tooling, metrics, and dashboarding for a global platform product infrastructure
- Improve our incident management lifecycle to identify, mitigate, and learn from reliability risks and issues
- Collaborate with engineering teams, providing product feedback and where necessary contribute code to the product
REQUIRED SKILLS AND EXPERIENCE
- Education and Work Experience
- Bachelor’s Degree in Computer Science or related field with 10+ Years of experience
- Software engineering and task automation skills with Bash, Python, and/or Go are a must
- Experience in developing and maintaining Java-based web applications, including deployment and support on Apache/Tomcat servers in a live production environment.
- Familiarity with the Agile software development lifecycle
- Deep background with Linux systems and engineering
- Highly experienced with engineering and automating on Amazon Web Services (AWS)
- Prior experience with IaC tools like Terraform/Terragrunt/Terraspace
- Prior experience with devops/gitops tools (Git, Bitbucket, Flux CD, Teamcity) for gate promotions
- Production-At-Scale support background in a heavily microservice-based world
- Hands-on engineering and ops expertise in containerization (Docker, Helm, Kubernetes/EKS, CNI and Ingress networking)
- Strong understanding of Single-Sign On, SAML, OAuth (Bonus if hands-on experience with Okta)
- Seasoned expertise around x.509 certificate technology and basic concepts of encryption
- Experience working with Relational Databases such as Aurora Postgres and/or Oracle RDS
- Advanced exposure to application development, web UI (design and development), JSON, application architecture
- Experience strongly utilizing observability tools (logging/APM) like Datadog, CloudWatch, and PagerDuty.
- Familiarity with event store/stream-processing technologies like Kafka or AWS SQS
- Understanding of Open Application Model systems such as KubeVela or Crossplane
- Personal Qualities and Soft Skills
- You greatly prefer writing code than clicking a GUI.
- You enjoy teaching, being a mentor to others, and working across boundaries
- Outstanding troubleshooting skills; ability to think critically and display an aptitude for problem solving
- Strong analytical mind with a penchant for process development and enhancement
- A highly positive can-do attitude with desire for being a team player
- Great communication skills and ability to explain complex technical concepts to a varied audience
- Demonstrate strong follow-through, a strong work ethic and consistently keep and meet commitments
- Other Requirements
- Ability to read, write, and speak English
- We provide 24x7 support to our customers, so we expect you to take turns with your teammates being on-call for weekend production emergencies or to provide rotating weekend operational support
- Travel – Expect occasional travel (less than 5%) to other Guidewire offices for training and team meetings
Other Jobs from Guidewire Software
Software Engineer II ( Java)
Site Reliability Engineer - Platform Microservices Reliability
Senior Software Engineer ( Java)
Senior Site Relibility Engineer
Consultant Developer Intern (6 months) – Kuala Lumpur, Malaysia
Similar Jobs
Site Reliability Engineer - Platform Microservices Reliability
Senior Site Relibility Engineer
SW DevOps Engineer
Senior Software Development Engineer (US Federal)
Staff Database Engineer
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 401 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say