Platform Support Engineer
Team: Customer Support
Location: Remote UK
Commitment: Full-Time
Workplace Type: remote
WHAT YOU WILL BE DOING
- Strong pod-level troubleshooting skills in AKS/EKS (not just restarting pods).
- Analyze application and DB (RDS, MySQL) performance issues.Deeply investigate and analyze application performance issues (Java, Grails, Hibernate), identifying root causes and implementing solutions.
- Oversee the monitoring of our SaaS applications and underlying infrastructure (Kubernetes on AWS and Azure, VPN connections, customer applications, Elastic Search, MySQL) for alerts and performance issues.
- Strong understanding of basic computing concepts like DNS, IP addressing, Networking, and LDAP.
- Effectively participate and contribute in on-call escalations with a strong operational mindset and provide technical guidance during critical incidents.
- Proactively communicate with customers on technical issues when required.
- Ability to guide junior engineers when needed technically.
- Manage the full lifecycle of alerts, incidents, and service requests reported through FreshService, ensuring timely and accurate logging, prioritization, resolution, and escalation.
- Develop, implement, and maintain operational procedures, runbooks, and knowledge base articles to standardize incident resolution and service request fulfillment.
- Drive continuous improvement initiatives to optimize operational efficiency, reduce incident rates, and improve service request turnaround times.
- Collaborate with backend engineering and development teams to troubleshoot complex issues, identify root causes, and implement preventative measures.
- Ensure adherence to defined SLAs (Service Level Agreements) and KPIs (Key Performance Indicators) for operational performance. Maintain operational documentation, including system diagrams, contact lists, and escalation paths.
- Ensure compliance with relevant security and compliance policies.
- Plan and coordinate scheduled maintenance activities with minimal impact to service availability.
WHAT YOU BRING
- Bachelor's degree in Computer Science, Information Technology, Engineering, or a related field.
- Minimum of 3+ years of experience in IT/Cloud operations and application support (specifically Java apps), with knowledge of cloud infrastructure (AWS and Azure).
- Strong experience with application support (Java, Grails, Hibernate) and performance analysis in a production environment, able to pinpoint a performance degradation through analysis.
- Strong understanding of cloud computing concepts, architectures, and services on both AWS and Azure platforms.
- Working knowledge of containerization and orchestration technologies, specifically Kubernetes. End-to-end technical accountability and operational ownership. Willingness to work in a 24/7 operating model.
- Experience managing and troubleshooting network connectivity, including VPNs and connections to external networks.
- Familiarity with monitoring tools and practices, with experience in setting up and responding to alerts.
- Hands-on experience with log management and analysis tools, preferably Elastic Search.
- Working knowledge of database systems, preferably MySQL, including L2 troubleshooting and performance monitoring.
- Experience with ITSM (IT Service Management) systems, preferably FreshService, including incident, problem, and service request management processes.
- Excellent problem-solving, analytical, and troubleshooting skills with a data- driven approach. Experience with Grafana systems and dashboards is a plus.
- Strong communication (written and verbal), interpersonal, and presentation skills.
- Ability to work effectively under pressure and manage multiple priorities in a fast-paced environment.
- Experience in developing and documenting operational procedures and runbooks.
- Experience with automation tools and scripting languages (e.g., Python, Bash) is a plus.
- Experience working in a SaaS environment is highly desirable.
- Working knowledge of database systems, preferably MySQL, including L2 troubleshooting and performance monitoring.
- Experience with ITSM (IT Service Management) systems, preferably FreshService, including incident, problem, and service request management processes.
- Excellent problem-solving, analytical, and troubleshooting skills with a data- driven approach.Experience with Grafana systems and dashboards is a plus.
- Strong communication (written and verbal), interpersonal, and presentation skills.
- Ability to work effectively under pressure and manage multiple priorities in a fast-paced environment.
- Experience in developing and documenting operational procedures and runbooks.
- Experience with automation tools and scripting languages (e.g., Python, Bash) is a plus.
- Experience working in a SaaS environment is highly desirable.
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
