Senior Infra/ Infra Engineer (Automation & Observability) (Contract)
Location: MAS: MAS Building
Time Type: Full time
Job Description
[What the role is]
We are seeking a skilled Ansible Automation and Elastic Observability Engineer to design, implement, and maintain our infrastructure automation and observability platform. The successful candidate will be responsible for developing automation solutions using Ansible whilst building comprehensive observability capabilities with the Elastic Stack to monitor, analyse, and optimise our IT infrastructure and applications.[What you will be working on]
Design and implement automation frameworks for infrastructure, applications, and processes for critical information infrastructure (CII)
Develop and maintain observability solutions, including monitoring dashboards, alerts, and metrics collection frameworks
Build self-healing systems and automated remediation solutions to ensure system reliability and performance
Develop and maintain Logstash pipelines for data ingestion, transformation, and enrichment from various sources including application logs, system metrics, and business data.
Develop comprehensive monitoring dashboards using Kibana/Grafana to track system health, performance metrics, and business KPIs. Troubleshoot cluster issues, performance bottlenecks, and data ingestion problems.
Participate in incident management, on-call rotations, and post-incident reviews to implement improvements
Create and maintain comprehensive documentation, runbooks, and best practices for automation and observability
Assist in change management of Observability & Automation platform for new versions, Hotfixes, Platform Admin tasks, etc.
Troubleshoot and resolve complex issues related to Elastic & Ansible components
[What we are looking for]
A relevant university degree with at least 3+ years of relevant working experience.
Minimum 3+ years of hands-on experience with Elasticsearch, Logstash, Kibana, and Beats/Elastic Agent.
Strong understanding of distributed systems, search algorithms, and data structures.
Proficiency in Linux system administration and command-line tools.
Experience with containerisation technologies such as Docker and Kubernetes.
Solid experience with Elastic Stack components including Elasticsearch, Logstash, Kibana, and Beats for log management and observability.
Understanding of observability principles including metrics, logs, and traces (MLT) and their implementation in distributed systems.
Experience with APM tools and distributed tracing technologies for application performance monitoring.
Solid programming skills in Python, Java, or similar languages for custom plugin development and automation. Experience with configuration management tools like Ansible, Puppet, or Chef.
Knowledge of scripting languages including Bash and PowerShell for operational tasks.
Ability to succeed in a fast-paced, high demand environment
Excellent oral and written communication skills
Experience working with infrastructure as code technologies such as Terraform is preferred.
Strong analytical and problem-solving capabilities
Excellent project management and organisational skills
Ability to work effectively under pressure and manage multiple priorities
Strong service-oriented mindset with focus on operational excellence
As part of the shortlisting process for this role, you may be required to complete a medical declaration and/or undergo further assessment.
This is a 2-year contract position. All applicants will be notified on whether they are shortlisted or not within 4 weeks of the closing date of this job posting.
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
