What you'll be doing:
- Monitoring and Incident Management: Develop and maintain monitoring and alerting solutions. Respond to incidents, troubleshoot issues, and perform root cause analysis.
- Automation and Tooling: Automate repetitive tasks and improve deployment processes. Develop and maintain tools to support infrastructure and applications.
- Performance Optimization: Analyze system performance and implement optimizations to improve efficiency and reduce latency.
- Security and Compliance: Ensure systems are secure and compliant with relevant standards and regulations.
- Documentation and Knowledge Sharing: Maintain comprehensive documentation of systems and processes. Share knowledge and best practices with team members.
- Database Management: Ensure the reliability, performance, and scalability of databases. Perform database optimization, maintenance, and troubleshooting.
What you need to have:
- Proficiency in programming languages such as Python, Go, Javascript
- 5+ years of experience with cloud platforms such as AWS, Google Cloud, or Azure.
- Strong understanding of Linux/Unix systems and networking.
- Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes).
- Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
- Knowledge of CI/CD pipelines and tools (e.g., Jenkins, GitLab CI).
- Database Experience: Proficiency with relational and NoSQL databases (e.g., MySQL, PostgreSQL, Redis, Elasticsearch)
- Team Player: Willingness to collaborate and share knowledge with colleagues to drive collective success.
- Ownership: Taking responsibility for your work and demonstrating accountability for outcomes.
What we would love to see:
- Innovative Mindset: A passion for exploring new technologies and methodologies to improve reliability and performance.
- Proactive Approach: Ability to anticipate potential issues and implement preventive measures.
- Continuous Improvement: A dedication to learning and growing in your role, staying updated with industry trends and best practices.
0 applies
7 views
Other Jobs from Sword Health
Junior Associate to CTO
Chief of Staff - AI (Portugal-based Remote/Hybrid)
Chief of Staff - AI (Portugal-based Remote/Hybrid)
Lead Software Engineer - Internal AI Solutions (Portugal-based Remote/Hybrid)
Senior Data Engineer (Remote/Hybrid)
Similar Jobs
Senior Site Reliability Engineer
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 401 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got about 70,000 jobs from 5,000 vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 5,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say