Sr. Backend Operations Engineer - Coverstar
Location: San Francisco
Department: Coverstar
Location Type: REMOTE
Employment Type: FULL_TIME
Responsibilities:
- Expand and enhance our Grafana/Prometheus monitoring solution.
- Consolidate logs, metrics, and system health data for actionable insights and streamlined troubleshooting.
- Configure automated alerts based on predefined thresholds and anomaly detection to ensure rapid incident response.
- Diagnose and resolve infrastructure incidents between 9AM-9PM EDT, leveraging monitoring tools and system logs.
- Implement corrective actions and preventive measures to avoid recurrence.
- Analyze and optimize database queries, indexing, and partitioning strategies for enhanced performance and scalability.
- Regularly inspect database tables, identifying areas for improvement and recommending necessary maintenance activities.
- Monitor database usage trends to predict and proactively address scaling needs, preventing performance issues.
- Improve platform-wide security monitoring with real-time analytics and automated anomaly detection to quickly identify and respond to threats.
- Utilize security tools to simulate realistic attack scenarios to uncover vulnerabilities.
- Conduct ongoing vulnerability assessments and automated penetration testing.
- Strengthen and document incident response procedures, ensuring clear cross-team communication and swift incident remediation.
- Develop and maintain robust CI/CD pipelines for efficient code integration, testing, and deployment.
- Implement and integrate comprehensive testing frameworks, including unit, integration, and end-to-end tests, ensuring high-quality code delivery.
- Collaborate with teams to enforce industry-standard security checks and continuous monitoring across the software delivery lifecycle.
Qualifications:
- Extensive experience in backend infrastructure operations, including monitoring, incident management, database optimization, and security.
- Strong proficiency with Grafana, Prometheus, PostgreSQL (Aurora), and CI/CD pipeline tools.
- Proven ability to implement proactive security measures and conduct continuous assessments.
- Excellent problem-solving and incident management skills.
- Strong collaboration and communication skills, capable of cross-team coordination and documentation.
- Availability during core operational hours (9AM-9PM EDT).
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
