Sr. Software Engineer, Infrastructure
Team: Engineering
Location: London
Commitment: Full Time
Workplace Type: onsite
Key Responsibilities:
- Maintain and enhance AWS infrastructure instrumentation and observability tools (e.g., Grafana, alarms) to ensure system reliability.
- Oversee Circadia's CI/CD pipelines (Jenkins) to enable efficient and seamless code deployment.
- Manage and maintain a fully separated staging environment for testing and development.
- Monitor AWS infrastructure for cost efficiency, identifying and implementing improvements.
- Optimize Snowflake ETL pipelines to reduce costs while maintaining performance and reliability.
- Manage GPT pipelines in Azure to ensure performance and cost-efficiency.
- Develop and maintain data pipelines for integrating external electronic health record (EHR) system data.
- Monitor and maintain MySQL databases to guarantee optimal performance and reliability.
- Collaborate with the backend team to design and implement APIs supporting Circadia's suite of products.
- Architecture: Design, deploy, and manage AWS infrastructure solutions to support various applications and services.
- Design scalable systems for storage and processing of large amounts of medical data.
- Manage databases (e.g., MySQL, MongoDB), optimizing for performance, scalability, and cost-efficiency.
- Manage compute clusters (e.g., ECS), serving various internal and customer-facing products and services.
- Utilize Terraform to efficiently manage cloud infrastructure.
- Ensure high availability, scalability, and reliability of the cloud environment.
- Security: Collaborate with development, operations, and security teams to ensure seamless integration and delivery of applications.
- Manage cloud infrastructure roles, permissions, and access credentials.
- Oversee regular and thorough rotation of access credentials and keys.
- Reliability: Troubleshoot and resolve infrastructure-related issues promptly and effectively.
- Maintain comprehensive and actionable runbooks for dealing with incidents and infrastructure outages.
- Create detailed post-mortems in case of significant outages.
- Implement automated alerting and incident response systems to identify and resolve issues quickly.
- Documentation: Create and maintain comprehensive documentation for cloud infrastructure and processes.
- Maintain documentation at a level required for a cloud infrastructure powering a SaMD (Software as a Medical Device) product.
- Instrumentation: Develop and maintain instrumentation infrastructure to ensure system health.
- Build instrumentation systems to provide timely system health checks and alerts using Prometheus and Grafana.
- Implement and maintain automated alerting and incident response systems for quick issue identification and resolution.
- DevOps Support: Automate routine tasks and processes to improve efficiency and reduce manual intervention.
- Implement and maintain CI/CD pipelines (Jenkins, CircleCI, or similar) to manage the deployment of Circadia’s services and products (backend services, Android, iOS, React apps).
- Write clean, testable code with a commitment to maintaining high coding standards through comprehensive testing (Jest, PyTest, JUnit, etc.).
Attributes:
- Need to Haves:
- C++: Strong understanding of multithreading, asynchronous programming, and network protocols.
- Advanced knowledge of Python and related frameworks (FastAPI, NumPy, Pandas, Pydantic) including multithreading and parallel design principles.
- Understanding of AWS, including knowledge of Cognito, Pinpoint, IoT, MSK and other services.
- Deep understanding of user-centered design principles, design thinking methodologies, and usability best practices.
- Proficient in using and maintaining Docker containers.
- Strong understanding of RESTful API design principles and best practices.
- Experience with TDD and testing frameworks such as Boost.Test and Google Test.
- Nice to Haves:
- Experience with Azure services for managing GPT pipelines and multi-cloud infrastructure.
- Familiarity with big data technologies such as Apache Spark, Kafka, and MSK for large-scale data processing.
- Experience with Boost libraries (Asio, Beast).
- Advanced experience in cost optimization strategies for cloud infrastructure and database performance tuning.
- Technical acumen: Advanced knowledge of all AWS systems and services.
- Detail oriented: Responsible for mission-critical healthcare products and services.
- Communications and Trust: Phenomenal communication skills with the ability to maintain the highest levels of confidentiality on a consistent basis.
- Organisation and Getting Stuff Done: Juggling multiple projects and timelines. Prioritising. Keen eye for detail in all tasks and projects. Exceptional at making lists and maintaining organisation.
- Growth Mindset: The ability to learn from and reflect on mistakes and avoid repeating them. Being curious, asking questions, and showing resilience in the face of setbacks.
Benefits:
- Join an energetic, diverse team dedicated to working towards the challenge of improving and saving patient lives
- Private health insurance with Vitality Health for you and your family, including discounted gym memberships, wellness retreats, fitness devices, and lots more
- 28 days paid annual leave during each holiday year (including bank holidays)
- Fully financed learning and personal development courses to help you grow in your role
- Opportunity to attend conferences and acquire certifications, paid for by the company
- New laptop of your choice for you to work on, either at home or at Circadia’s London Bridge office
- Flexible / hybrid working to suit your personal circumstances and allow you to be productive wherever you are most comfortable working
- Participate in and help plan regular team events, lunches and dinners
