Infrastructure Stability Architect
Location: Hong Kong, Hong Kong SAR
Department: Engineering
Who We Are
We are deeply committed to shaping a fairer, more transparent and accessible society through blockchain technology. This is why we publish proof of reserves monthly, and continue to ship new innovative security features.
About the Opportunity
What You’ll Be Doing
- Design and lead the stability architecture for large-scale distributed systems, including big data platforms, data warehouses, and core middleware infrastructure.
- Develop and optimize comprehensive stability strategies covering capacity planning, performance optimization, fault prevention, and disaster recovery.
- Spearhead chaos engineering practices, design complex fault injection scenarios to validate system resilience and self-healing capabilities.
- Build and refine comprehensive monitoring and alerting systems for rapid fault detection, localization and recovery,.
- Lead root cause analysis for major incidents and formulate long-term improvement plans to continuously enhance system availability and reliability.
- Drive infrastructure intelligence and automation, designing and implementing AIOps solutions.
- Collaborating closely with product, development, and operations teams to integrate stability requirements throughout the product lifecycle.
- Lead the development of stability-related technical standards and best practices, promoting their adoption across the organization.
What We Look For In You
- Bachelor degree or above in Computer Science or related major, with more than 10 years of architecture design experience in large-scale internet or computing platforms.
- Expert knowledge of distributed system architectures, with deep understanding and rich practical experience in big data, cloud-native, and micro-service technologies.
- In-depth understanding of various infrastructure components (e.g. Kubernetes, Kafka, Database) and ability to perform advanced tuning.
- Strong systems thinking capability, able to analyze and solve complex stability issues from a holistic perspective.
- Extensive experience in handling large-scale system failures, with the ability to quickly locate and resolve challenging problems.
- Mastery of Linux systems and network technologies, familiarity with mainstream cloud platforms e.g. Alibaba Cloud, AES) architecture and services.
- Excellent technical leadership skills, able to guide teams and drive cross-department collaboration.
- Proficiency in speaking, reading and writing in both English and Mandarin to collaborate effectively with global and cross-functional team members.
- Passion for continuous learning, able to quickly grasp new technologies and apply them in practical work scenarios.
Perks & Benefits
-
Competitive total compensation package
-
L&D programs and Education subsidy for employees' growth and development
-
Various team building programs and company events
-
Wellness and meal allowances
-
Comprehensive healthcare schemes for employees and dependants
-
More that we love to tell you along the process!
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
