Kubernetes/Container Development/Operation Senior Engineer/Tech Lead - Cloud Services Department (CLSD)
Location: Tokyo, Japan
Time Type: Full time
Job Description
Job Description:
Business Overview
The Technology Platforms Division (TPD) drives the growth of Rakuten's ecosystem by delivering innovative, high-quality technology platforms characterized by integrated control and strategic partnerships.
Within TPD, the Cloud Platform Supervisory Department (CPSD) develops and manages Rakuten's state-of-the-art cloud platform, empowering global scalability and accelerating innovation across its diverse business units.
Department Overview
The Cloud Services Department (CLSD) at Rakuten Group provides high-quality cloud infrastructure and platform services to application developers across Rakuten. Our mission is to enable secure, scalable, and efficient digital innovation.
We deliver key domain services, including compute, storage, core infrastructure components, databases, container platform, observability, and gateway solutions, empowering Rakuten application teams to focus on their core business objectives.
Position:
Why We Hire
The business of Rakuten Group, Inc. is rapidly growing and our private cloud is rapidly growing as well. To support such growth, many interesting and ambitious projects are on going. We’re searching for new members who can enjoy such interesting and ambitious projects. We’re also welcoming those who can propose new ideas which can further support Rakuten Group, Inc.'s growth, with internal/external technologies and flexible mind.
Position Details
We are seeking a highly skilled and motivated Infrastructure Engineer with a strong background in Kubernetes, container technologies, and Linux systems, coupled with proven software development capabilities. In this role, you will be instrumental in designing, developing, and operating our core infrastructure, ensuring high availability, performance, and security. The ideal candidate will thrive in a fast-paced environment, embrace a "Get Things Done" mindset, and contribute to a culture of operational excellence within a large enterprise setting.
Key Responsibilities
1) Operation
- Cluster/Node Provisioning
- Alert/Incident Handling (24/7 OnCall. Daily Rotation roughly 1d/week)
- OS/Middleware Update
- Security Requirement Achievement
- Midnight Release, Midnight Monitoring
- Operation Manual Creation
- Risk Analysis of Production Environment Operation
2) Development
- Design/Proposal Doc (Diagram, pros/cons comparison)
- Cluster/Node auto provisioning
- OS/Middelware auto upgrade, self-service upgrade
- Engineer Self-healing
3) User Support and Migration Support
- Support special cases which user support group cannot handle
- Support migration from the legacy platform to new private cloud
Work Environment
- 17 members
- Language: Go, Python, Groovy, Shell Script
- Infrastructure: Private Cloud (Kubernetes, Baremetal, VM, Container)
- Provisioning/Operation: Ansible, multiple inhouse tools written in Go and operator pattern (redhat operator framework, etc.), jenkins
- Monitoring: prometheus, cortex, grafana, kibana, Datadog, PagerDuty
- CI/CD: Jenkins
- Knowledge Tool: Confluence
- Project Management: JIRA
- Communication Tool: Slack, MS Teams, Viber
Mandatory Qualifications:
- Certified Kubernetes Administrator (CKA) Holder (Note: If not currently held, successful candidates are required to obtain CKA certification within 3 months of joining)
- Experience of designing and developing web services with a "statically typed" programming language (At least 2 from golang/java/C++ etc.)
- 3 years experience in operating large-scale, mission-critical systems (e.g., cloud platforms, banking, telecommunications).
- Experience as a Tech Lead
- Native Japanese language proficiency.
- Strong sense of responsibility to keep the stability of the system, and to output artifacts by deadline
- Get things done mind for projects to meet the deadline
- Experience of leading projects
- Deep understanding and experience of Kubernetes/container/Linux provisioning and trouble shooting
- Basic knowledge of networking, TCP/IP
- Basic knowledge of distributed system and HA structure
- Experience of large scale system operation (100+ servers)
- Those who can follow to the strict rules such as document creation and approval process which is mandatory for infrastructure
Desired Qualifications:
- Participate in open source activities, OSS contributor
- Bachelor/Master's degree around computer science, engineering, or related fields
- Experience of automation of large scale system operation
- Experience of development of middle - large scale application
- Experience of multiple monitoring tool (prometheus, cortex, grafana, datadog, newrelic, elasticsearch, kibana, etc.)
- Private/public cloud experience
#engineer #infrastructureengineer #technologyplatformdiv
Languages:
English (Overall - 3 - Advanced), Japanese (Overall - 4 - Fluent)There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
60,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 452 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
To try it out
For active job seekers
For those who are passive looking
Cancel anytime
Frequently Asked Questions
- We prioritize job seekers as our customers, unlike bigger job sites, by charging a small fee to provide them with curated access to the best companies and up-to-date jobs. This focus allows us to deliver a more personalized and effective job search experience.
- We've got over 200,000 jobs from 15,000+ vetted companies. No fake or sleazy jobs here!
- We aggregate jobs from 15,000+ companies' career pages, so you can be sure that you're getting the most up-to-date and relevant jobs.
- We're the only job board *for* software engineers, *by* software engineers… in case you needed a reminder! We add thousands of new jobs daily and offer powerful search filters just for you. 🛠️
- Every single hour! We add 2,000-3,000 new jobs daily, so you'll always have fresh opportunities. 🚀
- Typically, job searches take 3-6 months. EchoJobs helps you spend more time applying and less time hunting. 🎯
- Check daily! We're always updating with new jobs. Set up job alerts for even quicker access. 📅
What Fellow Engineers Say
