Rakuten

Senior Kubernetes/Container Engineer/Tech Lead

Tokyo, Japan
Kubernetes Linux Go Python Shell Ansible Jenkins Prometheus Grafana Kibana Datadog PagerDuty JIRA Slack MS Teams Viber Java C++ TCP/IP Elasticsearch
Description

Kubernetes/Container Development/Operation Senior Engineer/Tech Lead - Cloud Services Department (CLSD)

Location: Tokyo, Japan

Time Type: Full time

Job Description

Business Overview 
The Technology Platforms Division (TPD) drives the growth of Rakuten's ecosystem by delivering innovative, high-quality technology platforms characterized by integrated control and strategic partnerships.

  
Within TPD, the Cloud Platform Supervisory Department (CPSD) develops and manages Rakuten's state-of-the-art cloud platform, empowering global scalability and accelerating innovation across its diverse business units. 

Department Overview

The Cloud Services Department (CLSD) at Rakuten Group provides high-quality cloud infrastructure and platform services to application developers across Rakuten. Our mission is to enable secure, scalable, and efficient digital innovation.

We deliver key domain services, including compute, storage, core infrastructure components, databases, container platform, observability, and gateway solutions, empowering Rakuten application teams to focus on their core business objectives.

Position:

Why We Hire

Rakuten Group, Inc.'s business is growing rapidly, and so is our private cloud. To support this growth, many interesting and ambitious projects are ongoing. We are looking for new members who will enjoy working on them. We also welcome those who can propose new ideas, drawing on internal and external technologies and a flexible mindset, to further support Rakuten Group, Inc.'s growth.

 

Position Details

We are seeking a highly skilled and motivated Infrastructure Engineer with a strong background in Kubernetes, container technologies, and Linux systems, coupled with proven software development capabilities. In this role, you will be instrumental in designing, developing, and operating our core infrastructure, ensuring high availability, performance, and security. The ideal candidate will thrive in a fast-paced environment, embrace a "Get Things Done" mindset, and contribute to a culture of operational excellence within a large enterprise setting.

Key Responsibilities

1) Operation

- Cluster/Node Provisioning

- Alert/Incident Handling (24/7 on-call; daily rotation, roughly 1 day per week)

- OS/Middleware Update

- Security Requirement Achievement

- Midnight Release, Midnight Monitoring

- Operation Manual Creation

- Risk Analysis of Production Environment Operation

2) Development

- Design/Proposal Doc (Diagram, pros/cons comparison)

- Cluster/Node auto provisioning

- OS/Middleware auto upgrade, self-service upgrade

- Engineer Self-healing

3) User Support and Migration Support

- Support special cases that the user support group cannot handle

- Support migration from the legacy platform to the new private cloud

 

Work Environment

- 17 members

- Language: Go, Python, Groovy, Shell Script

- Infrastructure: Private Cloud (Kubernetes, Baremetal, VM, Container)

- Provisioning/Operation: Ansible, multiple in-house tools written in Go using the operator pattern (Red Hat Operator Framework, etc.), Jenkins

- Monitoring: Prometheus, Cortex, Grafana, Kibana, Datadog, PagerDuty

- CI/CD: Jenkins

- Knowledge Tool: Confluence

- Project Management: JIRA

- Communication Tool: Slack, MS Teams, Viber 

 

Mandatory Qualifications:

- Certified Kubernetes Administrator (CKA) holder (Note: if not currently held, successful candidates are required to obtain CKA certification within 3 months of joining)

- Experience designing and developing web services with statically typed programming languages (at least two of Go, Java, C++, etc.)

- 3 years of experience operating large-scale, mission-critical systems (e.g., cloud platforms, banking, telecommunications)

- Experience as a Tech Lead

- Native Japanese language proficiency.

- Strong sense of responsibility for keeping the system stable and delivering artifacts by the deadline

- A "Get Things Done" mindset for completing projects by the deadline

- Experience leading projects

- Deep understanding of and experience with Kubernetes/container/Linux provisioning and troubleshooting

- Basic knowledge of networking, TCP/IP

- Basic knowledge of distributed systems and high-availability (HA) architecture

- Experience operating large-scale systems (100+ servers)

- Ability to follow strict rules, such as the document creation and approval processes that are mandatory for infrastructure work

Desired Qualifications:

- Participation in open source activities as an OSS contributor

- Bachelor's or Master's degree in computer science, engineering, or a related field

- Experience automating large-scale system operations

- Experience developing medium- to large-scale applications

- Experience with multiple monitoring tools (Prometheus, Cortex, Grafana, Datadog, New Relic, Elasticsearch, Kibana, etc.)

- Private/public cloud experience

#engineer #infrastructureengineer #technologyplatformdiv 

Languages:

English (Overall - 3 - Advanced), Japanese (Overall - 4 - Fluent)