About Sakana AI
Sakana AI is a Tokyo-based AI research lab founded by former Google Brain researchers, pioneering nature-inspired approaches to artificial intelligence.
About the Role
As a Software Engineer on the Platform team, you will build and maintain the foundational infrastructure that powers all of Sakana AI's products and research. From compute orchestration to CI/CD pipelines, you'll ensure the team can develop and deploy AI systems efficiently and reliably.
Responsibilities
- Design and operate scalable cloud infrastructure for model training and serving
- Build and maintain CI/CD pipelines, deployment tooling, and developer experience systems
- Manage GPU clusters and optimize resource allocation for training workloads
- Implement monitoring, logging, and alerting across the entire platform
- Ensure infrastructure security, cost optimization, and operational reliability
- Support internal teams with tooling and platform capabilities
Qualifications
- 3+ years of platform or infrastructure engineering experience
- Deep experience with cloud platforms (AWS, GCP) and Kubernetes
- Proficiency in infrastructure-as-code (Terraform, Pulumi) and configuration management
- Experience managing GPU clusters and ML infrastructure at scale
- Strong Linux systems knowledge and networking fundamentals
- English proficiency required; Japanese language skills are a plus