HomeJobsLambda › Data Center Business

Senior Incident Manager

Lambda Remote, USA FullTime Data Center Business Posted 3w+ ago
Apply Now →

What it’s like to work at Lambda

GPU Cloud for AI · San Jose

3.6
Employee Rating
3.7
Work-Life Balance
38
Open Roles
eng-drivenship-fastequityproduct-impact

What employees love

  • Building the GPU cloud that powers the world’s top AI labs — genuinely impactful infrastructure
  • Smart engineering team with strong technical culture and good WLB on most teams

What could be better

  • Management style described as top-down; strategy clarity could be better
  • Internal politics and coordination overhead slow things down for some teams
View full Lambda culture profile →

About the Role

Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers. Our customers range from AI researchers to enterprises and hyperscalers. Lambda's mission is to make compute as ubiquitous as electricity and give everyone the power of superintelligence. One person, one GPU.

If you'd like to build the world's best AI cloud, join us.

We are seeking a Senior Incident Manager to lead critical incident response across our AI data center infrastructure. This role is responsible for coordinating rapid resolution of service-impacting events, improving operational resilience, and driving incident management best practices across infrastructure, networking, platform engineering, and data center operations.

 

Role Overview

The Senior Incident Manager is responsible for leading the end-to-end lifecycle of operational incidents impacting AI infrastructure and data center services. This individual acts as the central command point during major incidents, ensuring rapid triage, cross-team coordination, effective communication, and structured post-incident analysis.

This role requires deep operational expertise in high-availability infrastructure, large-scale GPU clusters, networking, and cloud platforms, along with strong leadership and communication skills.

 

What You’ll Do

Incident Leadership

Incident Management Operations

Cross-Functional Coordination

Post-Incident Analysis & Continuous Improvement

Operational Excellence

Communication & Reporting

You

Nice to Have

Key Competencies

What Success Looks Like in This Role

Salary Range Information

The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.

About Lambda

A Final Note:

You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills.

Equal Opportunity Employer

Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Similar Roles

More at Lambda
Data Center Technical Program Manager
San Jose Office (Zanker)
Senior Site Reliability Engineer - Observability
San Francisco Office (Fremont St)
Engineering Manager - Control Plane
San Francisco Office (Fremont St)
Software Engineer - Fleet
San Francisco Office (Fremont St)
Network Engineer
San Francisco Office (Fremont St)

Frequently Asked Questions

What is the work-life balance like at Lambda?
Lambda has a work-life balance score of 3.7/5 based on employee reviews. This is about average for the AI/tech industry.
What is Lambda’s culture like?
Lambda is characterized by these culture values: eng-driven, ship-fast, equity, product-impact. Based on employee reviews, the company has an overall rating of 3.6/5. Building the GPU cloud that powers the world’s top AI labs — genuinely impactful infrastructure
How many open roles does Lambda have?
Lambda currently has 38 open roles across departments including engineering, product, sales, and more. Roles are refreshed daily from their careers page.
Is this role remote-friendly?
This role is located in Remote, USA. Check the job description above for specific location and remote work details.
Apply for this role at Lambda →