Senior ML Research Scientist, Pegasus

Twelve Labs · Seoul, South Korea · Full-Time · Research Science · Posted 3w+ ago

What it’s like to work at Twelve Labs

Video Understanding AI · San Francisco / Seoul

Employee Rating: 3.6/5
Work-Life Balance: 3.4/5
Open Roles: 32
Culture tags: eng-driven, ship-fast, flat, product-impact

What employees love

  • Unique video AI niche with strong product-market fit and cutting-edge research
  • Flat, collaborative org structure with real impact opportunities

What could be better

  • Compensation reported below market average with expensive benefits
  • High attrition rate and some concerns about leadership experience

About the Role

Who we are

At TwelveLabs, we are pioneering the development of cutting-edge multimodal foundation models that have the ability to comprehend videos just like humans do. Our models have redefined the standards in video-language modeling, empowering us with more intuitive and far-reaching capabilities, and fundamentally transforming the way we interact with and analyze various forms of media.

With $110+ million in Seed and Series A funding, our company is backed by top-tier venture capital firms such as NVIDIA’s NVentures, NEA, Radical Ventures, and Index Ventures, and by prominent AI visionaries and founders such as Fei-Fei Li, Silvio Savarese, Alexandr Wang, and more. Headquartered in San Francisco, with an influential APAC presence in Seoul, our global footprint underscores our commitment to driving worldwide innovation.

Our partnership with NVIDIA and AWS gives us access to the most advanced chips, including B300s, enabling us to push the boundaries of what's possible in video AI.

We are a global company that values the uniqueness of each person’s journey. It is the differences in our cultural, educational, and life experiences that allow us to constantly challenge the status quo. We are looking for individuals who are motivated by our mission and eager to make an impact as we push the bounds of technology to transform the world. Join us as we revolutionize video understanding and multimodal AI.

About Pegasus

Pegasus is TwelveLabs' core video understanding product, turning video into useful analysis by reasoning over visuals, speech, audio, and on-screen text. The team is not building a generic Video LLM in isolation; we build customer-facing video intelligence workflows that require temporal understanding, structured outputs, and production-grade reliability.

A key example is Segment, our time-based metadata capability. Instead of asking the model a broad question about a video, customers define the exact segment types they care about and the metadata fields they want back. Pegasus then finds the relevant start and end times and returns structured metadata for each segment, such as titles, summaries, topics, people, visual subjects, confidence, or domain-specific labels. This is designed for workflows where “what happened” is not enough; customers need to know when it happened and receive metadata that can flow directly into search, archive, editing, compliance, or content management systems.

For example, a news archive customer can define a segment type like editorial_narratives and ask Pegasus to split a long broadcast into individual stories. For each story, Pegasus can return a timestamped segment with fields such as segment_title, description, editorial_subjects, visual_subjects, names, and confidence. The output is not just a summary of the full video; it is a structured timeline of the video, aligned to the customer's schema.

This is the distinction that matters for Pegasus: general video analysis answers questions about video, while Segment turns video into time-based, structured data tailored to a specific business workflow.
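
As a rough illustration of the news archive example above, the sketch below models a customer-defined segment type and a timestamped timeline in plain Python. It is not the Pegasus API: the class names, the `validate_timeline` helper, and all sample values are hypothetical. Only the segment type `editorial_narratives` and the field names come from the description above.

```python
from dataclasses import dataclass
from typing import Any


@dataclass
class SegmentDefinition:
    """Hypothetical: a customer-defined segment type and the fields wanted back."""
    segment_type: str
    fields: list[str]


@dataclass
class Segment:
    """Hypothetical: one timestamped segment with its structured metadata."""
    start_sec: float
    end_sec: float
    metadata: dict[str, Any]


def validate_timeline(definition: SegmentDefinition, segments: list[Segment]) -> list[str]:
    """Return a list of problems; an empty list means every segment fits the schema."""
    problems = []
    for i, seg in enumerate(segments):
        # A segment must cover a positive span of time.
        if seg.end_sec <= seg.start_sec:
            problems.append(f"segment {i}: end must be after start")
        # Every field the customer asked for must be present.
        missing = [name for name in definition.fields if name not in seg.metadata]
        if missing:
            problems.append(f"segment {i}: missing fields {missing}")
    return problems


definition = SegmentDefinition(
    segment_type="editorial_narratives",
    fields=["segment_title", "description", "editorial_subjects",
            "visual_subjects", "names", "confidence"],
)

# Illustrative values only; these are not real model outputs.
timeline = [
    Segment(0.0, 95.5, {
        "segment_title": "Story A", "description": "First story in the broadcast",
        "editorial_subjects": ["politics"], "visual_subjects": ["podium"],
        "names": ["Anchor One"], "confidence": 0.9,
    }),
    Segment(95.5, 240.0, {
        "segment_title": "Story B", "description": "Second story in the broadcast",
        "editorial_subjects": ["weather"], "visual_subjects": ["map"],
        "names": [],
    }),
]

problems = validate_timeline(definition, timeline)
print(problems)
```

Here the second segment omits `confidence`, so the check reports it. A downstream search, archive, or compliance system could run a check of this kind before ingesting the timeline.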

About the Team

The Pegasus team sits at the core of TwelveLabs' video understanding capabilities and is responsible for driving Pegasus, our Video Analysis product. Our focus is on developing multimodal video analysis systems that follow instructions closely and produce highly complex, hierarchically structured outputs. We ship products with real-world value rather than doing research in isolation, and we work as a goal-oriented, cross-functional team of ML researchers and engineers.

Our work covers a broad range of challenges: large-scale distributed training of multimodal LLMs, from pre-training through RL; accurate temporal segmentation and structured metadata extraction for real-world use cases; extending temporal context length to multiple hours; and data curation processes that enable well-aligned evaluation and performance improvements through better training data.

Our team has access to the most advanced chips in the world, including NVIDIA B300s, to push the boundaries of video analysis systems and shorten our research-to-production cycle.

In this role, you will

Even if you don't check every box, we encourage you to apply.

If you're a zero-to-one achiever, a ferocious learner, and a kind team player who motivates others, you'll find a home at TwelveLabs.

You may be a good fit if you have

Preferred qualifications

Others

Hiring Process

Application Review → Recruiter Interview (remote, 30 min) → Loop Interview [Hiring Manager Interview & Live Coding Test Interview] (in person, approx. 90 min) → System Design Interview (in person, approx. 90 min) → Final Round Interview (remote, approx. 30 min) → Reference Check → Offer

Benefits and Perks

Frequently Asked Questions

What is the work-life balance like at Twelve Labs?
Twelve Labs has a work-life balance score of 3.4/5 based on employee reviews. This is below average, which may indicate a fast-paced, demanding work environment.
What is Twelve Labs’s culture like?
Twelve Labs is characterized by these culture values: eng-driven, ship-fast, flat, product-impact. Based on employee reviews, the company has an overall rating of 3.6/5. Employees cite its unique video AI niche with strong product-market fit and cutting-edge research.
How many open roles does Twelve Labs have?
Twelve Labs currently has 32 open roles across departments including engineering, product, sales, and more. Roles are refreshed daily from their careers page.
Is this role remote-friendly?
This role is located in Seoul, South Korea. Check the job description above for specific location and remote work details.