Building the World's Most Powerful Video Understanding AI Infrastructure
Innovative video AI with a collaborative culture, but compensation lags the market. Choose Twelve Labs if you want to work on cutting-edge multimodal AI in a flat org — but weigh the below-average pay.
Twelve Labs is building foundation AI models that can accurately search, summarize, and understand video content at scale. Their Large Visual Language Models (VLMs) are trained on massive video datasets, and the platform is used by developers and enterprises worldwide. The company operates a hybrid model between San Francisco and Seoul, with a flat organizational structure. Reviews praise the passionate, collaborative team and cutting-edge technology, but flag below-average compensation and high attrition as concerns. The founding team has deep ML expertise.
Builds Large Visual Language Models (VLMs) for video understanding. The Twelve Labs API enables search, summarization, and classification across video content at scale.
Multimodal AI research with a focus on video foundation models. The team works on training large-scale VLMs on massive video datasets. Read the blog →
Flat, startup-style organization with ~100 employees across San Francisco and Seoul. Small, autonomous teams with high ownership and direct product impact.