Working at Fireworks AI in 2026: The Ex-Meta Team Powering LLM Inference

Q: Who founded Fireworks AI?

Fireworks AI was founded in 2022 by Lin Qiao, former Head of PyTorch at Meta, along with co-founders from Meta's AI infrastructure team including Benny Chen, Chenyu Zhao, Dmytro Dzhulgakov, Dmytro Ivchenko, and James Reed.

Q: What companies use Fireworks AI?

Fireworks AI powers inference for companies including Cursor, Perplexity, Notion, Sourcegraph, Uber, DoorDash, Shopify, and Upwork. The platform serves over 10,000 enterprise customers.

Q: What is Fireworks AI's Glassdoor rating?

Fireworks AI has a Glassdoor rating of approximately 3.6 out of 5.0, based on a small number of reviews (~8). Employees highlight world-class technical peers and cutting-edge work on LLM inference, while noting startup intensity, a 2.9/5 work-life balance score, and limited structure.

Most people who use Cursor or Perplexity don't think about what makes the autocomplete feel instant, or why search answers appear before they finish reading the question. Behind both products — and behind Notion's AI features, Uber's internal ML, and DoorDash's recommendation engine — sits Fireworks AI, a company that has become the invisible infrastructure layer powering much of the AI industry's real-time intelligence.

Founded in 2022 by Lin Qiao, former Head of PyTorch at Meta, Fireworks AI has grown from a six-person team of ex-Meta AI infrastructure engineers into a $4 billion company with 189 employees, $315M in annualized revenue, and over 10,000 enterprise customers. Their core bet: that the real bottleneck in AI isn't training models — it's serving them fast enough for production workloads that demand sub-second latency at massive scale.

That bet is paying off. But what does it actually feel like to work there? Here's what we found.

Fireworks AI at a Glance

Founded	2022
Headquarters	Redwood City, CA
CEO	Lin Qiao (ex-Head of PyTorch, Meta)
Company Size	~189 employees
Valuation	$4B (Series C, Oct 2025)
Total Funding	$327M+
ARR	~$315M (Feb 2026); reached ~$800M by May 2026
Glassdoor Rating	3.6 / 5.0 (limited reviews ~8)
Work-Life Balance	2.9 / 5.0
Culture Values	Eng-Driven, Ship Fast, Many Hats, Learning

$4B

Valuation

416%

YoY Revenue Growth

10K+

Enterprise Customers

The numbers tell a story of explosive growth. Fireworks AI hit $315M in annualized revenue in February 2026, up 416% year-over-year. At just 189 employees, that translates to roughly $1.67M in revenue per employee — a figure that rivals the best enterprise software companies in the world. This isn't a company burning through capital chasing growth. It's a company where the product sells because the latency numbers are simply better than anything else available.

The Founding Team: Why It Matters

Understanding Fireworks AI's culture requires understanding where its founders came from. This isn't a team of product managers who decided to build an AI wrapper. These are the people who built PyTorch itself.

Lin Qiao (CEO) — Former Senior Director of Engineering at Meta, where she led 300+ engineers and oversaw PyTorch, Caffe2, and Meta's core AI training infrastructure. Ph.D. in Computer Science from UC Santa Barbara.
Dmytro Dzhulgakov — Former core maintainer of PyTorch at Meta. Knows the framework's internals better than almost anyone alive.
James Reed — Former PyTorch compiler lead at Meta. Built the optimization layers that make model execution fast.
Benny Chen — Former Meta ads infrastructure lead. Brings experience in systems that serve billions of requests per day.
Chenyu Zhao — Former Google Vertex AI lead. Understands enterprise ML serving from the cloud provider side.

This pedigree matters for two reasons. First, it means the founding team has already solved inference problems at the largest possible scale — PyTorch powers the majority of the world's AI research. Second, it shapes the culture. When the CEO wrote the framework your models run on, engineering credibility is built into the DNA of the company. Decisions are made by the people closest to the metal, not by MBA-holders with pitch decks.

What the Culture Actually Feels Like

Fireworks AI's culture is defined by a small set of characteristics that emerge directly from its founding team and stage: extreme technical depth, high autonomy, rapid iteration, and the kind of intensity that comes with being a 189-person company doing $315M in revenue with customers who can't afford downtime.

Engineering-driven to the core

At most companies, "engineering-driven" is an aspirational label. At Fireworks AI, it's structural. The company was founded by infrastructure engineers, and the product is literally a performance-optimization platform. The technical problems — custom CUDA kernels, speculative decoding, GPU memory management, multi-model routing — are the kind that attract engineers who want to work close to hardware. You're not building CRUD apps. You're squeezing microseconds out of GPU inference pipelines while maintaining correctness at scale.

Employee Pro "World-class team of ex-Meta AI infrastructure engineers — the caliber of technical peers here is unmatched"

The team's technical credibility means engineering decisions carry weight. There's no "product said we need this, so build it regardless of whether it makes sense" dynamic. When customers like Cursor need speculative decoding that outperforms GPT-4 in both speed and accuracy, the engineers who build it are also the ones who decide how to architect it. That level of ownership is rare, especially at a company growing this fast.

Many hats, by necessity and design

At 189 employees serving 10,000+ enterprise customers, there's no room for narrow specialization. A systems engineer might find themselves debugging a CUDA kernel in the morning, writing customer-facing documentation in the afternoon, and jumping on a call with an enterprise customer's ML team before end of day. Fireworks AI explicitly embraces this many-hats culture — it's not understaffing disguised as culture, it's the genuine cross-functional work that comes with building a technically complex product at a small company with massive traction.

Employee Pro "Impact-driven culture with lack of bureaucracy — visibility into different functions and ability to shape the company"

For engineers who thrive on breadth and hate being boxed into a narrow lane, this is compelling. For those who want deep specialization and clear role boundaries, it will feel chaotic.

The intensity trade-off

The 2.9/5 work-life balance score doesn't lie. Fireworks AI is a startup serving production AI workloads for some of the most demanding companies in tech. When Cursor's Fast Apply feature depends on your inference platform, downtime isn't theoretical — it means millions of developers can't code. When Uber's ML pipeline depends on your latency guarantees, "we'll fix it Monday" isn't an option.

Employee Con "Early-stage startup intensity with long hours expected and a high performance bar"

Employee Con "Small team and rapidly evolving product means less structure, process, and career laddering"

The candid feedback from employees is consistent: the work is intellectually thrilling but physically demanding. The lack of formal career laddering is a real gap for engineers who need clear promotion criteria. And the pace — while exciting for some — can grind down people who need predictable schedules. If you're optimizing for work-life balance, look at companies like Notion (4.2 WLB), Linear (4.4 WLB), or HubSpot (4.1 WLB) instead.

What You'd Actually Work On

Fireworks AI's product is a cloud-based platform for running, fine-tuning, and deploying open-source large language, vision, and multimodal models. The core technical challenge is making inference absurdly fast — and they're winning. Their FireAttention custom CUDA kernels consistently benchmark as the highest-throughput inference engine in the GPU-based tier.

Technical stack

CUDA C++ Python PyTorch Kubernetes GPU Clusters Triton

The work spans several domains that rarely coexist in one company:

Kernel engineering. Writing custom CUDA kernels that optimize attention mechanisms, memory access patterns, and batch scheduling. This is the lowest-level work — the kind that requires thinking in terms of GPU warps, shared memory banks, and cache line alignment.
Inference serving. Building the distributed serving infrastructure that routes requests across GPU clusters, handles model loading/unloading, and maintains latency guarantees under variable load. Think of it as building a CDN, but for AI model predictions instead of static assets.
Fine-tuning platform. Managing the full pipeline from supervised fine-tuning (SFT) through Direct Preference Optimization (DPO) and reinforcement fine-tuning. Fireworks supports 400+ models and offers on-demand dedicated GPU deployments up to NVIDIA B300s.
Speculative decoding. The technique that powers Cursor's Fast Apply — predicting likely next tokens in parallel to speed up generation. Getting this right requires deep understanding of both model architectures and systems optimization.
Enterprise reliability. SOC 2 Type II, HIPAA, GDPR, ISO certifications. When DoorDash and Uber are your customers, compliance isn't optional. Engineering these guarantees into a fast-moving system is its own challenge.

The real-world impact is tangible. Notion partnered with Fireworks to fine-tune models and reduced their AI feature latency from 2 seconds to 350 milliseconds — a 5.7x improvement that users feel immediately. Cursor's Fast Apply, powered by Fireworks' speculative decoding, outperforms GPT-4 in both speed and accuracy for code edits. These aren't theoretical benchmarks. They're production numbers from products used by millions.

Compensation

Compensation at Fireworks AI reflects its stage and the specialized talent it recruits. Based on employee-reported data, total compensation ranges vary significantly by level and location:

Software Engineer (mid-level): $175K–$220K base, with equity bringing total comp to $235K–$300K
Senior Engineer (Bay Area): Median total comp around $248K, with top-end packages reaching $412K+
Equity: At a $4B valuation with $315M ARR and 416% growth, early equity could be meaningful — especially if the company reaches the $15B valuation some reports suggest they're targeting

These numbers are competitive for a Series C startup but sit below the top end of frontier AI labs. Anthropic pays $300K–$490K for senior engineers, and OpenAI ranges from $350K–$550K. But Fireworks offers something those companies don't: the equity upside of a high-growth company that's still early enough for meaningful ownership. If Fireworks reaches even half the $15B valuation that recent reports suggest, current equity grants could 3–4x. See how Fireworks stacks up in our highest-paying AI companies ranking.

Who Thrives at Fireworks AI

Based on the culture signals, product focus, and employee feedback, Fireworks AI is strongest for a specific type of engineer:

Systems-level thinkers. If you care about GPU memory hierarchies, kernel optimization, and distributed systems performance, this is one of the few companies where that expertise directly drives revenue. You're not building internal tools that might never ship — you're building the product.
Builders who want breadth. The many-hats culture means you'll work across the stack, interact with customers, and shape product decisions. If you hate being boxed into a single domain, Fireworks gives you room to roam.
People who find energy in intensity. The pace is genuinely fast, the customers are demanding, and the stakes are high. If that energizes you rather than draining you, this is a remarkable place to be right now. The revenue growth (416% YoY) means the company is winning, not just surviving.
Engineers who want to work with the best. The founding team's PyTorch pedigree attracts similarly caliber hires. If you want to be the least experienced person in the room and learn constantly, the talent density here is exceptional.

Fireworks AI is not ideal for engineers who want clear career ladders, predictable schedules, or a fully remote work setup. The 2.9 WLB score reflects genuine intensity, and at 189 employees, formal HR processes like promotion frameworks are still being built. Compare this with a more structured environment at Databricks (7,000 employees, established career tracks) or Stripe (8,000 employees, world-class internal processes). If you need structure, those companies offer what Fireworks doesn't — yet.

How Fireworks AI Compares

Fireworks AI occupies a unique niche in the AI landscape. Here's how it stacks up against companies solving adjacent problems:

Together AI

Competitor

Together AI also offers open-source model inference and fine-tuning, but emphasizes training capabilities and research partnerships more heavily. Fireworks is more focused on production inference speed — if you care about kernel-level optimization, Fireworks goes deeper. If you want to work on training large models, Together might be a better fit.

View Together AI Profile →

Modal

Adjacent

Modal provides serverless GPU compute for AI workloads but operates at a different layer of the stack — it's more about compute orchestration than inference optimization. Fireworks' custom CUDA kernels go much deeper into the serving layer. Both companies share an engineering-driven culture and small-team intensity.

View Modal Profile →

Baseten

Adjacent

Baseten offers model inference infrastructure with a focus on developer experience and deployment simplicity. Fireworks differentiates on raw performance — their FireAttention kernels are purpose-built for throughput. Baseten's approach is more "make inference easy," while Fireworks' is "make inference impossibly fast."

View Baseten Profile →

Open Positions at Fireworks AI

Fireworks AI currently has 36 open positions on our platform, spanning systems engineering, ML infrastructure, and go-to-market roles. The company has announced plans to hire 150+ new AI researchers and engineers following their Series C, so expect the pace of hiring to accelerate. For a company generating $315M ARR with 189 employees, each new hire carries significant weight — and significant opportunity for impact.

Frequently Asked Questions

How many employees does Fireworks AI have in 2026?+

Fireworks AI has approximately 189 employees as of March 2026. The company was founded in 2022 and has grown rapidly — they plan to hire 150+ new AI researchers and engineers following their $250M Series C. For context across AI companies, see our employee count rankings.

What is Fireworks AI's valuation?+

Fireworks AI was valued at $4 billion following its $250M Series C round in October 2025. The round was co-led by Lightspeed Venture Partners, Index Ventures, and Evantic, with continued backing from Sequoia Capital. Reports suggest the company may be targeting a $15B valuation in its next round, reflecting 416% year-over-year revenue growth.

Who founded Fireworks AI?+

Fireworks AI was co-founded in 2022 by Lin Qiao (CEO), who was formerly Head of PyTorch and Senior Director of Engineering at Meta, leading 300+ engineers. Co-founders include Dmytro Dzhulgakov (PyTorch core maintainer), James Reed (PyTorch compiler lead), Benny Chen (Meta ads infra lead), and Chenyu Zhao (Google Vertex AI lead).

What companies use Fireworks AI?+

Fireworks AI powers production inference for Cursor (Fast Apply feature), Perplexity, Notion, Sourcegraph, Uber, DoorDash, Shopify, and Upwork, among 10,000+ enterprise customers. Use cases span code assistance, conversational AI, enterprise search, and agentic workflows.

What is the salary at Fireworks AI?+

Software engineer total compensation at Fireworks AI ranges from approximately $175K to $412K depending on level and location. The median total comp is around $235K, with Bay Area roles averaging $248K. These figures include base salary, equity, and bonuses. See our highest-paying AI companies for comparisons.

What is Fireworks AI's Glassdoor rating?+

Fireworks AI has a Glassdoor rating of 3.6 out of 5.0, based on a limited number of reviews (~8). Work-life balance is rated 2.9/5, reflecting startup intensity. Employees highlight the exceptional technical caliber of peers and cutting-edge inference work, while noting the demanding pace and lack of formal career structure. See our Fireworks AI culture profile for the full breakdown.

Explore Fireworks AI Jobs

See all 36 open roles at Fireworks AI alongside jobs from 118 companies — all with culture context.

View Fireworks AI Profile → Browse Fireworks AI Jobs →