The AI video leaderboard reshuffled completely in early 2026. Sora shut down on March 24th (app closing April 26, API ending September 24). Seedance 2.0 rolled out in March. Google made Veo 3.1 free. And Wan 2.7 dropped as the new open-source leader. Here's where everything actually stands right now (updated April 2026).
The Rankings (Artificial Analysis Arena, March 2026)
Seedance 2.0 from ByteDance tops the leaderboard with an Elo of 1,269. Kling 3.0 from Kuaishou sits at 1,248. Runway Gen-4.5 is close behind at 1,247. Below them, Veo 3.1 and Grok Imagine round out the top five. These are the models that matter.
Seedance 2.0 (ByteDance)
The highest-ranked model, period. Seedance 2.0 is a unified multimodal architecture that accepts text, image, audio, and video inputs. It outputs native 2K resolution and runs 30% faster than version 1.5.
The catch: it's only available in select markets through CapCut - Thailand, Indonesia, Philippines, Vietnam, Malaysia, Brazil, and Mexico. No US or European access yet. If you can get credits, the output quality is unmatched.
ByteDance used Seedance for the 2026 CCTV Spring Festival Gala - the first major public production powered by AI video.
Kling 3.0 (Kuaishou)
Released February 5, 2026. The most practical choice for studios worldwide because it has global API access at $0.075 per second (roughly $4.50 per minute).
What's new in 3.0: native 4K at 60fps, synchronized audio generation, Motion Control (extract motion from reference videos and apply to static images), professional camera controls (pan, tilt, zoom, dolly, rack focus), 16-bit HDR support, and a 7-in-1 multi-modal editor.
This is what most professional studios are using as their primary generation tool right now.
Google Veo 3.1
The first mainstream model with true 4K output at 3840x2160 and 60fps. Veo 3.1 also generates native audio - conversations, sound effects, and music. Available through the Gemini API, Google AI Studio, Vertex AI, and now Google Vids.
As of April 2, 2026, Veo 3.1 is free for all Google account holders (10 generations per month) via Google Vids. A new Veo 3.1 Lite variant launched in April - 50% cheaper than Veo 3.1 Fast at the same speed, making it the most cost-effective entry point for AI video. Veo 3.1 Upscaling is also live as a standalone tool for 1080p/4K enhancement.
Combined with Google's Nano Banana 2 for image generation, Uni-1 from Luma as a competitor, and Gemini for text understanding, the image-to-video pipeline options have never been stronger.
Grok Imagine 1.0 (xAI)
Launched February 2026. Generated 1.245 billion videos in its first 30 days. Grok supports multi-image input (up to 7 reference images), video extension up to 30 seconds, and video story creation (announced March 25, 2026).
Currently outputs at 720p - lower resolution than competitors, but the speed and accessibility through the X platform make it valuable for rapid prototyping.
Runway Gen-4.5
Elo 1,247 - essentially tied with Kling 3.0. Runway excels at physics simulation, temporal consistency, and precise controllability. The company raised $315M in February 2026 at a $5.3B valuation and is pushing into world models and robotics.
For pure cinematic quality, Runway remains the Western standard. Their Gen-4.5 handles text-to-video and image-to-video in 2-10 second clips with superior physics and motion.
Hailuo 2.3 (MiniMax)
Enhanced dynamic expressions, improved physics, better character movement fluidity, and new stylization options (anime, illustration, ink wash, game CG). Near-photorealistic lighting with natural shadow and color transitions.
Sora - The Shutdown
OpenAI shut down the Sora app on March 24, 2026, just six months after launch. The reason: $15M daily inference costs against $2.1M in lifetime revenue. Downloads had dropped 75% from their November peak. The Disney deal ($1B planned investment) was cancelled.
The web app closes permanently April 26, 2026. The API follows September 24, 2026. A successor codenamed "Spud" is in development - focused on world models for enterprise, targeting July 2026.
What Studios Actually Use
The studios listed on StudioList don't rely on a single model. The typical professional stack in March 2026:
Primary generation: Kling 3.0 (global access, 4K, audio) or Seedance 2.0 (if you have access)
Image generation: Midjourney V8 Alpha and Nano Banana 2
Image refinement: Photoshop with Generative Fill
Open source workflows: ComfyUI with Wan 2.7 or LTX-2.3
Post-production: DaVinci Resolve or Premiere Pro
Cost Comparison Table (April 2026)
| Model | Per Second | Max Resolution | Max Length | Audio | Entry Plan | Best For |
|---|---|---|---|---|---|---|
| Veo 3.1 (Free) | Free | 1080p | 8s | Yes | Free (10 gens/mo) | Testing, first-time users |
| Veo 3.1 Lite | ~$0.08 | 720p | 8s | Yes | Included with AI Plus | Budget social content |
| Seedance 2.0 | ~$0.14 | 1080p | 15s | Yes | Via Dreamina/CapCut | Highest quality output |
| Kling 3.0 | $0.08-0.17 | 4K @ 60fps | 15s | Yes | $6.99/mo (Standard) | Best all-rounder, global access |
| Runway Gen-4.5 | ~$0.12/credit | 2K | 10s | No | $12/mo (annual) | Physics, consistency |
| Veo 3.1 Fast | $0.15 | 1080p | 8s | Yes | $8/mo (AI Plus) | Quick drafts with audio |
| Veo 3.1 Standard | $0.40 | 4K | 8s | Yes | $22/mo (AI Pro) | Production with audio |
| Grok Imagine | ~$0.05 | 720p | 30s | Yes | $8-16/mo (X Premium) | Rapid prototyping, long clips |
| Hailuo 2.3 | ~$0.25-0.40 | 1080p | 6s | No | $15/mo (1,000 credits) | Stylization, character motion |
| Luma Ray 3.14 | ~$0.15-0.30 | 1080p | 10s | No | $10/mo | Creative experimentation |
| Midjourney V8 | N/A (images) | 2K native | N/A | N/A | $10/mo | Concept art, storyboarding |
New since March: Veo 3.1 is now free (10 gens/month) for all Google accounts. Veo 3.1 Lite launched at 50% less cost. Kling 3.0 plans start at $6.99/mo with a free tier (66 daily credits). Runway annual pricing starts at $12/mo.
Important notes: Per-second costs are approximate and vary by mode, resolution, and whether audio is enabled. Most platforms use credit systems where the effective cost depends on your plan tier. Iterating on a single usable clip typically requires 5-20+ generations, so multiply these costs accordingly for real project budgets.
The gap between models is narrowing. The gap between studios that know how to use them and those that don't is widening.
How to pick the right model for your shot
The most common mistake teams make is picking a single model and using it for everything. Each model has a distinct strengths profile and you should be matching shots to models, not the other way around. Here is the decision tree that professional studios use as of April 2026.
Is this a hero shot where quality is the only thing that matters, and you have market access via CapCut? Use Seedance 2.0. It produces the highest quality output on the Arena leaderboard and its multimodal architecture handles text, image, audio, and video inputs in a unified pipeline.
Is this a shot with precise physical interactions - a hand grabbing an object, a ball bouncing, cloth draping? Use Runway Gen-4.5. Runway's physics simulation is still ahead of the pack, and the Motion Brush lets you paint exactly which parts of the frame should move and how.
Is this a shot that needs camera control - a specific push-in, tilt, or rack focus? Use Kling 3.0. Kling's professional camera controls (pan, tilt, zoom, dolly, rack focus) are the most director-friendly controls in any model, and the global API makes it practical for production pipelines.
Is this a shot that needs synchronized audio generated alongside the video? Use Kling 3.0 or Veo 3.1. Both generate native audio (dialogue, ambient, music) that is locked to the video timeline. Veo 3.1 is now free for 10 monthly generations via Google Vids if you are just testing.
Is this a shot that needs to extend an existing clip or continue a sequence? Use Grok Imagine. Its 30-second maximum length and video extension feature are ahead of most competitors for longer sequences.
Is this a stylised or illustrative shot - anime, ink wash, game CG, or highly stylised colour? Use Hailuo 2.3. Its stylisation modes are the most developed in the current field.
Is this a shot that needs to be completely custom or brand-fine-tuned? Drop into ComfyUI with Wan 2.7 or LTX-2.3 and build a bespoke pipeline with LoRA adapters. This is slower but gives you output no SaaS platform can match.
Apply this decision tree shot by shot and your per-clip quality will jump noticeably. Studios that do this well produce work that looks like it came from a 30-person team. Studios that pick one tool and stick with it produce work that all looks the same.
The models to avoid or deprioritise
Not every model on the market is worth your time. Three categories of tools should be skipped as of April 2026.
Shutdown or sunsetting platforms. Sora is gone - the app shut on March 24 and the API follows September 24. Do not build workflows that depend on it. OpenAI's successor codenamed Spud is targeted for July 2026 but has no public preview yet.
Single-trick wrappers. A lot of 2024-era platforms were thin wrappers around Stable Diffusion or early Runway APIs. Most of them have not kept up with the pace of model releases and are now running outdated backends at premium prices. If a platform cannot tell you exactly which underlying model it is using, treat that as a red flag.
Closed-source enterprise-only tools. Some models are marketed as "enterprise AI video" with six-figure annual contracts and no self-serve access. Unless you are a Fortune 500 buying for global broadcast workflows, these tools are a tax for bureaucracy. Working studios almost never use them.
Where to see these models in real production
The studios on StudioList publish real case studies showing which models they used on specific projects. The Dor Brothers lean heavily on Kling 3.0 and ComfyUI. Secret Level uses Runway for physics-critical VFX work. Private Island blends Seedance 2.0 with traditional compositing. Browse studio profiles to see tool breakdowns per project, and use Get Matched if you want a studio that is fluent in a specific model you care about.
How the leaderboard will move in the next 6 months
Nothing in AI video stays still for long. Based on the pipeline of announcements and model releases, here is what is likely to shift between April and October 2026 - useful context for any budgeting or tool-selection decisions you are making now.
Seedance 2.0 opens access beyond CapCut. ByteDance has strong incentive to monetise the model globally, and the pressure from Kling's global availability makes limited-market access unsustainable long-term. Expect direct API access to Seedance 2.0 through fal.ai or Replicate by Q3 2026. When that happens, Seedance becomes a default for hero shots at any studio with a real budget.
Runway Gen-5 drops. Runway has been quiet for most of 2026 after the Gen-4.5 release, but their $315M raise at a $5.3B valuation signals a major next-gen push. The rumours point to a focus on world models and temporal consistency over sequences longer than 10 seconds. If Gen-5 ships with genuine long-form temporal coherence, it will be the first model that can carry a full narrative shot without character drift.
Open source closes the gap faster than expected. Wan 2.7 and LTX-2.3 are already 60-70% of the quality of the best commercial models at zero marginal cost. The Wan team has hinted at a 3.0 release targeting full parity with Kling 3.0, and if they hit it, the economics of AI video production shift dramatically. Any studio currently spending $5,000+/month on credits should be running a parallel experiment with a dedicated GPU and an open-source pipeline.
Sora's replacement ships. OpenAI's Spud project is targeted for July 2026. The focus is on world models for enterprise rather than consumer social content, which is a completely different positioning from the original Sora. Worth watching but not worth planning your pipeline around until it has real output on the leaderboard.
Pricing pressure continues. With Veo 3.1 now free for light users and Kling 3.0 starting at $6.99/month, the floor of the market has collapsed. Expect every platform to drop a free tier within the next 90 days if they have not already. This is a good time to be a buyer.
If you pick a model for a project starting today, pick the one that is best for the shot - do not pick based on predictions about what might launch next quarter. The models available right now are strong enough to ship professional work, and betting on a future release is how teams end up missing deadlines.