Benchmarks · 2024

Image-to-Video Arena: Arena.ai Image-to-Video Leaderboard

Name: Image-to-Video Arena: Arena.ai Image-to-Video Leaderboard
Creator: Arena.ai
Published: 2024
Keywords: Image-to-Video Arena, AI benchmark, video model evaluation, Arena.ai

Head-to-head ranking for models that animate a still input image, with or without a text instruction.

Open Dataset

Scores are min-max normalized. Arena.ai publishes raw Bradley-Terry / Elo ratings; we rescale them to a 0–100 axis across every scored model so they sit next to accuracy-style benchmarks. Rankings stay the same as on arena.ai.

Models Tested

Top Score

100.0

Published

2024

Source

Arena.ai

How It Works

Image-to-Video Arena scores motion conditioned on a still input image. The user provides a photo, optionally adds a description of how it should move, and two anonymous models each generate a clip. Voters pick the better animation. The benchmark rewards subject preservation, plausible motion, and faithful interpretation of the prompt, the skills that matter for "make this photo move" product features.

Each comparison is anonymous. Both models receive the same input image and the same motion instruction, then produce a short clip. Bradley-Terry on pairwise wins yields a single rating per model, normalized to 0–100.

Dataset size

Anonymous A/B comparisons of clips generated from real reference images and motion prompts.

Mean score

46.5

Median score

55.7

Open / Closed

3 / 5

Top Scorers

#	Model	Lab	Source	Score
01	Seedance 2.0	ByteDance	Closed	100.0
02	Kling 2.5 Turbo	Kuaishou	Closed	76.5
03	Runway Gen-4.5	Runway	Closed	63.5
04	Veo 3.1	Google	Closed	61.8
05	Veo 3	Google	Closed	49.6
06	LTX-2 19B	Lightricks	Open	14.3
07	HunyuanVideo-1.5	Tencent	Open	6.7
08	Wan2.2-T2V-A14B	Alibaba	Open	0.0

Score Distribution

Open vs Closed Source

Gap on Image-to-Video Arena:+85.7pts closed leads

Top Open-Source Models

1LTX-2 19B14.3
2HunyuanVideo-1.56.7
3Wan2.2-T2V-A14B0

Top Closed-Source Models

1Seedance 2.0100
2Kling 2.5 Turbo76.5
3Runway Gen-4.563.5

Score vs Parameter Count

6 model(s) with undisclosed parameter counts not shown. Most closed-source labs do not publish model size.

Average Score by Lab

Google
55.7n = 2

Most Correlated Benchmarks

Video Arena
+0.89n = 8
Pearson r: −1 to +1. Positive means the two benchmarks rank models in similar order; negative means the opposite.

What It Captures Well

Tests a workflow that pure text-to-video models do not address.
Strong predictor of quality for photo-to-video product features.
Real prompts and real images, so the test stays current.

Where It Falls Short

Requires the model to accept image input: text-only generators are excluded.
Preference voting can favor dramatic camera moves over subtle, accurate ones.
Hard to interpret which dimension cost a model a vote.

Frequently Asked Questions

When should I prefer Image-to-Video over text-to-video?

Whenever you have a reference image you want to animate. Image-to-video is more controllable and tends to produce more consistent identities, but it is harder to swap subjects mid-clip.

Related Benchmarks

Based on score correlations across our database.

Pearson r +0.89

Free Monthly Report

The AI Build Report

The state of AI models, API prices, and what to run where. New every month, free.

Benchmarks · 2024

Image-to-Video Arena: Arena.ai Image-to-Video Leaderboard

Head-to-head ranking for models that animate a still input image, with or without a text instruction.

Open Dataset

Models Tested

Top Score

100.0

Published

2024

Source

Arena.ai

How It Works

Dataset size

Anonymous A/B comparisons of clips generated from real reference images and motion prompts.

Mean score

46.5

Median score

55.7

Open / Closed

3 / 5

Top Scorers

#	Model	Lab	Source	Score
01	Seedance 2.0	ByteDance	Closed	100.0
02	Kling 2.5 Turbo	Kuaishou	Closed	76.5
03	Runway Gen-4.5	Runway	Closed	63.5
04	Veo 3.1	Google	Closed	61.8
05	Veo 3	Google	Closed	49.6
06	LTX-2 19B	Lightricks	Open	14.3
07	HunyuanVideo-1.5	Tencent	Open	6.7
08	Wan2.2-T2V-A14B	Alibaba	Open	0.0

Score Distribution

Open vs Closed Source

Gap on Image-to-Video Arena:+85.7pts closed leads

Top Open-Source Models

1LTX-2 19B14.3
2HunyuanVideo-1.56.7
3Wan2.2-T2V-A14B0

Top Closed-Source Models

1Seedance 2.0100
2Kling 2.5 Turbo76.5
3Runway Gen-4.563.5

Score vs Parameter Count

6 model(s) with undisclosed parameter counts not shown. Most closed-source labs do not publish model size.

Average Score by Lab

Google
55.7n = 2

Most Correlated Benchmarks

Video Arena
+0.89n = 8
Pearson r: −1 to +1. Positive means the two benchmarks rank models in similar order; negative means the opposite.

What It Captures Well

Tests a workflow that pure text-to-video models do not address.
Strong predictor of quality for photo-to-video product features.
Real prompts and real images, so the test stays current.

Where It Falls Short

Requires the model to accept image input: text-only generators are excluded.
Preference voting can favor dramatic camera moves over subtle, accurate ones.
Hard to interpret which dimension cost a model a vote.

Frequently Asked Questions

When should I prefer Image-to-Video over text-to-video?

Whenever you have a reference image you want to animate. Image-to-video is more controllable and tends to produce more consistent identities, but it is harder to swap subjects mid-clip.

Related Benchmarks

Based on score correlations across our database.

Pearson r +0.89

The AI Build Report

The state of AI models, API prices, and what to run where. New every month, free.

Image-to-Video Arena: Arena.ai Image-to-Video Leaderboard

How It Works

Top Scorers

Score Distribution

Open vs Closed Source

Score vs Parameter Count

Average Score by Lab

Most Correlated Benchmarks

What It Captures Well

Where It Falls Short

Frequently Asked Questions

Related Benchmarks

Video Arena

VBench

Video Edit Arena

The AI Build Report

Image-to-Video Arena: Arena.ai Image-to-Video Leaderboard

How It Works

Top Scorers

Score Distribution

Open vs Closed Source

Score vs Parameter Count

Average Score by Lab

Most Correlated Benchmarks

What It Captures Well

Where It Falls Short

Frequently Asked Questions

Related Benchmarks

Video Arena

VBench

Video Edit Arena

The AI Build Report