A team of 23 researchers released LPM 1.0, a 17-billion-parameter Diffusion Transformer that generates real-time character video from audio input. The model handles full conversational performance, producing speaking, listening, micro-expressions, and natural motion while maintaining consistent character identity across unlimited-length sessions.
What Happened
LPM 1.0 (Large Performance Model) tackles what its creators call the "performance trilemma" in character animation: balancing expressiveness, real-time speed, and long-term identity stability. Previous approaches forced trade-offs between these three qualities. LPM 1.0 handles all three simultaneously.
Given a single character image and identity-aware references, the model generates listening videos from user audio and speaking videos from synthesized audio. Text prompts provide additional motion control. The full system runs at real-time speed, enabling practical deployment for interactive applications.
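The input-to-mode mapping described above can be sketched as a minimal interface. This is a hypothetical illustration, not code from the LPM release; every name here (`PerformanceInput`, `select_mode`, the field names) is an assumption made for clarity.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical sketch of the inputs the article describes; all names are
# illustrative, not taken from the released code.

@dataclass
class PerformanceInput:
    character_image: bytes             # the single reference image of the character
    identity_refs: list                # identity-aware reference frames
    audio: bytes                       # user audio or synthesized (TTS) audio
    audio_is_user: bool                # True -> audio came from the user
    text_prompt: Optional[str] = None  # optional motion control, e.g. "tilt head"

def select_mode(inp: PerformanceInput) -> str:
    """Map the audio source to the generation mode the article describes:
    listening videos from user audio, speaking videos from synthesized audio."""
    return "listening" if inp.audio_is_user else "speaking"
```

The point of the sketch is the branching, not the types: the same conditioned generator is steered into a listening or speaking performance by the provenance of the audio, with the text prompt layered on as extra motion control.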
Why It Matters
Real-time character animation at this quality level opens immediate applications for game developers, VTubers, live streamers, and anyone building conversational AI interfaces. The model's ability to maintain identity consistency across unlimited-length sessions solves one of the biggest pain points in AI-driven avatar systems, where character appearance tends to drift during extended interactions.
The team also introduced LPM-Bench, the first standardized benchmark for interactive character performance. This fills a gap in the field where no consistent evaluation framework existed.
Key Details
- Architecture: 17B-parameter Diffusion Transformer with multimodal conditioning
- Online LPM: A causal streaming variant distilled from the full model for low-latency interactive use
- Capabilities: Speaking, listening, emotional micro-expressions, and natural motion, all synchronized with audio
- Applications: Game NPCs, live-streaming characters, conversational AI agents, virtual production
The streaming variant, Online LPM, targets scenarios where latency matters. Distilling the full 17B model into a causal streaming generator lets it support real-time interaction without sacrificing the expressiveness that makes the output compelling.
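A causal streaming generator of this kind can be sketched as a chunked loop: each video chunk is produced from the newest audio plus a bounded window of past context, so frames are emitted as soon as each chunk arrives rather than after the whole session. This is a hedged sketch of the general technique, with `generate_chunk` standing in for the distilled model; none of these names come from the LPM paper.

```python
from collections import deque

# Sketch of a causal streaming loop in the spirit of Online LPM.
# generate_chunk is a placeholder for one pass of the distilled generator;
# all names are hypothetical.

def generate_chunk(audio_chunk: str, context: list) -> dict:
    # Consume the newest audio chunk plus a bounded window of past context,
    # produce the next video frames and a summary to carry forward.
    return {"frames": f"frames<{audio_chunk}>", "summary": audio_chunk}

def stream_performance(audio_chunks, context_len: int = 4):
    # Bounded context keeps memory constant, which is what makes
    # unlimited-length sessions feasible.
    context = deque(maxlen=context_len)
    for chunk in audio_chunks:
        out = generate_chunk(chunk, list(context))
        context.append(out["summary"])
        yield out["frames"]  # emitted per chunk: latency is per-chunk, not per-session

frames = list(stream_performance(["a0", "a1", "a2"]))
```

The design point is causality: because each step conditions only on past chunks, generation can keep pace with incoming audio instead of waiting for the full input, which is what distillation into a causal generator buys over the full bidirectional model.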
What to Do Next
Demos and technical details are available on the project page. The full research paper covers architecture decisions, training methodology, and LPM-Bench evaluation results. Game developers and avatar creators working on real-time character systems should evaluate how this approach compares to existing solutions like HeyGen Avatar V, which also targets identity-consistent avatar generation but through a different architectural approach.