HappyHorse 1.1 Brings Audio-Native AI Video to ComfyUI

HappyHorse 1.1 is now available in ComfyUI as a partner node, and it generates video with synchronized audio in a single render pass. That means dialogue, sound effects, and visuals come out of one generation instead of being stitched together afterward. For creators already building inside ComfyUI, it slots directly into the node graph alongside the image and video tools they use every day.

What Happened

On June 24, 2026, the ComfyUI team announced that HappyHorse 1.1 had landed as a built-in partner node. The model handles three jobs from one interface: text-to-video, image-to-video, and reference-to-video. Each mode ships as a ready-made template, so you can drop in the image-to-video workflow template and start generating without wiring nodes by hand. The headline feature is audio-native output: the model produces dialogue and sound effects alongside the picture rather than leaving sound as a separate post step.

Why It Matters

Most AI video tools still hand you a silent clip, leaving you to source music, foley, and voice on your own. Generating audio inside the same pass collapses a multi-tool pipeline into one node, which matters for short episodic series, product commercials, and game cutscenes where sound carries half the story. It also keeps the work inside ComfyUI, the same place creators run local editors like Bernini-R for local video editing, so you do not have to bounce footage between apps to add a soundtrack.

Key Details

Version 1.1 focuses on consistency. Reference-to-video now accepts up to nine images and holds multiple characters in a scene without them blurring into each other. Motion is smoother frame to frame, and longer prompts hold up: the model retains context past 2,500 characters. Output runs at 720p or 1080p in 16:9, 9:16, 1:1, and other aspect ratios, with clip lengths from three to fifteen seconds. Because HappyHorse is one of ComfyUI's partner API nodes, the generation runs as a hosted service rather than a local checkpoint, so a recent ComfyUI build is all you need to call it. The team also published a reference-to-video template for multi-character scenes.

What to Do Next

Update ComfyUI to the latest version, then search "HappyHorse" in the Node Library or load one of the templates. Pick your mode, add a prompt or reference images, and generate. For a first test, try the text-to-video template with a short scene description and listen to how the dialogue and effects come back in sync. If you want a free, fully local alternative to compare against, our guide to LTX Director 2.0 in ComfyUI walks through editing video without an API node.

HappyHorse 1.1 Brings Audio-Native AI Video to ComfyUI

What Happened

Why It Matters

Key Details

What to Do Next

Keep reading

Gemini 3.5 Flash Adds Computer Use for AI Agents

Figma Config 2026: Code Layers, Motion, AI Agent Skills

Agentic Creative Suites: AI Agents Take the Canvas

What Happened

Why It Matters

Key Details

What to Do Next

Stay ahead of AI

Keep reading

Gemini 3.5 Flash Adds Computer Use for AI Agents

Figma Config 2026: Code Layers, Motion, AI Agent Skills

Agentic Creative Suites: AI Agents Take the Canvas

Stay ahead of Creative AI