Video Models

Comfy Headless supports multiple video generation models, each with curated presets that configure resolution, frame count, and inference steps. This page covers what is available, what hardware each model needs, and how to choose the right one.

| Model | VRAM | Quality | Speed | Best for |
|---|---|---|---|---|
| LTX-Video 2 | 12 GB+ | Excellent | Fast | General use, RTX 3080+ |
| Hunyuan 1.5 | 14 GB+ | Best | Slow | High quality, RTX 4080+ |
| Wan 2.1/2.2 | 6-16 GB | Great | Medium | Budget GPUs, efficiency |
| Mochi | 12 GB+ | Excellent | Slow | Text adherence |
| AnimateDiff | 6 GB+ | Good | Fast | Quick previews |
| SVD | 8 GB+ | Good | Medium | Image-to-video |
| CogVideoX | 10 GB+ | Good | Slow | Legacy support |

Models in bold are the recommended first choices for new users. The others remain fully supported for specific use cases.

Every model has one or more presets: pre-configured combinations of resolution, frame count, and inference steps that produce good results out of the box.

LTX-Video 2 is the best general-purpose choice. It balances quality and speed well and runs on any GPU with 12 GB or more of VRAM.

| Preset | Resolution | Frames | Steps | Use case |
|---|---|---|---|---|
| ltx_quick | 768x512 | 25 | 20 | Fast previews and iteration |
| ltx_standard | 1280x720 | 49 | 25 | General-purpose generation |
| ltx_quality | 1280x720 | 97 | 30 | High frame count, best quality |

Hunyuan 1.5 produces the highest visual quality but requires more VRAM and runs slower. Best suited for final renders rather than iteration.

| Preset | Resolution | Frames | Notes |
|---|---|---|---|
| hunyuan15_720p | 1280x720 | 121 | High quality at 720p |
| hunyuan15_1080p | 1920x1080 | | Uses super-resolution upscaling |

The Wan family offers excellent efficiency. The 1.3B parameter variant runs on GPUs with as little as 6 GB of VRAM, making it the most accessible option for budget hardware.

| Preset | Resolution | Frames | VRAM | Notes |
|---|---|---|---|---|
| wan_1.3b | 720x480 | 49 | 6 GB | Lightweight, budget GPUs |
| wan_14b | 1280x720 | 81 | 12 GB | Full quality Wan model |
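
To get a feel for how heavy the presets are relative to one another, the table values can be put into a small data structure. The sketch below is not part of Comfy Headless: `PresetSpec`, `PRESET_TABLE`, and `pixel_frames` are hypothetical names, and the numbers are copied by hand from the preset tables on this page (`hunyuan15_1080p` is left out because no frame count is listed for it). The product width x height x frames is only a rough proxy for the work done per inference step, not a measured cost.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PresetSpec:
    """Hypothetical stand-in for one preset row; not the library's own type."""

    name: str
    width: int
    height: int
    frames: int

    def pixel_frames(self) -> int:
        # Total pixels across all frames: a rough proxy for per-step work.
        return self.width * self.height * self.frames


# Values copied from the preset tables on this page.
PRESET_TABLE = [
    PresetSpec("ltx_quick", 768, 512, 25),
    PresetSpec("ltx_standard", 1280, 720, 49),
    PresetSpec("ltx_quality", 1280, 720, 97),
    PresetSpec("hunyuan15_720p", 1280, 720, 121),
    PresetSpec("wan_1.3b", 720, 480, 49),
    PresetSpec("wan_14b", 1280, 720, 81),
]

# Compare every preset against the lightest LTX preset.
baseline = PRESET_TABLE[0].pixel_frames()
for spec in sorted(PRESET_TABLE, key=PresetSpec.pixel_frames):
    ratio = spec.pixel_frames() / baseline
    print(f"{spec.name:<16} {spec.width}x{spec.height}, {spec.frames:>3} frames, "
          f"about {ratio:.1f}x ltx_quick")
```

Model size and step count also affect run time, so treat the ratios as a way to order presets within one model family rather than as timing predictions.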

Use the built-in recommendation function to let the library choose based on your GPU:

```python
from comfy_headless import get_recommended_preset

preset = get_recommended_preset(vram_gb=16)  # Returns "hunyuan15_720p"
preset = get_recommended_preset(vram_gb=8)   # Returns "wan_1.3b"
preset = get_recommended_preset(vram_gb=12)  # Returns "ltx_standard"
```

  • 6 GB VRAM: Use Wan 1.3B (wan_1.3b). It is the only model that fits comfortably.
  • 8-10 GB VRAM: AnimateDiff for quick previews, Wan 1.3B for better quality.
  • 12 GB VRAM: LTX-Video 2 is the sweet spot. Fast and high quality.
  • 14-16 GB VRAM: Hunyuan 1.5 at 720p for the best visual quality, or LTX-Video 2 for faster iteration.
  • 16 GB+ VRAM: Hunyuan 1.5 at 1080p with super-resolution for production renders.

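
The VRAM tiers above can be expressed as a simple threshold lookup. The sketch below is a hypothetical helper, `pick_preset`, and is not the implementation of `get_recommended_preset`. The exact cut-offs are assumptions, chosen so that 8, 12, and 16 GB give the same answers as the documented return values. The 8-10 GB tier falls back to `wan_1.3b` because this page lists no AnimateDiff preset name.

```python
def pick_preset(vram_gb: float) -> str:
    """Hypothetical helper mirroring the VRAM guidance on this page.

    Thresholds are assumptions, not the library's actual logic.
    """
    if vram_gb < 6:
        raise ValueError("The guidance on this page starts at 6 GB of VRAM")
    if vram_gb > 16:
        return "hunyuan15_1080p"  # production renders with super-resolution
    if vram_gb >= 14:
        return "hunyuan15_720p"   # best visual quality at 720p
    if vram_gb >= 12:
        return "ltx_standard"     # general-purpose sweet spot
    return "wan_1.3b"             # lightweight option for 6 GB up to 12 GB


for vram in (6, 8, 12, 16, 24):
    print(f"{vram} GB -> {pick_preset(vram)}")
```

In real code, prefer calling `get_recommended_preset` directly; a hand-written table like this is mainly useful for overriding the recommendation with your own policy.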
```python
from comfy_headless import VIDEO_PRESETS, ComfyClient

# Inspect all presets
for name, preset in VIDEO_PRESETS.items():
    print(f"{name}: {preset}")

# Generate with a specific preset
client = ComfyClient()
result = client.generate_video(
    prompt="a timelapse of clouds over a valley",
    preset="ltx_quality",
)
```

The VIDEO_PRESETS dictionary maps preset names to their full configuration. You can inspect it to see exactly what settings each preset uses.
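
Because presets are addressed by string name, a typo is an easy mistake to make before a long render. The sketch below shows one way to check a name up front and suggest the closest match. `check_preset_name` and `KNOWN_PRESETS` are hypothetical; the list is a stand-in containing only the preset names mentioned on this page, and whether the library performs its own validation is not covered here.

```python
import difflib

# Stand-in list of the preset names shown on this page.
# With the library installed, list(VIDEO_PRESETS) would supply the real keys.
KNOWN_PRESETS = [
    "ltx_quick",
    "ltx_standard",
    "ltx_quality",
    "hunyuan15_720p",
    "hunyuan15_1080p",
    "wan_1.3b",
    "wan_14b",
]


def check_preset_name(name: str, known=KNOWN_PRESETS) -> str:
    """Return the name unchanged if it is known, else raise with a suggestion."""
    if name in known:
        return name
    # get_close_matches returns the best fuzzy matches, or an empty list.
    suggestions = difflib.get_close_matches(name, known, n=1)
    hint = f" Did you mean {suggestions[0]!r}?" if suggestions else ""
    raise KeyError(f"Unknown preset {name!r}.{hint}")


print(check_preset_name("ltx_quality"))
try:
    check_preset_name("ltx_qualty")
except KeyError as err:
    print(err.args[0])
```

Passing `list(VIDEO_PRESETS)` as the `known` argument would keep the check in sync with whatever presets your installed version ships.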

Each video model requires specific custom nodes installed in your ComfyUI instance. See the Architecture page for the full list of required nodes per model.

  • Configure WebSocket progress tracking for long video generations in Configuration.
  • See the VideoSettings and VideoModel classes in API Reference.