A collection of powerful custom nodes that extend ComfyUI's capabilities for the LTX-2 video generation model.
LTX-2 is built into ComfyUI core (see it here), making it readily accessible to all ComfyUI users. This repository hosts additional nodes and workflows to help you get the most out of LTX-2's advanced features.
To learn more about LTX-2 See the main LTX-2 repository for model details and additional resources.
Before you begin using an LTX-2 workflow in ComfyUI, make sure you have:
We recommend using the LTX-2 workflows available in Comfy Manager.
The nodes will appear in your node menu under the “LTXVideo” category. Required models will be downloaded on first use.
The ComfyUI-LTXVideo installation includes several example workflows. You can see them all at:
ComfyUI/custom_nodes/ComfyUI-LTXVideo/example_workflows/
LTX-2.3 Workflows:
Text/image to video full/distilled model; single stageText/image to video distilled model; two stages (with upsampling)IC-LoRA distilled model depth + human pose + edgesIC-LoRA distilled model I2V motion trackingIC-LoRA distilled model HDRIC-LoRA distilled model Lipdub; two stages (with upsampling)IC-LoRA distilled model pixel spatial upscalingText to audio distilled model; single stageOlder Workflows (LTX-2.0):
Text to video full modelText to video distilled model (Fast)Image to video full modelImage to video distilled model (Fast)Video to video detailerIC-LoRA distilled model (depth + human pose + edges)IC-LoRA distilled model with downscaled reference latentsWe introduce a new Union IC-LoRA model that combines depth and edge (canny) control conditions into a single unified LoRA.
The union LoRA is trained to understand and respond to both control signals (depth maps and edge maps) within a single model. The model learns to:
We provide an HDR IC-LoRA that generates linear HDR video encoded in ARRI LogC3, enabling workflows that output high-dynamic-range content suitable for grading and EXR export.
LTXVHDRDecodePostprocess node decodes these back to linear HDR values.OPENCV_IO_ENABLE_OPENEXR=1 in the environment before starting ComfyUI. The exported EXR sequence is best viewed in DJV (or DJV for macOS).We provide a Lipdub IC-LoRA that dubs or rephrases speech in video. Given a source video and a text prompt containing the desired dialogue, it generates new lip movements and audio that match the target text while preserving the speaker's identity.
We provide Pixel Spatial Upscaler IC-LoRAs that creatively upscale low-resolution video by synthesizing fine detail rather than simply interpolating pixels. Given a low-resolution reference clip, the model re-renders it at 2× or 4× resolution with generative spatial detail — making it a creative upsampler, not a pixel-accurate refiner.
LTX-2 is a single joint audio/video transformer, but it can generate audio on its own. The LTXVAudioOnlyModel node puts the model into audio-only mode for text-to-audio, with no video output.
run_vx, a2v_cross_attn and v2a_cross_attn flags off, so the audio is denoised with no dependence on the video latent and the video stream is skipped. This matches the reference single-stage T2A pipeline's video=None behavior.[video, audio], so the sampler still needs a video latent at index 0. Use the LTXVAudioOnlyEmptyVideoLatent node (a fixed 64x64 single-frame placeholder, no params to tweak) joined with the audio latent via LTXVConcatAVLatent; with LTXVAudioOnlyModel active it is never attended to and adds negligible cost.LTXVAudioVAEDecode extracts the audio directly from the joint latent, then save it with a standard built-in audio node (for example Save Audio (FLAC)).Download the following models:
LTX-2.3 Model Checkpoint - Choose and download one of the models to COMFYUI_ROOT_FOLDER/models/checkpoints folder.
* ltx-2.3-22b-dev.safetensors
* ltx-2.3-22b-distilled-1.1.safetensors
Spatial Upscaler - Required for current two-stage pipeline implementations in this repository. Download to COMFYUI_ROOT_FOLDER/models/latent_upscale_models folder.
* ltx-2.3-spatial-upscaler-x2-1.1.safetensors
* ltx-2.3-spatial-upscaler-x1.5-1.0.safetensors
Temporal Upscaler - Required for current two-stage pipeline implementations in this repository. Download to COMFYUI_ROOT_FOLDER/models/latent_upscale_models folder.
* ltx-2.3-temporal-upscaler-x2-1.0.safetensors
Distilled LoRA - Required for current two-stage pipeline implementations in this repository (except DistilledPipeline and ICLoraPipeline). Download to COMFYUI_ROOT_FOLDER/models/loras folder.
* ltx-2.3-22b-distilled-lora-384-1.1.safetensors
Gemma Text Encoder Download all files from the repository to COMFYUI_ROOT_FOLDER/models/text_encoders/gemma-3-12b-it-qat-q4_0-unquantized.
* Gemma 3
LoRAs Choose and download to COMFYUI_ROOT_FOLDER/models/loras folder.
* ltx-2.3-22b-ic-lora-union-control-ref0.5.safetensors
* ltx-2.3-22b-ic-lora-motion-track-control-ref0.5.safetensors
* ltx-2.3-22b-ic-lora-hdr-0.9.safetensors
* ltx-2.3-22b-ic-lora-lipdub-0.9.safetensors
* ltx-2-19b-ic-lora-detailer.safetensors
* ltx-2-19b-ic-lora-pose-control.safetensors
* ltx-2-19b-lora-camera-control-dolly-in.safetensors
* ltx-2-19b-lora-camera-control-dolly-left.safetensors
* ltx-2-19b-lora-camera-control-dolly-out.safetensors
* ltx-2-19b-lora-camera-control-dolly-right.safetensors
* ltx-2-19b-lora-camera-control-jib-down.safetensors
* ltx-2-19b-lora-camera-control-jib-up.safetensors
* ltx-2-19b-lora-camera-control-static.safetensors
* ltx-2.3-22b-ic-lora-instant-shave-0.9.safetensors
* ltx-2.3-22b-ic-lora-colorization-0.9.safetensors
* ltx-2.3-22b-ic-lora-cross-eyed-0.9.safetensors
* ltx-2.3-22b-ic-lora-day-to-night-0.9.safetensors
* ltx-2.3-22b-ic-lora-deblur-0.9.safetensors
* ltx-2.3-22b-ic-lora-decompression-0.9.safetensors
* ltx-2.3-22b-ic-lora-in-outpainting-0.9.safetensors
$ claude mcp add ComfyUI-LTXVideo \
-- python -m otcore.mcp_server <graph>