ComfyUI-QwenVL custom node: Integrates the Qwen-VL series, including Qwen2.5-VL and the latest Qwen3-VL, with GGUF support for advanced multimodal AI in text generation, image understanding, and video analysis.
Updated Feb 10, 2026 - Python
Powerful ComfyUI custom node built on the FlashVSR V1.1 model, facilitating real-time diffusion-based video super-resolution for streaming applications.
ComfyUI-MiniMax-Remover is a custom node for ComfyUI that enables fast and efficient object removal using minimax optimization. It works in two stages: first, it trains a remover with a simplified DiT model; then it distills a robust version using CFG guidance and fewer inference steps.
A ComfyUI integration for FireRedTTS-2, a real-time multi-speaker TTS system enabling high-quality, emotionally expressive dialogue and monologue synthesis. Leveraging a streaming architecture and context-aware prosody modeling, it supports natural speaker turns and stable long-form generation, ideal for interactive chat and podcast applications.
A clean, efficient ComfyUI custom node for VoxCPM TTS (Text-to-Speech) functionality. This implementation provides high-quality speech generation and voice cloning capabilities using the VoxCPM 1.5 model.
A powerful OCR node for ComfyUI that integrates the DeepSeek-OCR model from Hugging Face.
A set of custom nodes for compositing in ComfyUI.
Enhance video quality in real time with FlashVSR, a cutting-edge diffusion-based method for streaming video super-resolution.
Deploy a local Qwen3.5-9B multimodal AI with GPU inference supporting web search, image queries, file reading, and an OpenAI-compatible API.
Enable text-to-speech with Qwen TTS, a simple API solution that seamlessly integrates into your applications using Docker and Home Assistant.
Explore Qwen-3.5-2B's multimodal vision-language features with an interactive Gradio demo for image and video tasks in real time.
Convert various document formats into high-quality audiobooks with Qwen3 TTS Voice Model for natural speech and voice cloning.
Run the Qwen3.5-35B MoE model on an RTX 5090 with vLLM using NVFP4 quantization for fast, efficient text generation and extended context length support.
Run Qwen3.6-27B models on a single RTX 3090 GPU using optimized inference configs. See club-3090 for active development and updated configurations.