magic conch Enternalcode
Stars
An AI SKILL that provide design intelligence for building professional UI/UX multiple platforms
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Generate audiobooks from e-books, voice cloning & 1158+ languages!
Transform your favorite cities into beautiful, minimalist designs. MapToPoster lets you create and export visually striking map posters with code.
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Text-audio foundation model from Boson AI
Sharp Monocular View Synthesis in Less Than a Second
Free English to Chinese Dictionary Database
⚡️ Python client for the unofficial ChatGPT API with auto token regeneration, conversation tracking, proxy support and more.
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
LiYing is an automated photo processing program designed for automating the post-processing workflow of ID photos in general photo studios. | LiYing 是一套适用于自动化 完成一般照相馆后期证件照处理流程的照片自动处理的程序。
GraphRAG using Local LLMs - Features robust API and multiple apps for Indexing/Prompt Tuning/Query/Chat/Visualizing/Etc. This is meant to be the ultimate GraphRAG/KG local LLM app.
Empowering RAG with a memory-based data interface for all-purpose applications!
idiap / coqui-ai-TTS
Forked from coqui-ai/TTS🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
The world's 1st open source face recognition SDK for Windows and Linux (Face detection, Face landmark extraction, Face feature extraction, Face template mathcing)
A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.