Pierre Champion pchampio
-
PhD in Anonymizing speech - INRIA
- /usr/bin/nvim
- pchamp.fr
Lists (1)
Sort Name ascending (A-Z)
Stars
Generate degraded speech datasets for noise-robust ASR benchmarking
AI agent toolkit: unified LLM API, agent loop, TUI, coding agent CLI
High-Quality Voice Cloning TTS for 600+ Languages
VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control
Training code and dataset cleasing with Sidon
anon-uscf / uscf
Forked from kamperh/linearvcUniversal Speech Content Factorization
🌋LavaSR: Fast Speech restoration and enhancement
A state-of-the-art, open-source deepfake detection system built with PyTorch and EfficientNet-B0, featuring a user-friendly web interface for real-time image and video analysis.
A list of tools, papers and code related to Fake Audio Detection.
Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (primarily from UVR)
A Neovim plugin that provides VSCode-style diff rendering with two-tier highlighting (line + character level) in side-by-side and inline layouts, using VSCode's algorithm implemented in C.
Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"
Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoregressive.
zero-shot voice conversion & singing voice conversion, with real-time support
[ICASSP'23] Online speaker clustering
A Conversational Speech Generation Model
A TTS model capable of generating ultra-realistic dialogue in one pass.
This repository contains the code and experiments for the paper "Exploring Flan-T5 for Post-ASR Error Correction".
View HTTP/HTTPS requests made by any Linux program
A simple reader/parser for Matrix Market (.mtx) files to represent sparse matrix in text format.
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.