avenstack/ComfyUI-AV-FunASR

ComfyUI FunAsr Nodes

Use FunASR for speech recognition and convert the result to text or subtitle files.

workflow

Updates

  • [ 1.0.1 ] 1. Faster execution. 2. Improved workflow. 3. Subtitles are saved with UTF-8 encoding by default.

Features

  • Speech recognition
  • Speech timestamp prediction
  • Speech-to-subtitle conversion
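
To illustrate what the speech-to-subtitle step involves, here is a minimal Python sketch that turns timestamped sentences into SRT text and writes it as UTF-8, the default noted for version 1.0.1. The input shape (a list of `(start_ms, end_ms, text)` tuples) and the function names `ms_to_srt_time`, `to_srt`, and `save_srt` are assumptions made for illustration; this is not the node's actual implementation.

```python
from pathlib import Path


def ms_to_srt_time(ms: int) -> str:
    """Format a millisecond offset as an SRT timestamp (HH:MM:SS,mmm)."""
    hours, rem = divmod(ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    seconds, millis = divmod(rem, 1_000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d},{millis:03d}"


def to_srt(segments):
    """Build SRT text from (start_ms, end_ms, text) tuples (hypothetical input shape)."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        timing = f"{ms_to_srt_time(start)} --> {ms_to_srt_time(end)}"
        blocks.append(f"{index}\n{timing}\n{text}\n")
    # Cues are separated by one blank line.
    return "\n".join(blocks)


def save_srt(segments, path):
    """Write the subtitles to disk with UTF-8 encoding."""
    Path(path).write_text(to_srt(segments), encoding="utf-8")
```

For example, `save_srt([(0, 1500, "你好"), (1500, 3000, "世界")], "out.srt")` would write two numbered cues to `out.srt`.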

Installation

cd ComfyUI/custom_nodes
git clone https://github.com/avenstack/ComfyUI-AV-FunASR.git
cd ComfyUI-AV-FunASR
pip install -r requirements.txt

Model download

  1. Speech recognition
  2. Voice activity detection (VAD)
  3. Speech timestamp prediction

Model directory: models/ASR/FunASR/iic

 iic
 ├── speech_fsmn_vad_zh-cn-16k-common-pytorch
 │  ├── README.md
 │  ├── am.mvn
 │  ├── config.yaml
 │  ├── configuration.json
 │  ├── example
 │  ├── fig
 │  └── model.pt
 ├── speech_seaco_paraformer_large_asr_nat-zh-cn-16k-common-vocab8404-pytorch
 │  ├── README.md
 │  ├── am.mvn
 │  ├── asr_example_hotword.wav
 │  ├── config.yaml
 │  ├── configuration.json
 │  ├── example
 │  ├── fig
 │  ├── model.pt
 │  ├── seg_dict
 │  └── tokens.json
 └── speech_timestamp_prediction-v1-16k-offline
    ├── README.md
    ├── am.mvn
    ├── config.yaml
    ├── configuration.json
    ├── example
    ├── model.pt
    ├── seg_dict
    └── tokens.json
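
As a quick sanity check of the layout above, the following Python sketch reports which of the three model folders are missing or incomplete. The folder names are copied from the tree; the helper name `missing_models` and the choice to check only `model.pt` and `config.yaml` are assumptions for illustration, not part of this repository.

```python
from pathlib import Path

# Folder names copied from the directory tree above.
EXPECTED_MODELS = (
    "speech_fsmn_vad_zh-cn-16k-common-pytorch",
    "speech_seaco_paraformer_large_asr_nat-zh-cn-16k-common-vocab8404-pytorch",
    "speech_timestamp_prediction-v1-16k-offline",
)


def missing_models(iic_dir):
    """Return the expected model folders that lack model.pt or config.yaml."""
    root = Path(iic_dir)
    missing = []
    for name in EXPECTED_MODELS:
        folder = root / name
        has_weights = (folder / "model.pt").is_file()
        has_config = (folder / "config.yaml").is_file()
        if not (has_weights and has_config):
            missing.append(name)
    return missing
```

Calling `missing_models("models/ASR/FunASR/iic")` from the ComfyUI root (assuming the model directory is relative to it) returns an empty list when all three models are in place.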

Acknowledgements
