Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punc...
Robust Speech Recognition via Large-Scale Weak Supervision
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single ...
Deepfakes Software For All
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Model for All.
ChatGLM-6B:开源双语对话语言模型 | An Open Bilingual Dialogue Language Model
Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in...
Simplified Chinese translation extension for AUTOMATIC1111's stable diffusion webui