EmotiVoice Learning Resources - A Powerful Open-Source TTS Engine with Multi-Voice and Emotion Control

Last updated: 2025-01-03

Introduction to EmotiVoice

EmotiVoice is an open-source TTS engine developed by NetEase Youdao. Its main features are:

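- Completely free and open source
- Supports both Chinese and English
- Offers more than 2,000 different voices
- Can synthesize speech with a range of emotions (such as happy, excited, sad, and angry)
- Provides an easy-to-use web interface and a scripting interface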

Quick Start

Docker image

The easiest way to try EmotiVoice is to run the Docker image:

docker run -dp 127.0.0.1:8501:8501 syq163/emoti-voice:latest

Then open http://localhost:8501 in a browser to use the web interface.
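Because the container starts in the background, the web interface may take a moment to come up. The following is a minimal sketch of a readiness check, assuming the interface answers plain HTTP GET requests on the port published above. The function name `wait_for_web_ui` and its parameters are hypothetical and are not part of EmotiVoice.

```python
import http.client
import time
import urllib.request


def wait_for_web_ui(url="http://localhost:8501", timeout_s=60.0, interval_s=2.0):
    """Poll `url` until it answers with HTTP 200 or `timeout_s` elapses.

    Hypothetical helper for illustration; not part of EmotiVoice.
    """
    # The target is local, so bypass any proxy configured in the environment.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    deadline = time.monotonic() + timeout_s
    while True:
        try:
            with opener.open(url, timeout=5) as resp:
                if resp.status == 200:
                    return True
        except (OSError, http.client.HTTPException):
            # Connection refused, reset, or an HTTP error: the service
            # may still be starting, so keep polling.
            pass
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval_s)
```

Calling `wait_for_web_ui()` after the `docker run` command returns `True` once the page responds, or `False` if nothing answers within the timeout.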

Full installation

Create a conda environment:

conda create -n EmotiVoice python=3.8 -y
conda activate EmotiVoice

Install the dependencies:

pip install torch torchaudio
pip install numpy numba scipy transformers soundfile yacs g2p_en jieba pypinyin pypinyin_dict
python -m nltk.downloader "averaged_perceptron_tagger_eng"

Download the pretrained model files

Run inference
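Before running inference, it can help to confirm that the dependency installation step in the full installation actually succeeded in the active conda environment. The sketch below checks whether each package can be located without importing it. The list of import names is an assumption (each is taken to match its pip package name), and `missing_modules` is a hypothetical helper, not an EmotiVoice script.

```python
import importlib.util

# Assumed import names for the pip packages listed in the install step.
REQUIRED_MODULES = [
    "torch", "torchaudio", "numpy", "numba", "scipy", "transformers",
    "soundfile", "yacs", "g2p_en", "jieba", "pypinyin", "pypinyin_dict",
]


def missing_modules(names=REQUIRED_MODULES):
    """Return the top-level module names that cannot be found.

    find_spec() locates a module without importing it, so this stays
    fast even for heavy packages such as torch.
    """
    return [name for name in names if importlib.util.find_spec(name) is None]
```

Running `print(missing_modules())` inside the activated environment prints an empty list when every package is found. Any names it does print point to packages worth reinstalling with pip.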