安装依赖

建议使用python 3.8

conda create -n datasets_tools python==3.8
conda activate datasets_tools

使用cuda加速，需要提前装好cuda环境。

安装ffmpeg

pip install torch==2.0.1+cu117 torchvision==0.15.2+cu117 torchaudio==2.0.2 --index-url https://download.pytorch.org/whl/cu117
pip install funasr -i https://mirror.sjtu.edu.cn/pypi/web/simple
pip install modelscope -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install hdbscan umap joblib==1.1.0 ffmpeg-python --index-url https://pypi.tuna.tsinghua.edu.cn/simple --extra-index-url https://pypi.artrajz.cn/simple --prefer-binary

依赖可能没写完整，缺啥装啥

使用

可以分开执行需要的脚本，也可以直接运行main.py全部执行

dataset_tools

Commits

增加声道转换

docs

友链

更新安装ffmpeg说明

如果中间停顿过长则用1s静音代替

任务分成短音频和长音视频任务

README

安装依赖

使用