Cantonese Speech Recognition Model

| | |
|---|---|
| AI learning and discussion QQ group | 873673497 |
| Website | turingevo.com |
| Email | wmx@turingevo.com |
| GitHub | https://github.com/turingevo |
| Hugging Face | https://huggingface.co/turingevo |

Base model

| Platform | ID |
|---|---|
| modelscope | iic/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch |

Fine-tuning dataset

| Platform | ID |
|---|---|
| modelscope | modelscope/speech_asr_commonvoice_cantonese-CHS_trainsets |

Fine-tuning result

| Platform | ID |
|---|---|
| huggingface | turingevo/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch-lora |
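
To run the fine-tuned model locally, its files first need to be on disk. The following is a minimal sketch, assuming the `huggingface_hub` package is installed and that the repository can be fetched as a plain snapshot. The helper name `download_model` and the target directory are hypothetical. Whether the downloaded directory loads directly with FunASR is also an assumption, not something this README confirms.

```python
# Repository ID copied from the table above.
REPO_ID = (
    "turingevo/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-"
    "common-vocab8404-pytorch-lora"
)


def download_model(local_dir: str) -> str:
    """Hypothetical helper: fetch the repository files into local_dir.

    Returns the path of the local snapshot directory.
    """
    # Imported lazily so the module loads even without huggingface_hub.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=REPO_ID, local_dir=local_dir)


# Example call (requires network access; the directory name is illustrative):
# model_dir = download_model("./out_models/paraformer-cantonese")
```

The returned path could then be used as a model directory, provided the snapshot contains everything the loader expects.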

Inference

Test clip: common_voice_yue_31189594.wav, reference transcript: 睇我几有礼貌去之前讲返声
from funasr import AutoModel
input_path = "/media/wmx/soft1/huggingface_cache/data/speech_asr_commonvoice_cantonese-CHS_trainsets/test/common_voice_yue_31189594.wav"  # test clip from the fine-tuning dataset
model_dir = "/media/wmx/soft1/huggingface_cache/out_models/speech_paraformer-large-vad-punc_asr_nat-zh-cn-16k-common-vocab8404-pytorch-lora"  # local directory of the fine-tuned model
model = AutoModel(model=model_dir)  # load the fine-tuned model
res = model.generate(input=input_path)  # run recognition on the clip
print(res)
Result:
[
{'key': 'common_voice_yue_31189594',
'text': '睇 我 几 有 礼 貌 去 之 前 返 声',
'timestamp': [[1410, 1650], [1730, 1970], [2050, 2270], [2270, 2470], [2470, 2690], [2690, 2930], [3230, 3470], [3550, 3770], [3770, 4010], [4010, 4250], [4270, 4490]]}
]
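
The result holds one space-separated token per timestamp pair, so the two lists can be zipped to see when each character was spoken. The sketch below does that and then compares the recognized text with the reference transcript above. It rests on two assumptions: that the timestamps are in milliseconds, and that the helper names (`align_tokens`, `missing_from_hypothesis`), which are illustrative and not part of FunASR, suit this purpose. The values are copied from the output shown above.

```python
import difflib

# Values copied from the result above.
result = [{
    "key": "common_voice_yue_31189594",
    "text": "睇 我 几 有 礼 貌 去 之 前 返 声",
    "timestamp": [[1410, 1650], [1730, 1970], [2050, 2270], [2270, 2470],
                  [2470, 2690], [2690, 2930], [3230, 3470], [3550, 3770],
                  [3770, 4010], [4010, 4250], [4270, 4490]],
}]
reference = "睇我几有礼貌去之前讲返声"


def align_tokens(entry):
    """Pair each space-separated token with its [start, end] span."""
    tokens = entry["text"].split()
    spans = entry["timestamp"]
    if len(tokens) != len(spans):
        raise ValueError("token count and timestamp count differ")
    return [(tok, start, end) for tok, (start, end) in zip(tokens, spans)]


def missing_from_hypothesis(ref, hyp):
    """Return reference substrings with no exact counterpart in hyp."""
    matcher = difflib.SequenceMatcher(None, ref, hyp, autojunk=False)
    return [ref[i1:i2]
            for tag, i1, i2, _, _ in matcher.get_opcodes()
            if tag in ("delete", "replace")]


aligned = align_tokens(result[0])
for tok, start, end in aligned:
    # Assumption: spans are in milliseconds, so divide by 1000 for seconds.
    print(f"{tok}\t{start / 1000:.2f}s - {end / 1000:.2f}s")

hypothesis = "".join(tok for tok, _, _ in aligned)
print(missing_from_hypothesis(reference, hypothesis))
```

On this clip the comparison reports a single missing character, 讲, which is the only difference between the recognized text and the reference.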