whisper-larger-v3模型，识别结果没有时间戳，10min音频 #1883

ExpressGit · 2024-07-06T02:42:17Z

10分钟的音频依然没有时间戳
模型；whisper-large-v3

❓ Questions and Help

from funasr import AutoModel

model = AutoModel(
model="iic/Whisper-large-v3",
vad_model="iic/speech_fsmn_vad_zh-cn-16k-common-pytorch",
vad_kwargs={"max_single_segment_time": 30000},
)

DecodingOptions = {
"task": "transcribe",
"language": None,
"without_timestamps": False,
}

res = model.generate(
DecodingOptions=DecodingOptions,
batch_size_s=0,
input="data/ch_multi.wav",
)

print(res)

OS (e.g., Linux): centos7
FunASR Version (e.g., 1.0.0):最新
ModelScope Version (e.g., 1.11.0):1.14.0
PyTorch Version (e.g., 2.0.0):2.1.3
How you installed funasr (pip, source):pip
Python version:3.10
GPU (e.g., V100M32) 3080
CUDA/cuDNN version (e.g., cuda11.7):cuda11.8
Docker version (e.g., funasr-runtime-sdk-cpu-0.4.1)
Any other relevant information:

The text was updated successfully, but these errors were encountered:

ExpressGit added the question Further information is requested label Jul 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

whisper-larger-v3模型，识别结果没有时间戳，10min音频 #1883

whisper-larger-v3模型，识别结果没有时间戳，10min音频 #1883

whisper-larger-v3模型，识别结果没有时间戳，10min音频 #1883

whisper-larger-v3模型，识别结果没有时间戳，10min音频 #1883

Comments

❓ Questions and Help