當前位置：首頁 > 编程资源 > 编程问答 >内容正文

编程问答

whisper 强大且开源的语音转文字

發布時間：2024/1/18 编程问答 33 豆豆

生活随笔收集整理的這篇文章主要介紹了 whisper 强大且开源的语音转文字小編覺得挺不錯的,現在分享給大家,幫大家做個參考.

說起來語音轉換文字，openai旗下的whisper很是好用，推理也很快，同時支持cpu和GPU。

GitHub：GitHub - openai/whisper: Robust Speech Recognition via Large-Scale Weak Supervision

相關的參數和內存使用如下：

SizeParametersEnglish-only modelMultilingual modelRequired VRAMRelative speed

tiny	39 M	tiny.en	tiny	~1 GB	~32x
base	74 M	base.en	base	~1 GB	~16x
small	244 M	small.en	small	~2 GB	~6x
medium	769 M	medium.en	medium	~5 GB	~2x
large	1550 M	N/A	large	~10 GB	1x

CPU推理會慢一些，一般機器使用small模型即可，該模型內存占用不是很高

1.安裝

a.直接通過pip安裝?

pip install -U openai-whisper pip install setuptools-rust

b.通過git倉庫安裝

pip install git+https://github.com/openai/whisper.git

c.將安裝包升級到倉庫最新版

pip install --upgrade --no-deps --force-reinstall git+https://github.com/openai/whisper.git

d.安裝 ffmpeg，本次系統是centos8stream，可以通過下面命令安裝

dnf install -y https://download1.rpmfusion.org/free/el/rpmfusion-free-release-8.noarch.rpm dnf install -y install http://rpmfind.net/linux/epel/7/x86_64/Packages/s/SDL2-2.0.14-2.el7.x86_64.rpm dnf install ffmpeg -y

其他系統可參考如下：

# on Ubuntu or Debian sudo apt update && sudo apt install ffmpeg# on Arch Linux sudo pacman -S ffmpeg# on MacOS using Homebrew (https://brew.sh/) brew install ffmpeg# on Windows using Chocolatey (https://chocolatey.org/) choco install ffmpeg# on Windows using Scoop (https://scoop.sh/) scoop install ffmpeg

2.使用

可以通過Python進行下面操作

import whispermodel = whisper.load_model("small") # 如果模型不存在，會自動下載，默認下載路徑 "~/.cache/whisper" result = model.transcribe("temp.wav") print(result["text"])

總結

以上是生活随笔為你收集整理的whisper 强大且开源的语音转文字的全部內容，希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網站內容還不錯，歡迎將生活随笔推薦給好友。

上一篇：【已阅】日志与时间戳，客户端与服务器端，
下一篇： ResNet（一）相关概念