zoukankan      html  css  js  c++  java
  • 痞子衡嵌入式:语音处理工具pzh-speech诞生记(1)- 环境搭建(Python2.7.14 + PyAudio0.2.11 + Matplotlib2.2.3 + SpeechRecognition3.8.1 + pyttsx3 2.7)


      大家好,我是痞子衡,是正经搞技术的痞子。今天痞子衡给大家介绍的是语音处理工具pzh-py-speech诞生之环境搭建

      在写pzh-py-speech时需要先搭好开发环境,下表列出了开发过程中会用到的所有软件/工具包:

    一、涉及工具列表

    工具 功能 下载地址
    Python 2.7.14 Python官方包(解释器) https://www.python.org/
    PyAudio 0.2.11 跨平台开源Audio I/O库 PortAudio 的Python封装 http://people.csail.mit.edu/hubert/pyaudio/
    Matplotlib 2.2.3 一款非常强大的Python 2D绘图库 https://matplotlib.org/
    https://github.com/matplotlib/matplotlib
    NumPy 1.15.0 基础Python科学计算包 http://www.numpy.org/
    https://www.scipy.org/
    SpeechRecognition 3.8.1 一款支持多引擎的Python语音识别(ASR)库 https://github.com/Uberi/speech_recognition
    PocketSphinx 0.1.15 卡内基-梅隆大学开源语音识别引擎 CMU Sphinx 的Python封装 https://github.com/bambocher/pocketsphinx-python
    https://pypi.org/project/pocketsphinx/
    pyttsx3 2.7 pyTTS, pyttsx项目的延续之作,一款轻量级的Python文语合成引擎 https://github.com/nateshmbhat/pyttsx3
    https://pypi.org/project/pyttsx3/
    eSpeak 1.48.04 一款开源的TTS,可将转换结果保存为wav http://espeak.sourceforge.net/
    wxPython 4.0.3 跨平台开源GUI库 wxWidgets 的Python封装库 https://www.wxpython.org/
    https://pypi.org/project/wxPython/
    wxFormBuilder 3.8.0 wxPython GUI界面构建工具 https://github.com/wxFormBuilder/wxFormBuilder
    PyCharm Community 2018.02 一款流行的Python集成开发环境 http://www.jetbrains.com/pycharm/

    二、基础环境搭建(Python + PyAudio + Matplotlib + NumPy)

      pzh-py-speech工具是一个完全基于Python语言开发的应用软件,首先安装好Python 2.7.14,痞子衡的安装目录为C: ools_mcuPython27,安装完成后确保系统环境变量里包括该路径(C: ools_mcuPython27),因为该路径下包含python.exe,后续python命令需调用这个python.exe完成的。此外pip是Python的包管理工具,我们可以借助pip来安装PyAudio和Matplotlib包(NumPy含在Matplotlib里):

    PS C: ools_mcuPython27Scripts> .pip.exe install pyaudio

    Collecting pyaudio
     Downloading https://files.pythonhosted.org/packages/94/3e/430d4e4e24e89b19c1df052644f69e03d64c1ae2e83f5a14bd365e0236de/PyAudio-0.2.11-cp27-cp27m-win_amd64.whl (52kB)
    Installing collected packages: pyaudio
    Successfully installed pyaudio-0.2.11
    

    PS C: ools_mcuPython27Scripts> .pip.exe install matplotlib

    Collecting matplotlib
      Downloading https://files.pythonhosted.org/packages/f7/5b/4bc804df462961a3f0d138243611ce24b7899db04e6043e46df0ff1080e9/matplotlib-2.2.3-cp27-cp27m-win_amd64.whl (8.4MB)
    Collecting backports.functools-lru-cache (from matplotlib)
      Downloading https://files.pythonhosted.org/packages/03/8e/2424c0e65c4a066e28f539364deee49b6451f8fcd4f718fefa50cc3dcf48/backports.functools_lru_cache-1.5-py2.py3-none-any.whl
    Requirement already satisfied, skipping upgrade: six>=1.10 in c:	ools_mcupython27libsite-packages (from matplotlib) (1.11.0)
    Collecting pytz (from matplotlib)
      Downloading https://files.pythonhosted.org/packages/30/4e/27c34b62430286c6d59177a0842ed90dc789ce5d1ed740887653b898779a/pytz-2018.5-py2.py3-none-any.whl (510kB)
    Collecting cycler>=0.10 (from matplotlib)
      Downloading https://files.pythonhosted.org/packages/f7/d2/e07d3ebb2bd7af696440ce7e754c59dd546ffe1bbe732c8ab68b9c834e61/cycler-0.10.0-py2.py3-none-any.whl
    Collecting kiwisolver>=1.0.1 (from matplotlib)
      Downloading https://files.pythonhosted.org/packages/e0/3a/2fda27dacdfafcf8f40cce2be09890b1443af3e65c3ab8f7294216a2946b/kiwisolver-1.0.1-cp27-none-win_amd64.whl (64kB)
    Collecting python-dateutil>=2.1 (from matplotlib)
      Downloading https://files.pythonhosted.org/packages/cf/f5/af2b09c957ace60dcfac112b669c45c8c97e32f94aa8b56da4c6d1682825/python_dateutil-2.7.3-py2.py3-none-any.whl (211kB)
    Collecting pyparsing!=2.0.4,!=2.1.2,!=2.1.6,>=2.0.1 (from matplotlib)
      Downloading https://files.pythonhosted.org/packages/6a/8a/718fd7d3458f9fab8e67186b00abdd345b639976bc7fb3ae722e1b026a50/pyparsing-2.2.0-py2.py3-none-any.whl (56kB)
    Collecting numpy>=1.7.1 (from matplotlib)
      Downloading https://files.pythonhosted.org/packages/3d/d6/f04730ad69240be04584b3979dcd2f0b25f9e58463547df6fcafa139c567/numpy-1.15.0-cp27-none-win_amd64.whl (13.5MB)
    Requirement already satisfied, skipping upgrade: setuptools in c:	ools_mcupython27libsite-packages (from kiwisolver>=1.0.1->matplotlib) (28.8.0)
    Installing collected packages: backports.functools-lru-cache, pytz, cycler, kiwisolver, python-dateutil, pyparsing, numpy, matplotlib
    Successfully installed backports.functools-lru-cache-1.5 cycler-0.10.0 kiwisolver-1.0.1 matplotlib-2.2.3 numpy-1.15.0 pyparsing-2.2.0 python-dateutil-2.7.3 pytz-2018.5
    

      有了PyAudio便可以读写Audio,有了Matplotlib便可以将Audio以波形方式图形化显示出来。这两个工具安装完成,JaysPySPEECH工具开发的Python基础环境便搭好了。

    Note: 关于GUI及调试等相关工具(wxPython、wxFormBuilder、PyCharm)的安装详见痞子衡另一个作品 tinyPyCOM的环境搭建

    二、高级环境搭建(SpeechRecognition + PocketSphinx + pyttsx3 + eSpeak)

      上一步主要安装了pzh-py-speech的基础开发环境,用于Audio的录播与显示,但是pzh-py-speech设计之初便考虑支持语音识别、文语转换功能,因为我们还需要进一步安装相关Python库。
      首先安装语音识别库,SpeechRecognition是一款非常流行的支持多引擎的语音识别Python库,痞子衡为pzh-py-speech选用的就是SpeechRecognition,其中语音识别引擎选用的是可以离线工作的PocketSphinx,具体安装如下:

    PS C: ools_mcuPython27Scripts> .pip.exe install SpeechRecognition

    Collecting SpeechRecognition
      Downloading https://files.pythonhosted.org/packages/26/e1/7f5678cd94ec1234269d23756dbdaa4c8cfaed973412f88ae8adf7893a50/SpeechRecognition-3.8.1-py2.py3-none-any.whl (32.8MB)
    Installing collected packages: SpeechRecognition
    Successfully installed SpeechRecognition-3.8.1
    

    PS C: ools_mcuPython27Scripts> python -m pip install --upgrade pip setuptools wheel

    Requirement already up-to-date: pip in c:	ools_mcupython27libsite-packages (18.0)
    Collecting setuptools
      Downloading https://files.pythonhosted.org/packages/66/e8/570bb5ca88a8bcd2a1db9c6246bb66615750663ffaaeada95b04ffe74e12/setuptools-40.2.0-py2.py3-none-any.whl (568kB)
    Collecting wheel
      Downloading https://files.pythonhosted.org/packages/81/30/e935244ca6165187ae8be876b6316ae201b71485538ffac1d718843025a9/wheel-0.31.1-py2.py3-none-any.whl (41kB)
    Installing collected packages: setuptools, wheel
      Found existing installation: setuptools 28.8.0
        Uninstalling setuptools-28.8.0:
          Successfully uninstalled setuptools-28.8.0
    Successfully installed setuptools-40.2.0 wheel-0.31.1
    

    PS C: ools_mcuPython27Scripts> .pip.exe install --upgrade pocketsphinx

    Collecting pocketsphinx
      Downloading https://files.pythonhosted.org/packages/38/d3/192476022e989377ab00cb84fb0b18790e400bbd58e464155c58cb4622f8/pocketsphinx-0.1.15-cp27-cp27m-win_amd64.whl (29.1MB)
    Installing collected packages: pocketsphinx
    Successfully installed pocketsphinx-0.1.15
    

      最后安装文语合成库,pyttsx3是一款超轻量级的文语合成Python库,其是经典的pyTTS、pyttsx项目的延续,其内核为Microsoft Speech API (SAPI5),可离线工作,具体安装如下:

    PS C: ools_mcuPython27Scripts> .pip.exe install pyttsx3

    Collecting pyttsx3
      Downloading https://files.pythonhosted.org/packages/24/4e/580726c73272344d3e74b7aaffae55ff6b6450061fbecb8cc6e112531c02/pyttsx3-2.7.tar.gz
    Requirement already satisfied: pypiwin32 in c:	ools_mcupython27libsite-packages (from pyttsx3) (223)
    Requirement already satisfied: pywin32>=223 in c:	ools_mcupython27libsite-packages (from pypiwin32->pyttsx3) (223)
    Building wheels for collected packages: pyttsx3
      Running setup.py bdist_wheel for pyttsx3 ... done
      Stored in directory: C:Users
    xa07314AppDataLocalpipCachewheelsa28afe11112aca9c89142c3a404bc67ef3393a7ad530da26639a05d4
    Successfully built pyttsx3
    Installing collected packages: pyttsx3
    Successfully installed pyttsx3-2.7
    

      pyttsx3仅能在线发声,无法保存到wav文件,因此我们还需要一个可以保存wav文件的TTS,痞子衡选择了eSpeak,其具体安装详见系列第六篇。到了这里,pzh-py-speech工具开发的Python环境便全部搭好了。

      至此,语音处理工具pzh-py-speech诞生之环境搭建痞子衡便介绍完毕了,掌声在哪里~~~

    欢迎订阅

    文章会同时发布到我的 博客园主页CSDN主页微信公众号 平台上。

    微信搜索"痞子衡嵌入式"或者扫描下面二维码,就可以在手机上第一时间看了哦。

  • 相关阅读:
    [Android Pro] 将你的安卓手机屏幕共享到PC或Mac上
    [Android Pro] 使用apktool工具遇到could not decode arsc file的解决办法
    [Android Pro] Test win
    [Android] 深入浅出Android App耗电量统计
    [Android] adb shell dumpsys的使用
    [Android] adb 命令 dumpsys activity , 用来看 task 中的activity。 (uninstall virus)
    [Android Pro] svn实例
    [Android Pro] 告别编译运行 ---- Android Studio 2.0 Preview发布Instant Run功能
    [Android Pro] 网络流量安全测试工具Nogotofail
    python pandas replace函数
  • 原文地址:https://www.cnblogs.com/henjay724/p/9542690.html
Copyright © 2011-2022 走看看