zoukankan      html  css  js  c++  java
  • ffmpeg & ffmpeg-python 试用(音频)

    before

    ffmpeg -i test.wav

    Input #0, wav, from 'test.wav':
      Duration: 00:00:16.31, bitrate: 256 kb/s
      Stream #0:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 16000 Hz, mono, s16, 256 kb/s
    
    

    ffmpeg -i examples_sample.wav

    Input #0, wav, from 'examples_sample.wav':
      Duration: 00:00:03.42, bitrate: 1572 kb/s
      Stream #0:0: Audio: pcm_s24le ([1][0][0][0] / 0x0001), 65536 Hz, mono, s32 (24 bit), 1572 kb/s
    

    参数简要说明

    -i 设定输入流
    -f 设定输出格式
    -ar 设定采样率
    -ac 设定声音的channel数
    -acodec 设定声音解码器

    相同的输出设置:
    ffmpeg命令行:ffmpeg -i example.wav -acodec pcm_s16le -ar 16000 -ac 1 out.wav
    ffmpeg-python代码:

    import ffmpeg
    input_file='./examples_sample.wav' # 'test.wav'
    (
        ffmpeg
        .input(input_file)
        .output('./testout1.wav',**{'ar':'16000','ac':'1','acodec':'pcm_s16le'})
        .run()
    )
    

    after

    test.wav似乎没什么变化..前后格式一致

    Input #0, wav, from './test.wav':
      Duration: 00:00:16.31, bitrate: 256 kb/s
      Stream #0:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 16000 Hz, mono, s16, 256 kb/s
    Stream mapping:
      Stream #0:0 -> #0:0 (pcm_s16le (native) -> pcm_s16le (native))
    Press [q] to stop, [?] for help
    Output #0, wav, to './testout.wav':
      Metadata:
        ISFT            : Lavf58.76.100
      Stream #0:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 16000 Hz, mono, s16, 256 kb/s
        Metadata:
          encoder         : Lavc58.134.100 pcm_s16le
    size=     510kB time=00:00:16.25 bitrate= 256.9kbits/s speed=1.07e+03x
    video:0kB audio:510kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 0.014943%
    

    examples_sample.wav
    可以观察到采样率有变化了... 文件大小位107kb

    Input #0, wav, from './examples_sample.wav':
      Duration: 00:00:03.42, bitrate: 1572 kb/s
      Stream #0:0: Audio: pcm_s24le ([1][0][0][0] / 0x0001), 65536 Hz, mono, s32 (24 bit), 1572 kb/s
    Stream mapping:
      Stream #0:0 -> #0:0 (pcm_s24le (native) -> pcm_s16le (native))
    Press [q] to stop, [?] for help
    Output #0, wav, to './testout1.wav':
      Metadata:
        ISFT            : Lavf58.76.100
      Stream #0:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 16000 Hz, mono, s16, 256 kb/s
        Metadata:
          encoder         : Lavc58.134.100 pcm_s16le
    size=     107kB time=00:00:03.42 bitrate= 256.3kbits/s speed= 252x
    video:0kB audio:107kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 0.071170%
    

    改一下采样率为18000,可以发现文件大小为120kb,相同时间,更高的采样率,音频更大

    stream = ffmpeg.input(input_file)
    #stream = ffmpeg.hflip(stream)
    stream = ffmpeg.output(stream, './testout2.wav',**{'ar':'18000','ac':'1','acodec':'pcm_s16le'})
    ffmpeg.run(stream)
    

    run

    Input #0, wav, from './examples_sample.wav':
      Duration: 00:00:03.42, bitrate: 1572 kb/s
      Stream #0:0: Audio: pcm_s24le ([1][0][0][0] / 0x0001), 65536 Hz, mono, s32 (24 bit), 1572 kb/s
    Stream mapping:
      Stream #0:0 -> #0:0 (pcm_s24le (native) -> pcm_s16le (native))
    Press [q] to stop, [?] for help
    Output #0, wav, to './testout2.wav':
      Metadata:
        ISFT            : Lavf58.76.100
      Stream #0:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 18000 Hz, mono, s16, 288 kb/s
        Metadata:
          encoder         : Lavc58.134.100 pcm_s16le
    size=     120kB time=00:00:03.42 bitrate= 288.3kbits/s speed= 258x
    video:0kB audio:120kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 0.063262%
    

    上面的内容,主要还是一些知识点整理和个人的一些思考,权当参考,如有错误或者更好的建议,可以在评论区指正,不胜感激!
    可执行代码

  • 相关阅读:
    最好的云备份选项
    不要让你的云备份策略退居次位
    了解区域类型
    Managing WMI security
    创建、导入、导出、复制以及粘贴 WMI 筛选器
    Gpfixup
    centos 7 安装MySQL 5.7.23
    centos 7 配置 mysql 5.7 主从复制
    oracle DG搭建
    undo表空间丢失、损坏
  • 原文地址:https://www.cnblogs.com/Tester_Dolores/p/14846491.html
Copyright © 2011-2022 走看看