Audio 从mp3创建HLS可流化音频文件

Audio 从mp3创建HLS可流化音频文件,audio,ffmpeg,http-live-streaming,Audio,Ffmpeg,Http Live Streaming,我正在使用以下命令为web流创建hls aac音频文件 ffmpeg -y -i song.mp3 -c:a aac -b:a 128k -f hls -hls_time 7 -hls_list_size 0 -hls_segment_filename file%d.m4a playlist.m3u8 此命令仅适用于某些音频文件。对于许多mp3文件,我收到以下输出: C:\ffmpeg>ffmpeg -y -i song.mp3 -c:a aac -b:a 128k -f hls -hl

我正在使用以下命令为web流创建hls aac音频文件

ffmpeg -y -i song.mp3 -c:a aac -b:a 128k -f hls -hls_time 7 -hls_list_size 0 -hls_segment_filename file%d.m4a playlist.m3u8
此命令仅适用于某些音频文件。对于许多mp3文件,我收到以下输出:

C:\ffmpeg>ffmpeg -y -i song.mp3 -c:a aac -b:a 128k -f hls -hls_time 7 -hls_list_size 0 -hls_segment_filename file%d.m4a playlist.m3u8
ffmpeg version git-2020-01-31-62d92a8 Copyright (c) 2000-2020 the FFmpeg developers
  built with gcc 9.2.1 (GCC) 20200122
  configuration: --enable-gpl --enable-version3 --enable-sdl2 --enable-fontconfig --enable-gnutls --enable-iconv --enable-libass --enable-libdav1d --enable-libbluray --enable-libfreetype --enable-libmp3lame --enable-libopencore-amrnb --enable-libopencore-amrwb --enable-libopenjpeg --enable-libopus --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libtheora --enable-libtwolame --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx264 --enable-libx265 --enable-libxml2 --enable-libzimg --enable-lzma --enable-zlib --enable-gmp --enable-libvidstab --enable-libvorbis --enable-libvo-amrwbenc --enable-libmysofa --enable-libspeex --enable-libxvid --enable-libaom --enable-libmfx --enable-ffnvcodec --enable-cuvid --enable-d3d11va --enable-nvenc --enable-nvdec --enable-dxva2 --enable-avisynth --enable-libopenmpt --enable-amf
  libavutil      56. 38.100 / 56. 38.100
  libavcodec     58. 67.100 / 58. 67.100
  libavformat    58. 37.100 / 58. 37.100
  libavdevice    58.  9.103 / 58.  9.103
  libavfilter     7. 72.100 /  7. 72.100
  libswscale      5.  6.100 /  5.  6.100
  libswresample   3.  6.100 /  3.  6.100
  libpostproc    55.  6.100 / 55.  6.100
[mp3 @ 0000027d800babc0] Estimating duration from bitrate, this may be inaccurate
Input #0, mp3, from 'song.mp3':
  Metadata:
    TSS             : Logic Pro 8.0.2
    iTunNORM        :  000000EE 000000ED 00000C34 00001135 000088F0 0000B505 000080FA 00007577 00009B82 00018F49
    iTunSMPB        :  00000000 00000210 00000A07 00000000008783E9 00000000 007AD4E6 00000000 00000000 00000000 00000000 00000000 00000000
    genre           : Rock
    TCM             : Kevin MacLeod
    album           : Funk and Blues
    TKE             : C
    TBP             : 101
    title           : Funkorama
    artist          : Kevin MacLeod
    date            : 2008-06-16 18:35
  Duration: 00:03:21.46, start: 0.000000, bitrate: 325 kb/s
    Stream #0:0: Audio: mp3, 44100 Hz, stereo, fltp, 320 kb/s
    Stream #0:1: Video: mjpeg (Baseline), yuvj444p(pc, bt470bg/unknown/unknown), 400x400 [SAR 72:72 DAR 1:1], 90k tbr, 90k tbn, 90k tbc (attached pic)
    Metadata:
      comment         : Other
Stream mapping:
  Stream #0:1 -> #0:0 (mjpeg (native) -> h264 (libx264))
  Stream #0:0 -> #0:1 (mp3 (mp3float) -> aac (native))
Press [q] to stop, [?] for help
[hls @ 0000027d80100c40] Frame rate very high for a muxer not efficiently supporting it.
Please consider specifying a lower framerate, a different muxer or -vsync 2
[libx264 @ 0000027d800c1280] using SAR=1/1
[libx264 @ 0000027d800c1280] MB rate (56250000) > level limit (16711680)
[libx264 @ 0000027d800c1280] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2
[libx264 @ 0000027d800c1280] profile High 4:4:4 Predictive, level 6.2, 4:4:4, 8-bit
[libx264 @ 0000027d800c1280] 264 - core 159 - H.264/MPEG-4 AVC codec - Copyleft 2003-2019 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=4 threads=12 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00
Output #0, hls, to 'playlist.m3u8':
  Metadata:
    TSS             : Logic Pro 8.0.2
    iTunNORM        :  000000EE 000000ED 00000C34 00001135 000088F0 0000B505 000080FA 00007577 00009B82 00018F49
    iTunSMPB        :  00000000 00000210 00000A07 00000000008783E9 00000000 007AD4E6 00000000 00000000 00000000 00000000 00000000 00000000
    genre           : Rock
    TCM             : Kevin MacLeod
    album           : Funk and Blues
    TKE             : C
    TBP             : 101
    title           : Funkorama
    artist          : Kevin MacLeod
    date            : 2008-06-16 18:35
    encoder         : Lavf58.37.100
    Stream #0:0: Video: h264 (libx264), yuvj444p(pc, progressive), 400x400 [SAR 72:72 DAR 1:1], q=-1--1, 90k fps, 90k tbn, 90k tbc (attached pic)
    Metadata:
      comment         : Other
      encoder         : Lavc58.67.100 libx264
    Side data:
      cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A
    Stream #0:1: Audio: aac (LC), 44100 Hz, stereo, fltp, 128 kb/s
    Metadata:
      encoder         : Lavc58.67.100 aac
[mp3float @ 0000027d80146580] overread, skip -7 enddists: -6 -6 speed=68.6x
[mp3float @ 0000027d80146580] overread, skip -6 enddists: -5 -5
[mp3float @ 0000027d80146580] overread, skip -6 enddists: -4 -4
    Last message repeated 2 times
[mp3float @ 0000027d80146580] overread, skip -7 enddists: -6 -6
    Last message repeated 2 times
[mp3float @ 0000027d80146580] overread, skip -5 enddists: -2 -2
[mp3float @ 0000027d80146580] overread, skip -7 enddists: -6 -6
[mp3float @ 0000027d80146580] overread, skip -6 enddists: -4 -4
    Last message repeated 1 times
[mp3float @ 0000027d80146580] overread, skip -7 enddists: -6 -6
    Last message repeated 1 times
[mp3float @ 0000027d80146580] overread, skip -6 enddists: -4 -4
[mp3float @ 0000027d80146580] overread, skip -5 enddists: -3 -3
[mp3float @ 0000027d80146580] overread, skip -6 enddists: -4 -4
[mp3float @ 0000027d80146580] overread, skip -7 enddists: -6 -6
    Last message repeated 2 times
[mp3float @ 0000027d80146580] overread, skip -5 enddists: -4 -4
[hls @ 0000027d80100c40] Opening 'file0.m4a' for writingate=N/A speed=64.1x
[hls @ 0000027d80100c40] Opening 'playlist.m3u8.tmp' for writing
frame=    1 fps=0.3 q=33.0 Lsize=N/A time=00:03:21.45 bitrate=N/A speed=63.7x
video:7kB audio:3209kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: unknown
[libx264 @ 0000027d800c1280] frame I:1     Avg QP:34.64  size:  6567
[libx264 @ 0000027d800c1280] mb I  I16..4: 19.5% 53.0% 27.5%
[libx264 @ 0000027d800c1280] 8x8 transform intra:53.0%
[libx264 @ 0000027d800c1280] coded y,u,v intra: 46.8% 26.1% 15.3%
[libx264 @ 0000027d800c1280] i16 v,h,dc,p: 38% 39%  9% 14%
[libx264 @ 0000027d800c1280] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 21% 14% 26%  8%  5%  6%  5%  7%  7%
[libx264 @ 0000027d800c1280] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 42% 16% 14%  7%  4%  5%  3%  4%  4%
[libx264 @ 0000027d800c1280] kb/s:4728240.00
[aac @ 0000027d800bcc40] Qavg: 2138.508
请注意“mp3float overread”消息

它会生成一个
file0.m4a
文件,而不会按照规定每7秒将其拆分一次。 这是我试图转换为aac hls流的示例音频文件,导致上述问题:


如何使用ffmpeg将音频文件转换为网络友好的hls流?

您正在使用选项
-hls\u list\u size 0
,该选项生成一个容器文件

我在所有HLS视频编码命令中使用
-muxdelay 0-f段-sc_阈值0-段时间15-段列表“playlist.m3u8”-段格式mpegts“文件%d.ts”

要将其放入工作命令中,请执行以下操作:

ffmpeg -y -i "song.mp3" -c:a aac -b:a 128k -muxdelay 0 -f segment -sc_threshold 0 -segment_time 7 -segment_list "playlist.m3u8" -segment_format mpegts "file%d.m4a"

你为什么要这么做?你可以直接播放MP3。。。它是“网络友好”和可流化的。这里根本不需要HLS。如果你想给你的用户一个流畅的聆听体验,你可以对音频文件进行重新编码并分块传送。这几乎是每个音频流网站都在做的事情。我向你保证这是没有必要的。您需要HLS的唯一时间是,如果您正在以多比特率进行流式传输,或者如果您希望在没有专用服务器的情况下进行实时流式传输。浏览器完全能够在需要时从静态音频或视频文件中请求所需内容。此外,您正在失去另一代的质量,因为您从有损源开始编码。你最好直接提供MP3,然后你就完全跳过HLS的开销和兼容性问题。这很有趣。我实际上正在考虑跳过HLS部分。但是,我可能仍然希望将音频文件重新编码为较低的比特率aac,以最小化文件大小。该服务允许上传任何类型的音频文件(有损/无损),并使其可用于桌面或移动设备上的流媒体。即使没有-hls_list_size标志,它仍然会为某些音频文件创建一个单独的文件。您的示例似乎可行,但不包括-f hls命令。这仍然是HLS流吗?我使用-f段标志,基本上是一样的。