Python 现场录音中的延迟
我是使用python进行音频处理的新手。我从互联网上获得了以下python代码来执行实时音频录制。每当我使用下面的代码录制时,在录制的音频文件的开头都会出现一个零块。录制不会立即开始,它会在录制开始前添加零 我希望在开始运行代码时不带任何零的情况下立即进行录制。感谢您的帮助Python 现场录音中的延迟,python,numpy,audio,python-sounddevice,Python,Numpy,Audio,Python Sounddevice,我是使用python进行音频处理的新手。我从互联网上获得了以下python代码来执行实时音频录制。每当我使用下面的代码录制时,在录制的音频文件的开头都会出现一个零块。录制不会立即开始,它会在录制开始前添加零 我希望在开始运行代码时不带任何零的情况下立即进行录制。感谢您的帮助 import argparse import tempfile import queue import sys def int_or_str(text): """Helper function for argum
import argparse
import tempfile
import queue
import sys
def int_or_str(text):
"""Helper function for argument parsing."""
try:
return int(text)
except ValueError:
return text
parser = argparse.ArgumentParser(description=__doc__)
parser.add_argument(
'-l', '--list-devices', action='store_true',
help='show list of audio devices and exit')
parser.add_argument(
'-d', '--device', type=int_or_str,
help='input device (numeric ID or substring)')
parser.add_argument(
'-r', '--samplerate', type=int, help='sampling rate')
parser.add_argument(
'-c', '--channels', type=int, default=1, help='number of input channels')
parser.add_argument(
'filename', nargs='?', metavar='FILENAME',
help='audio file to store recording to')
parser.add_argument(
'-t', '--subtype', type=str, help='sound file subtype (e.g. "PCM_24")')
args = parser.parse_args()
try:
import sounddevice as sd
import soundfile as sf
import numpy # Make sure NumPy is loaded before it is used in the callback
assert numpy # avoid "imported but unused" message (W0611)
if args.list_devices:
print(sd.query_devices())
parser.exit(0)
if args.samplerate is None:
device_info = sd.query_devices(args.device, 'input')
# soundfile expects an int, sounddevice provides a float:
args.samplerate = int(device_info['default_samplerate'])
if args.filename is None:
args.filename = tempfile.mktemp(prefix='delme_rec_unlimited_',
suffix='.wav', dir='')
q = queue.Queue()
def callback(indata, frames, time, status):
"""This is called (from a separate thread) for each audio block."""
if status:
print(status, file=sys.stderr)
q.put(indata.copy())
# Make sure the file is opened before recording anything:
with sf.SoundFile(args.filename, mode='x', samplerate=args.samplerate,
channels=args.channels, subtype=args.subtype) as file:
with sd.InputStream(samplerate=args.samplerate, device=args.device,
channels=args.channels, callback=callback):
print('#' * 80)
print('press Ctrl+C to stop the recording')
print('#' * 80)
while True:
file.write(q.get())
except KeyboardInterrupt:
print('\nRecording finished: ' + repr(args.filename))
parser.exit(0)
except Exception as e:
parser.exit(type(e).__name__ + ': ' + str(e))
您使用的是什么操作系统和主机API?它适用于我在Linux/ALSA上没有零。@Matthias我使用的是windows 10/WASAPI。但是我不确定主机API。我怎么能找到呢?WASAPI是主机API!您可以在设备列表中每一行的末尾找到主机API。我认为您观察到的零是由底层PortAudio库引起的。您应该在PortAudio邮件列表中询问您的问题:。感谢您的建议@马蒂亚斯