- 专利标题: System and method for producing metadata of an audio signal
-
申请号: US17064986申请日: 2020-10-07
-
公开(公告)号: US11756551B2公开(公告)日: 2023-09-12
- 发明人: Niko Moritz , Gordon Wichern , Takaaki Hori , Jonathan Le Roux
- 申请人: Mitsubishi Electric Research Laboratories, Inc.
- 申请人地址: US MA Cambridge
- 专利权人: Mitsubishi Electric Research Laboratories, Inc.
- 当前专利权人: Mitsubishi Electric Research Laboratories, Inc.
- 当前专利权人地址: US MA Cambridge
- 代理商 Gennadiy Vinokur; Hironori Tsukamoto
- 主分类号: G10L15/26
- IPC分类号: G10L15/26 ; G10L15/16
摘要:
An audio processing system is provided. The audio processing system comprises an input interface configured to accept an audio signal. Further, the audio processing system comprises a memory configured to store a neural network trained to determine different types of attributes of multiple concurrent audio events of different origins, wherein the types of attributes include time-dependent and time-agnostic attributes of speech and non-speech audio events. Further, the audio processing system comprises a processor configured to process the audio signal with the neural network to produce metadata of the audio signal, the metadata including one or multiple attributes of one or multiple audio events in the audio signal.
公开/授权文献
信息查询