System and method for producing metadata of an audio signal

发明授权

US11756551B2 System and method for producing metadata of an audio signal 有权

请登陆查看更多内容

专利标题： System and method for producing metadata of an audio signal
申请号： US17064986

申请日： 2020-10-07
公开(公告)号： US11756551B2

公开(公告)日： 2023-09-12
发明人: Niko Moritz , Gordon Wichern , Takaaki Hori , Jonathan Le Roux
申请人： Mitsubishi Electric Research Laboratories, Inc.
申请人地址： US MA Cambridge
专利权人： Mitsubishi Electric Research Laboratories, Inc.
当前专利权人： Mitsubishi Electric Research Laboratories, Inc.
当前专利权人地址： US MA Cambridge
代理商 Gennadiy Vinokur; Hironori Tsukamoto
主分类号： G10L15/26
IPC分类号： G10L15/26 ; G10L15/16

System and method for producing metadata of an audio signal

摘要：

An audio processing system is provided. The audio processing system comprises an input interface configured to accept an audio signal. Further, the audio processing system comprises a memory configured to store a neural network trained to determine different types of attributes of multiple concurrent audio events of different origins, wherein the types of attributes include time-dependent and time-agnostic attributes of speech and non-speech audio events. Further, the audio processing system comprises a processor configured to process the audio signal with the neural network to produce metadata of the audio signal, the metadata including one or multiple attributes of one or multiple audio events in the audio signal.

公开/授权文献

US20220108698A1 System and Method for Producing Metadata of an Audio Signal 公开/授权日：2022-04-07

信息查询

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/26	.语音—正文识别系统（G10L15/08优先）