Patent search: cpc:"G10L21/0308", page 1

1.

Patent application
INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND RECORDING MEDIUM [Pending, published]

Publication No.: US20190198036A1

Publication date: 2019-06-27

Application No.: US16326956

Filing date: 2017-06-01

Applicant: Sony Corporation

Inventors: Keiichi Osako, Yuhki Mitsufuji, Kohei Asada

IPC classification: G10L21/028, G10L25/30, G06N20/00, G06N3/08, G06F17/16

CPC classification: G10L21/028, G06F17/16, G06N3/08, G06N20/00, G10L21/0308, G10L25/30

Abstract: [Object] To provide a sound source separation technology capable of improving the separation performance. [Solution] An information processing apparatus including: an acquisition section configured to acquire an observation signal obtained by observing a sound; and a sound source separation section configured to separate the observation signal acquired by the acquisition section into a plurality of separated signals corresponding to a plurality of assumed sound sources by applying a non-linear function to a matrix product of an input vector and a coefficient vector corresponding to each of the plurality of sound sources.

2.

Patent application
SOUND SOURCE SEPARATION METHOD AND SOUND SOURCE SEPARATION APPARATUS [Pending, published]

Publication No.: US20190115043A1

Publication date: 2019-04-18

Application No.: US16118986

Filing date: 2018-08-31

Applicant: HITACHI, LTD.

Inventors: Rintaro IKESHITA, Yohei KAWAGUCHI

IPC classification: G10L21/0308, G10L21/0388

CPC classification: G10L21/0308, G10L21/0388

Abstract: There is provided a sound source separation method of carrying out sound source separation of an audio signal inputted from an input device by using a modeled sound source distribution, by an information processing apparatus provided with a processing device, a storage device, the input device, and an output device. In this method, the model follows these conditions: sound sources are independent of one another, powers which the sound sources have are modeled for each of frequency bands obtained through band division, a relationship among the powers for the frequency bands different from each other is modeled by nonnegative matrix factorization, and components obtained through the division of the sound source follow a complex normal distribution.

3.

Granted patent
Real-time audio source separation using deep neural networks [In force]

Publication No.: US10014002B2

Publication date: 2018-07-03

Application No.: US15792566

Filing date: 2017-10-24

Applicant: Red Pill VR, Inc.

Inventors: Alejandro Koretzky, Karthiek Reddy Bokka, Naveen Sasalu Rajashekharappa

IPC classification: H04R5/00, G10L25/18, G10L25/30, G10L21/028, G06F3/16

CPC classification: G10L25/18, G06F3/165, G10L21/0272, G10L21/028, G10L21/0308, G10L25/30

Abstract: Methods and systems for audio source separation in real-time are described. In an embodiment, the present disclosure describes reading and decoding an audio source into Pulse Code Modulation (PCM) samples, fragmenting the PCM samples into fragments, transforming fragments into spectrograms, performing audio source separation using a deep neural network (DNN) to generate an estimated magnitude spectrogram of the component(s) of the audio source, reconstructing the estimated time domain component signals, and streaming the component signals to a playback engine. In an embodiment, a semantic equalizer graphical user interface allows for real-time mixing of individual component signals.

4.

Granted patent
Microphone signal fusion [In force]

Publication No.: US09961443B2

Publication date: 2018-05-01

Application No.: US15213203

Filing date: 2016-07-18

Applicant: Knowles Electronics, LLC

Inventors: Kuan-Chieh Yen, Thomas E. Miller, Mushtaq Syed

IPC classification: H04B15/00, H04R3/00, G10L21/0216, G10L21/0232, G10L21/0308, H04R1/10, H04R1/40

CPC classification: H04R3/005, G10L21/0216, G10L21/0232, G10L21/0308, G10L2021/02165, G10L2021/02166, H04R1/1016, H04R1/1041, H04R1/1083, H04R1/406, H04R2201/107, H04R2225/43, H04R2410/05, H04R2420/07, H04R2430/03, H04R2460/13, H04R2499/11

Abstract: Provided are systems and methods for microphone signal fusion. An example method commences with receiving a first and second signal representing sounds captured, respectively, by external and internal microphones. The internal microphone is located inside an ear canal and sealed for isolation from outside acoustic signals. The external microphone is located outside the ear canal. The first signal comprises a voice component. The second signal comprises a voice component modified by at least human tissue. The first and second signals are processed to obtain noise estimates. The voice component of the second signal is aligned with the voice component of the first signal. The first signal and the aligned voice component of the second signal are blended, based on the noise estimates, to generate an enhanced voice signal. Prior to aligning, the voice component of the second signal may be processed to emphasize high frequency content, improving effective alignment bandwidth.

5.

Patent application
ANALYZING CHANGES IN VOCAL POWER WITHIN MUSIC CONTENT USING FREQUENCY SPECTRUMS [In force]

Publication No.: US20170372724A1

Publication date: 2017-12-28

Application No.: US15331651

Filing date: 2016-10-21

Applicant: Microsoft Technology Licensing, LLC

Inventors: Stewart Paul Tootill, Kevin Lingley, David Niall Coghlan, Michal Vine, Linden Vongsathorn

IPC classification: G10L25/18, G10L21/0308, G10L25/27, G10L25/51

CPC classification: G10L25/18, G10H1/00, G10H1/125, G10H2210/051, G10H2210/061, G10H2250/235, G10H2250/455, G10L21/028, G10L21/0308, G10L25/27, G10L25/51

Abstract: Technologies are described for identifying familiar or interesting parts of music content by analyzing changes in vocal power using frequency spectrums. For example, a frequency spectrum can be generated from digitized audio. Using the frequency spectrum, the harmonic content and percussive content can be separated. The vocal content can then be separated from the harmonic and/or percussive content. The vocal content can then be processed to identify surge points in the digitized audio. In some implementations, the vocal content is included in the harmonic content during the separation procedure and is then separated from the harmonic content.

6.

Granted patent
Audio object extraction with sub-band object probability estimation [In force]

Publication No.: US09820077B2

Publication date: 2017-11-14

Application No.: US15328631

Filing date: 2015-07-23

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Lianwu Chen, Lie Lu

IPC classification: H04S7/00, H04S3/00, G10L21/038

CPC classification: H04S7/302, G10L19/008, G10L21/0308, G10L21/038, H04S3/008, H04S2400/01, H04S2400/11, H04S2400/13, H04S2420/07

Abstract: Example embodiments relate to audio object extraction. A method for audio object extraction from audio content is disclosed. The method comprises determining a sub-band object probability for a sub-band of the audio signal in a frame of the audio content, the sub-band object probability indicating a probability of the sub-band of the audio signal containing an audio object. The method further comprises splitting the sub-band of the audio signal into an audio object portion and a residual audio portion based on the determined sub-band object probability. A corresponding system and computer program product are also disclosed.

7.

Granted patent
Method and apparatus for detecting a voice activity in an input audio signal [In force]

Publication No.: US09761246B2

Publication date: 2017-09-12

Application No.: US15157424

Filing date: 2016-05-18

Applicant: HUAWEI TECHNOLOGIES CO.,LTD.

Inventors: Zhe Wang

IPC classification: G10L15/20, G10L21/0308, G10L25/78, G10L19/02, G10L19/22, G10L25/84

CPC classification: G10L21/0308, G10L15/20, G10L19/0208, G10L19/22, G10L25/78, G10L25/84, G10L2025/786

Abstract: The disclosure provides a method and an apparatus for detecting a voice activity in an input audio signal composed of frames. A noise attribute of the input signal is determined based on a received frame of the input audio signal. A voice activity detection (VAD) parameter is derived based on the noise attribute of the input audio signal using an adaptive function. The derived VAD parameter is compared with a threshold value to provide a voice activity detection decision. The input audio signal is processed according to the voice activity detection decision.

8.

Granted patent
Non-negative matrix factorization regularized by recurrent neural networks for audio processing [In force]

Publication No.: US09721202B2

Publication date: 2017-08-01

Application No.: US14186832

Filing date: 2014-02-21

Applicant: Adobe Systems Incorporated

Inventors: Nicolas Maurice Boulanger-Lewandowski, Gautham J. Mysore, Matthew Douglas Hoffman

IPC classification: G10L21/0272, G10L21/028, G10L21/0308, G06N3/04

CPC classification: G06N3/0445, G10L21/0272, G10L21/028, G10L21/0308

Abstract: Sound processing techniques using recurrent neural networks are described. In one or more implementations, temporal dependencies are captured in sound data that are modeled through use of a recurrent neural network (RNN). The captured temporal dependencies are employed as part of feature extraction performed using nonnegative matrix factorization (NMF). One or more sound processing techniques are performed on the sound data based at least in part on the feature extraction.

9.

Patent application
SOUND SIGNAL PROCESSING DEVICE, SOUND SIGNAL PROCESSING METHOD, AND PROGRAM [In force]

Publication No.: US20170047079A1

Publication date: 2017-02-16

Application No.: US15118239

Filing date: 2015-01-27

Applicant: SONY CORPORATION

Inventors: ATSUO HIROE

IPC classification: G10L21/028, G10L25/93, G10L21/0388

CPC classification: G10L21/028, G01S3/8006, G01S3/8083, G10L15/04, G10L21/0308, G10L21/0388, G10L25/78, G10L25/93, G10L2021/02166

Abstract: A device and a method for determining a speech segment with a high degree of accuracy from a sound signal in which different sounds coexist are provided. Directional points indicating the direction of arrival of the sound signal are connected in the temporal direction, and a speech segment is detected. In this configuration, pattern classification is performed in accordance with directional characteristics with respect to the direction of arrival, and a directionality pattern and a null beam pattern are generated from the classification results. An average null beam pattern is also generated by calculating the average of the null beam patterns at a time when a non-speech-like signal is input. Further, a threshold that is set at a slightly lower value than the average null beam pattern is calculated as the threshold to be used in detecting the local minimum point corresponding to the direction of arrival from each null beam pattern, and a local minimum point equal to or lower than the threshold is determined to be the point corresponding to the direction of arrival.

10.

Granted patent
Microphone signal fusion [In force]

Publication No.: US09401158B1

Publication date: 2016-07-26

Application No.: US14853947

Filing date: 2015-09-14

Applicant: Knowles Electronics LLC

Inventors: Kuan-Chieh Yen, Thomas Miller, Mushtaq Syed

IPC classification: H04R3/00, G10L21/02, G10L21/0232, G10L21/0208

CPC classification: H04R3/005, G10L21/0216, G10L21/0232, G10L21/0308, G10L2021/02165, G10L2021/02166, H04R1/1016, H04R1/1041, H04R1/1083, H04R1/406, H04R2201/107, H04R2225/43, H04R2410/05, H04R2420/07, H04R2430/03, H04R2460/13, H04R2499/11

Abstract: Provided are systems and methods for microphone signal fusion. An example method commences with receiving a first and second signal representing sounds captured, respectively, by internal and external microphones. The second signal includes at least a voice component. The first signal and the voice component are modified by at least human tissue. The first and second signals are processed to obtain noise estimates. The first signal is aligned with the second signal. The second signal and the aligned first signal are blended based on the noise estimates to generate an enhanced voice signal. The internal microphone is located inside an ear canal and sealed for isolation from acoustic signals outside the ear canal. The external microphone is located outside the ear canal. All or parts of the processing, blending, and aligning of the systems and methods may be performed on a subband basis in the frequency domain.
