-
Publication No.: US09607627B2
Publication date: 2017-03-28
Application No.: US14614793
Filing date: 2015-02-05
IPC classification: H04B3/20, G10L21/02, G10L21/0208, G10L21/0216, G10L21/0232
CPC classification: G10L21/0208, G10L21/0216, G10L21/0232, G10L2021/02082
Abstract: Sound enhancement techniques through dereverberation are described. In one or more implementations, a method is described of enhancing sound data through removal of reverberation from the sound data by one or more computing devices. The method includes obtaining a model that describes primary sound data that is to be utilized as a prior that assumes no prior knowledge about specifics of the sound data from which the reverberation is to be removed. A reverberation kernel is computed having parameters that, when applied to the model that describes the primary sound data, corresponds to the sound data from which the reverberation is to be removed. The reverberation is removed from the sound data using the reverberation kernel.
-
Publication No.: US09437208B2
Publication date: 2016-09-06
Application No.: US13908904
Filing date: 2013-06-03
Inventors: Dennis L. Sun, Gautham J. Mysore
CPC classification: G10L21/0208, G06K9/6239, G10L15/00, G10L25/27, G10L25/51, G10L25/81, G10L25/84
Abstract: Sound decomposition models are described. In one or more implementations, a plurality of individual models is generated for respective ones of a plurality of sound sources. The plurality of models is collected to form a universal audio model that is configured to support sound decomposition of sound data through use of one or more of the models. The plurality of models is not generated using a sound source that originated at least a portion of the sound data.
-
Publication No.: US09201580B2
Publication date: 2015-12-01
Application No.: US13675807
Filing date: 2012-11-13
IPC classification: G06F17/00, G06F3/0484, H04S7/00
CPC classification: G06F3/04847, H04S7/30, H04S2400/15
Abstract: Sound alignment user interface techniques are described. In one or more implementations, a user interface is output having a first representation of sound data generated from a first sound signal and a second representation of sound data generated from a second sound signal. One or more inputs are received, via interaction with the user interface, that indicate that a first point in time in the first representation corresponds to a second point in time in the second representation. Aligned sound data is generated from the sound data from the first and second sound signals based at least in part on correspondence of the first point in time in the sound data generated from the first sound signal to the second point in time in the sound data generated from the second sound signal.
-
Publication No.: US20140133675A1
Publication date: 2014-05-15
Application No.: US13675844
Filing date: 2012-11-13
IPC classification: H04R3/12
CPC classification: H04R3/005
Abstract: Time interval sound alignment techniques are described. In one or more implementations, one or more inputs are received via interaction with a user interface that indicate that a first time interval in a first representation of sound data generated from a first sound signal corresponds to a second time interval in a second representation of sound data generated from a second sound signal. A stretch value is calculated based on an amount of time represented in the first and second time intervals, respectively. Aligned sound data is generated from the sound data for the first and second time intervals based on the calculated stretch value.
-
Publication No.: US10176818B2
Publication date: 2019-01-08
Application No.: US14081479
Filing date: 2013-11-15
Abstract: Sound processing using a product-of-filters model is described. In one or more implementations, a model is formed by one or more computing devices for a time frame of sound data as a product of filters. The model is utilized by the one or more computing devices to perform one or more sound processing techniques on the time frame of the sound data.
-
Publication No.: US10002622B2
Publication date: 2018-06-19
Application No.: US14085650
Filing date: 2013-11-20
Inventors: Minje Kim, Paris Smaragdis, Gautham J. Mysore
CPC classification: G10L25/51, G06K9/6244, G06K2009/4695, G10L25/18, G10L25/27
Abstract: Pattern identification using convolution is described. In one or more implementations, a representation of a pattern is obtained that is described using data points that include frequency coordinates, time coordinates, and energy values. An identification is made as to whether sound data described using irregularly positioned data points includes the pattern, the identifying including use of a convolution of the frequency or time coordinates to determine correspondence with the representation of the pattern.
-
Publication No.: US09721202B2
Publication date: 2017-08-01
Application No.: US14186832
Filing date: 2014-02-21
IPC classification: G10L21/0272, G10L21/028, G10L21/0308, G06N3/04
CPC classification: G06N3/0445, G10L21/0272, G10L21/028, G10L21/0308
Abstract: Sound processing techniques using recurrent neural networks are described. In one or more implementations, temporal dependencies are captured in sound data that are modeled through use of a recurrent neural network (RNN). The captured temporal dependencies are employed as part of feature extraction performed using nonnegative matrix factorization (NMF). One or more sound processing techniques are performed on the sound data based at least in part on the feature extraction.
-
Publication No.: US20150142450A1
Publication date: 2015-05-21
Application No.: US14081479
Filing date: 2013-11-15
IPC classification: G10L19/26
Abstract: Sound processing using a product-of-filters model is described. In one or more implementations, a model is formed by one or more computing devices for a time frame of sound data as a product of filters. The model is utilized by the one or more computing devices to perform one or more sound processing techniques on the time frame of the sound data.
-
Publication No.: US20140358534A1
Publication date: 2014-12-04
Application No.: US13908904
Filing date: 2013-06-03
Inventors: Dennis L. Sun, Gautham J. Mysore
IPC classification: G10L21/0208
CPC classification: G10L21/0208, G06K9/6239, G10L15/00, G10L25/27, G10L25/51, G10L25/81, G10L25/84
Abstract: Sound decomposition models are described. In one or more implementations, a plurality of individual models is generated for respective ones of a plurality of sound sources. The plurality of models is collected to form a universal audio model that is configured to support sound decomposition of sound data through use of one or more of the models. The plurality of models is not generated using a sound source that originated at least a portion of the sound data.
-
Publication No.: US20140142947A1
Publication date: 2014-05-22
Application No.: US13681643
Filing date: 2012-11-20
IPC classification: G10L21/043
CPC classification: G10L21/043
Abstract: Sound rate modification techniques are described. In one or more implementations, an indication is received of an amount that a rate of output of sound data is to be modified. One or more sound rate rules are applied to the sound data that, along with the received indication, are usable to calculate different rates at which different portions of the sound data are to be modified, respectively. The sound data is then output such that the calculated rates are applied.