-
Publication No.: US20240038256A1
Publication date: 2024-02-01
Application No.: US17878659
Filing date: 2022-08-01
Inventors: NICOLAS TSINGOS, SCOTT LEVINE
IPC classification: G10L21/0316, G10L25/21
CPC classification: G10L21/0316, G10L25/21
Abstract: Some implementations of the disclosure relate to a non-transitory computer-readable medium having executable instructions stored thereon that, when executed by a processor, cause a system to perform operations comprising: obtaining a first energy-based target for audio; obtaining a first version of a sound mix including one or more audio components; computing, for each audio frame of multiple audio frames of each of the one or more audio components, a first audio feature measurement value; optimizing, based at least on the first energy-based target and the first audio feature measurement values, gain values of the audio frames; and after optimizing the gain values, applying the gain values to the first version of the sound mix to obtain a second version of the sound mix.
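The measure-then-gain flow in this abstract can be illustrated with a minimal sketch. It assumes the audio feature is per-frame mean-square energy and replaces the optimization step with a simple closed-form gain per frame; the function names, the `max_gain` cap, and the non-overlapping framing are hypothetical and not taken from the patent.

```python
import math

def frame_energies(samples, frame_len):
    """Mean-square energy of each non-overlapping frame (assumed feature)."""
    energies = []
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        energies.append(sum(x * x for x in frame) / len(frame))
    return energies

def gains_for_target(energies, target_energy, max_gain=4.0):
    """Per-frame linear gain that moves each frame's energy to the target.

    Closed-form stand-in for the optimization step: silent frames keep
    unity gain, and every gain is capped at max_gain (hypothetical cap).
    """
    gains = []
    for energy in energies:
        gain = math.sqrt(target_energy / energy) if energy > 0 else 1.0
        gains.append(min(gain, max_gain))
    return gains

def apply_gains(samples, gains, frame_len):
    """Scale each sample by the gain of the frame it belongs to."""
    return [x * gains[i // frame_len] for i, x in enumerate(samples)]
```

A real system would likely smooth gains across frames and optimize them jointly across components; this sketch treats each frame independently.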
-
Publication No.: US20230145605A1
Publication date: 2023-05-11
Application No.: US17983147
Filing date: 2022-11-08
Applicants: Apurva Shah, Roberto De Ioris
Inventors: Apurva Shah, Roberto De Ioris
IPC classification: H04L65/75, G10L21/0316
CPC classification: H04L65/75, G10L21/0316
Abstract: A computer-implemented method includes receiving audio packets associated with a first client device, where the audio packets each include an audio capture waveform, a timestamp, and a digital entity identification (ID). The method further includes determining, based on the digital entity ID, a position of a first digital entity in a metaverse. The method further includes determining a subset of other digital entities in the metaverse that are within an audio area of the first digital entity based on (a) a falloff distance between the first digital entity and each of the other digital entities and (b) a direction of audio propagation between the first digital entity and each of the other digital entities. The method further includes transmitting the audio packets to second client devices associated with the subset of other digital entities in the metaverse.
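The audio-area test in this abstract combines a distance criterion with a direction criterion. A minimal 2D sketch follows, assuming the direction criterion is a cone around the speaker's facing vector; the cone model, the function names, and the parameters are hypothetical and not taken from the patent.

```python
import math

def in_audio_area(speaker_pos, speaker_dir, listener_pos,
                  falloff_distance, cone_half_angle_deg):
    """True if the listener is within range and inside the speaker's cone."""
    dx = listener_pos[0] - speaker_pos[0]
    dy = listener_pos[1] - speaker_pos[1]
    dist = math.hypot(dx, dy)
    if dist > falloff_distance:
        return False          # beyond the falloff distance
    dir_norm = math.hypot(speaker_dir[0], speaker_dir[1])
    if dist == 0 or dir_norm == 0:
        return True           # co-located, or no facing: treat as omnidirectional
    # Cosine of the angle between the facing vector and the listener offset.
    cos_angle = (dx * speaker_dir[0] + dy * speaker_dir[1]) / (dist * dir_norm)
    return cos_angle >= math.cos(math.radians(cone_half_angle_deg))

def select_listeners(speaker_pos, speaker_dir, listeners,
                     falloff_distance, cone_half_angle_deg):
    """Return IDs of entities (id -> position) that should receive the packets."""
    return [
        entity_id
        for entity_id, pos in listeners.items()
        if in_audio_area(speaker_pos, speaker_dir, pos,
                         falloff_distance, cone_half_angle_deg)
    ]
```

Packets would then be forwarded only to the client devices behind the returned IDs.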
-
Publication No.: US20190206424A1
Publication date: 2019-07-04
Application No.: US16297108
Filing date: 2019-03-08
Applicant: Cogito Corporation
Inventors: Joshua Feast, Ali Azarbayejani, Skyler Place
IPC classification: G10L25/63, H04L29/08, G06F17/27, G10L15/28, G06Q30/02, G10L21/0316, G10L25/66, H04W4/14, G06F17/00
CPC classification: G10L25/63, G06F17/00, G06F17/27, G06F19/00, G06Q30/0269, G10L15/02, G10L15/187, G10L15/28, G10L21/0316, G10L25/66, G10L25/90, H04L67/22, H04W4/14
Abstract: Systems and methods are provided for analyzing voice-based audio inputs. A voice-based audio input associated with a user (e.g., wherein the voice-based audio input is a prompt or a command) is received and measures of one or more features are extracted. One or more parameters are calculated based on the measures of the one or more features. The occurrence of one or more mistriggers is identified by inputting the one or more parameters into a predictive model. Further, systems and methods are provided for identifying human mental health states using mobile device data. Mobile device data (including sensor data) associated with a mobile device corresponding to a user is received. Measurements are derived from the mobile device data and input into a predictive model. The predictive model is executed and outputs probability values of one or more symptoms associated with the user.
-
Publication No.: US20190115040A1
Publication date: 2019-04-18
Application No.: US16208451
Filing date: 2018-12-03
Inventors: PRATIK M. KAMDAR, JINCHENG WU, JOEL A. CLARK, MALAY GUPTA, PLAMEN A. IVANOV
IPC classification: G10L21/0232, G10L21/0272, G10L21/0316, G10L25/84, G10L25/21
CPC classification: G10L21/0232, G10L21/0272, G10L21/0316, G10L25/21, G10L25/78, G10L25/84, G10L2021/02082
Abstract: A method includes obtaining, by a processor, an audio echo signal and an audio desired signal from an acoustic echo correction stage of an electronic device, and converting the echo signal and the desired signal to the frequency domain. The method further includes grouping, by the processor, frequency bin results of respective frequency domain converted echo and desired signals into respective echo and desired sub-bands. A sub-band suppressor gain is estimated based on an estimated sub-band energy for the echo and desired sub-bands. The method further includes modulating the frequency domain converted desired signal to compensate for residual echo, the modulating based, at least in part, on the estimated sub-band suppressor gain, and the modulating producing a compensated frequency domain converted echo signal. The method also includes converting the compensated frequency domain converted desired signal into a time domain converted audio output signal.
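The sub-band gain step in this abstract can be sketched as follows. The sketch starts from per-bin power values (the frequency-domain conversion is omitted) and assumes a Wiener-like gain rule with a gain floor; the rule, the floor, the band edges, and all function names are assumptions for illustration and not the patent's formula.

```python
def subband_energy(power_bins, band_edges):
    """Sum per-bin power into sub-bands; band_edges are bin indices [e0, e1, ...]."""
    return [sum(power_bins[lo:hi])
            for lo, hi in zip(band_edges[:-1], band_edges[1:])]

def suppressor_gains(echo_energy, desired_energy, floor=0.1):
    """Assumed Wiener-like rule: gain = desired / (desired + echo), floored."""
    gains = []
    for echo, desired in zip(echo_energy, desired_energy):
        total = echo + desired
        gain = desired / total if total > 0 else 1.0
        gains.append(max(gain, floor))
    return gains

def apply_subband_gains(spectrum, gains, band_edges):
    """Scale every bin of the desired-signal spectrum by its sub-band gain."""
    out = list(spectrum)
    for gain, lo, hi in zip(gains, band_edges[:-1], band_edges[1:]):
        for k in range(lo, hi):
            out[k] = spectrum[k] * gain
    return out
```

Estimating one gain per sub-band rather than per bin reduces the variance of the gain estimate, at the cost of coarser frequency resolution.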
-
Publication No.: US20190027159A1
Publication date: 2019-01-24
Application No.: US16067850
Filing date: 2016-12-20
Inventors: Akihiko SUGIYAMA, Ryoji MIYAHARA
IPC classification: G10L21/028, G10L21/0316, G10L25/84
CPC classification: G10L21/028, G10L21/0208, G10L21/0272, G10L21/0316, G10L25/84, H04R3/00, H04R2420/01, H04R2430/01
Abstract: There is provided a signal processing apparatus for amplifying or attenuating, with respect to a signal in which a desired signal and another signal are mixed, the desired signal and the other signal at different ratios. The signal processing apparatus includes a separator that obtains an estimated first signal and an estimated second signal by receiving a mixed signal in which a first signal (for example, speech) and a second signal (for example, noise) are mixed and estimating the first signal and the second signal. Furthermore, the signal processing apparatus includes a gain adjuster that obtains a gain-adjusted mixed signal by receiving the estimated first signal and the estimated second signal.
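The gain adjuster described here can be illustrated with a minimal sketch that remixes the two estimated signals with independent gains. It assumes the separator has already produced time-aligned estimates of equal length; the function name and the linear remix are hypothetical and not taken from the patent.

```python
def gain_adjusted_mix(est_first, est_second, first_gain, second_gain):
    """Remix the estimated signals, scaling each by its own gain."""
    if len(est_first) != len(est_second):
        raise ValueError("estimated signals must have the same length")
    return [first_gain * a + second_gain * b
            for a, b in zip(est_first, est_second)]
```

For example, `first_gain=2.0` and `second_gain=0.5` would emphasize the estimated speech while leaving some of the estimated noise audible rather than removing it entirely.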
-
Publication No.: US10014838B2
Publication date: 2018-07-03
Application No.: US15672405
Filing date: 2017-08-09
Applicant: FUJITSU LIMITED
Inventors: Sayuri Nakayama, Taro Togawa, Takeshi Otani
IPC classification: H04R3/00, H04R3/02, H03G3/20, G10L21/0272, G10L21/0216, G10L17/00, H04R1/08, H04R1/00
CPC classification: H03G3/20, G10L17/005, G10L21/0216, G10L21/0272, G10L21/0316, H04M3/568, H04R1/00, H04R1/08, H04R3/005, H04R2430/01
Abstract: A gain adjustment apparatus includes a first output device configured to output a first audio signal, a second output device configured to output a second audio signal, a memory, and a processor coupled to the memory and configured to convert the first audio signal and the second audio signal to a first frequency spectrum and a second frequency spectrum, calculate an estimated difference between the first frequency spectrum and the second frequency spectrum based on a comparison of the first frequency spectrum with the second frequency spectrum, and output first and second adjustment spectra corresponding to the first and second frequency spectra, the first and second adjustment spectra being adjusted on the basis of the first and second frequency spectra and the estimated difference.
-
Publication No.: US09972338B2
Publication date: 2018-05-15
Application No.: US15603661
Filing date: 2017-05-24
Applicant: FUJITSU LIMITED
Inventor: Naoshi Matsuo
IPC classification: G10L21/0264, G10L25/21, H04R3/04, G10L21/0216, H04R3/00
CPC classification: G10L21/0264, G10L21/0208, G10L21/0224, G10L21/0316, G10L25/21, G10L2021/02082, G10L2021/02087, G10L2021/02165, H03G3/32, H04R3/005, H04R3/04
Abstract: A noise suppression device includes: an adaptive filter unit that suppresses, using an adaptive filter, a noise component contained in a voice signal generated from a voice captured by a voice input unit to generate a corrected voice signal; a noise generation detection unit that detects timing of generation of the noise component in the voice signal; and a period suppression unit that suppresses the corrected voice signal during a predetermined period of time after the timing of the generation of the noise component.
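The period suppression unit can be sketched as a fixed-length attenuation window that opens at each detected noise onset. The sketch assumes the onset indices are already supplied by a detector and that suppression is a constant attenuation factor; the function name, `hold_len`, and `attenuation` are hypothetical and not taken from the patent.

```python
def suppress_after_onsets(samples, onset_indices, hold_len, attenuation=0.0):
    """Attenuate the corrected signal for hold_len samples after each onset."""
    out = list(samples)
    for onset in onset_indices:
        end = min(onset + hold_len, len(samples))
        for i in range(max(onset, 0), end):
            # Scale from the original samples so overlapping windows
            # do not attenuate the same sample twice.
            out[i] = samples[i] * attenuation
    return out
```

One plausible reading of the abstract is that this window covers the interval in which the adaptive filter has not yet converged on a newly generated noise component.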
-
Publication No.: US09961441B2
Publication date: 2018-05-01
Application No.: US14314064
Filing date: 2014-06-25
Applicant: DSP Group LTD.
Inventor: Yaakov Chen
IPC classification: H04R3/00, G10L21/043, G10L21/0316, G10L21/0208
CPC classification: H04R3/002, G10L21/0208, G10L21/0316, G10L21/043, H04R2410/05, H04R2499/11
Abstract: Methods and systems are provided for enhancing listening intelligibility in electronic devices. A vibration sensor may be used to generate feedback corresponding to vibrations caused by the outputting of the acoustic signals, and the feedback may be used in adjusting the listening intelligibility stage. In some instances, a microphone may be used to obtain audio input corresponding to ambient noise affecting intelligibility of audio outputted, as acoustic signals, via a speaker, to a user. The audio input may be used to control a listening intelligibility stage applied to audio content when the acoustic signals are generated for outputting by the speaker. In particular, the listening intelligibility stage may comprise application of dynamic time-scale modifications.
-
Publication No.: US20180090134A1
Publication date: 2018-03-29
Application No.: US15277278
Filing date: 2016-09-27
Applicant: Vocollect, Inc.
Inventors: Kurt Charles Miller, Arthur McNair, Vanessa Cassandra Sanchez, Philip E. Russell, Allan Strane
IPC classification: G10L15/20, G10L13/02, G10L15/30, G10L21/0232, G10L21/0364
CPC classification: G10L15/20, G10L13/02, G10L15/22, G10L15/30, G10L21/0232, G10L21/0316, G10L21/0364, G10L2015/226
Abstract: A portable terminal has a network interface that receives a set of instructions having a sequence of at least one location and audio properties associated with the at least one location from a server. An audio circuit receives audio signals picked up by a microphone and processes the audio signals in a manner defined by the audio properties associated with the at least one location. A speech recognition module receives processed signals from the audio circuit and carries out a speech recognition process thereupon.
-
Publication No.: US20180062597A1
Publication date: 2018-03-01
Application No.: US15672405
Filing date: 2017-08-09
Applicant: FUJITSU LIMITED
Inventors: Sayuri Nakayama, Taro Togawa, Takeshi Otani
IPC classification: H03G3/20, G10L21/0272, G10L21/0216, G10L17/00, H04R1/08
CPC classification: H03G3/20, G10L17/005, G10L21/0216, G10L21/0272, G10L21/0316, H04M3/568, H04R1/00, H04R1/08, H04R3/005, H04R2430/01
Abstract: A gain adjustment apparatus includes a first output device configured to output a first audio signal, a second output device configured to output a second audio signal, a memory, and a processor coupled to the memory and configured to convert the first audio signal and the second audio signal to a first frequency spectrum and a second frequency spectrum, calculate an estimated difference between the first frequency spectrum and the second frequency spectrum based on a comparison of the first frequency spectrum with the second frequency spectrum, and output first and second adjustment spectra corresponding to the first and second frequency spectra, the first and second adjustment spectra being adjusted on the basis of the first and second frequency spectra and the estimated difference.