-
公开(公告)号:US12058494B2
公开(公告)日:2024-08-06
申请号:US17583105
申请日:2022-01-24
申请人: Apple Inc.
发明人: John Woodruff , Yacine Azmi , Ian M. Fisch , Jing Xia
IPC分类号: H04R25/00 , G10L21/0324 , G10L21/10 , H04R3/04
CPC分类号: H04R25/505 , G10L21/0324 , G10L21/10 , H04R3/04 , H04R25/70 , H04R2205/041 , H04R2225/43 , H04R2430/01
摘要: A media system and a method of using the media system to accommodate hearing loss of a user, are described. The method includes selecting a personal level-and-frequency dependent audio filter that corresponds to a hearing loss profile of the user. The personal level-and-frequency dependent audio filter can be one of several level-and-frequency-dependent audio filters having respective average gain levels and respective gain contours. An accommodative audio output signal can be generated by applying the personal level-and-frequency dependent audio filter to an audio input signal to enhance the audio input signal based on an input level and an input frequency of the audio input signal. The audio output signal can be played by an audio output device to deliver speech or music that the user perceives clearly, despite the hearing loss of the user. Other aspects are also described and claimed.
-
公开(公告)号:US12014747B2
公开(公告)日:2024-06-18
申请号:US18308293
申请日:2023-04-27
IPC分类号: G10L19/26 , G10L19/02 , G10L19/028 , G10L19/03 , G10L19/032 , G10L19/04 , G10L19/12 , G10L19/16 , G10L21/007 , G10L21/02 , G10L21/0208 , G10L21/0324 , G10L21/038 , G10L25/15 , G10L25/18
CPC分类号: G10L19/265 , G10L19/0204 , G10L19/03 , G10L19/032 , G10L19/12 , G10L19/16 , G10L19/26 , G10L21/007 , G10L21/02 , G10L21/0208 , G10L21/0324 , G10L25/15 , G10L25/18 , G10L19/02 , G10L19/028 , G10L19/04 , G10L21/038
摘要: An audio encoder for encoding an audio signal having a lower frequency band and an upper frequency band includes: a detector for detecting a peak spectral region in the upper frequency band of the audio signal; a shaper for shaping the lower frequency band using shaping information for the lower band and for shaping the upper frequency band using at least a portion of the shaping information for the lower band, wherein the shaper is configured to additionally attenuate spectral values in the detected peak spectral region in the upper frequency band; and a quantizer and coder stage for quantizing a shaped lower frequency band and a shaped upper frequency band and for entropy coding quantized spectral values from the shaped lower frequency band and the shaped upper frequency band.
-
公开(公告)号:US20240005943A1
公开(公告)日:2024-01-04
申请号:US18345666
申请日:2023-06-30
发明人: Ehsan Younessian
IPC分类号: G10L21/0324 , G10L25/51 , H04N21/439 , G06V10/40 , G06V20/00 , G06V30/19
CPC分类号: G10L21/0324 , G10L25/51 , H04N21/4394 , G06V10/40 , G06V20/00 , G06V30/19173
摘要: The audio content (e.g., an audio track, an audio file, an audio signal, etc.) of a content item (e.g., multimedia content, a movie, streaming content, etc.) may be modified to augment and/or include one or more auditory events, such as a sound, a plurality of sounds, a sound effect(s), a voice(s), and/or music.
-
公开(公告)号:US11769517B2
公开(公告)日:2023-09-26
申请号:US17270356
申请日:2018-08-24
发明人: Akihiko Sugiyama , Ryoji Miyahara
IPC分类号: G10L21/0324 , G10L25/93 , G10L21/0232 , G10L21/0316 , G10L21/0216 , G10L21/0208
CPC分类号: G10L21/0324 , G10L21/0208 , G10L21/0216 , G10L21/0232 , G10L21/0316 , G10L25/93 , G10L2025/935
摘要: This invention provides a signal processing apparatus capable of obtaining an output signal of sufficiently high quality if the phase of an input signal is largely different from the phase of a true voice. The signal processing apparatus includes a voice detector that receives a mixed signal including a voice and a signal other than the voice and obtains existence of the voice as a voice flag, a corrector that receives the mixed signal and the voice flag and obtains a corrected mixed signal generated by correcting the mixed signal in accordance with a state of the voice flag, and a shaper that receives the corrected mixed signal and shapes the corrected mixed signal.
-
公开(公告)号:US11763798B2
公开(公告)日:2023-09-19
申请号:US17377347
申请日:2021-07-15
发明人: Gierad Laput , Karan Ahuja , Mayank Goel , Christopher Harrison
IPC分类号: G10L21/0324 , G10L13/033 , G10L15/18
CPC分类号: G10L13/033 , G10L15/18 , G10L21/0324
摘要: Embodiments are provided to recognize features and activities from an audio signal. In one embodiment, a model is generated from sound effect data, which is augmented and projected into an audio domain to form a training dataset efficiently. Sound effect data is data that has been artificially created or from enhanced sounds or sound processes to provide a more accurate baseline of sound data than traditional training data. The sound effect data is augmented to create multiple variants to broaden the sound effect data. The augmented sound effects are projected into various audio domains, such as indoor, outdoor, urban, based on mixing background sounds consistent with these audio domains. The model is installed on any computing device, such as a laptop, smartphone, or other device. Features and activities from an audio signal are then recognized by the computing device based on the model without the need for in-situ training.
-
公开(公告)号:US11735203B2
公开(公告)日:2023-08-22
申请号:US17896785
申请日:2022-08-26
发明人: Ehsan Younessian
IPC分类号: G10L21/0324 , G10L25/51 , H04N21/439 , G06V10/40 , G06V20/00 , G06V30/19
CPC分类号: G10L21/0324 , G06V10/40 , G06V20/00 , G06V30/19173 , G10L25/51 , H04N21/4394
摘要: The audio content (e.g., an audio track, an audio file, an audio signal, etc.) of a content item (e.g., multimedia content, a movie, streaming content, etc.) may be modified to augment and/or include one or more auditory events, such as a sound, a plurality of sounds, a sound effect(s), a voice(s), and/or music.
-
公开(公告)号:US11636865B2
公开(公告)日:2023-04-25
申请号:US17392908
申请日:2021-08-03
发明人: Martin Sehlstedt
IPC分类号: G10L19/02 , G10L25/78 , G10L21/0324 , G10L21/0388 , G10L19/012 , G10L25/03
摘要: Background noise estimators and methods are disclosed for estimating background noise in an audio signal. Some methods include obtaining at least one parameter associated with an audio signal segment, such as a frame or part of a frame, based on a first linear prediction gain, calculated as a quotient between a residual signal from a 0th-order linear prediction and a residual signal from a 2nd-order linear prediction for the audio signal segment. A second linear prediction gain is calculated as a quotient between a residual signal from a 2nd-order linear prediction and a residual signal from a 16th-order linear prediction for the audio signal segment. Whether the audio signal segment comprises a pause is determined based at least on the obtained at least one parameter; and a background noise estimate is updated based on the audio signal segment when the audio signal segment comprises a pause.
-
公开(公告)号:US20220415339A1
公开(公告)日:2022-12-29
申请号:US17896785
申请日:2022-08-26
发明人: Ehsan Younessian
IPC分类号: G10L21/0324 , G10L25/51 , H04N21/439 , G06V10/40 , G06V20/00
摘要: The audio content (e.g., an audio track, an audio file, an audio signal, etc.) of a content item (e.g., multimedia content, a movie, streaming content, etc.) may be modified to augment and/or include one or more auditory events, such as a sound, a plurality of sounds, a sound effect(s), a voice(s), and/or music.
-
公开(公告)号:US20220189454A1
公开(公告)日:2022-06-16
申请号:US17117148
申请日:2020-12-10
发明人: Dongyan Huang , Leyuan Sheng , Youjun Xiong
摘要: A computer-implemented method for speech synthesis, a computer device, and a non-transitory computer readable storage medium are provided. The method includes: obtaining a speech text to be synthesized; obtaining a Mel spectrum corresponding to the speech text to be synthesized according to the speech text to be synthesized; inputting the Mel spectrum into a complex neural network, and obtaining a complex spectrum corresponding to the speech text to be synthesized, wherein the complex spectrum comprises real component information and imaginary component information; and obtaining a synthetic speech corresponding to the speech text to be synthesized, according to the complex spectrum. The method can efficiently and simply complete speech synthesis.
-
公开(公告)号:US20210335379A1
公开(公告)日:2021-10-28
申请号:US17270356
申请日:2018-08-24
发明人: Akihiko SUGIYAMA , Ryoji MIYAHARA
IPC分类号: G10L21/0324 , G10L25/93
摘要: This invention provides a signal processing apparatus capable of obtaining an output signal of sufficiently high quality if the phase of an input signal is largely different from the phase of a true voice. The signal processing apparatus includes a voice detector that receives a mixed signal including a voice and a signal other than the voice and obtains existence of the voice as a voice flag, a corrector that receives the mixed signal and the voice flag and obtains a corrected mixed signal generated by correcting the mixed signal in accordance with a state of the voice flag, and a shaper that receives the corrected mixed signal and shapes the corrected mixed signal.
-
-
-
-
-
-
-
-
-