-
公开(公告)号:US12058494B2
公开(公告)日:2024-08-06
申请号:US17583105
申请日:2022-01-24
Applicant: Apple Inc.
Inventor: John Woodruff , Yacine Azmi , Ian M. Fisch , Jing Xia
IPC: H04R25/00 , G10L21/0324 , G10L21/10 , H04R3/04
CPC classification number: H04R25/505 , G10L21/0324 , G10L21/10 , H04R3/04 , H04R25/70 , H04R2205/041 , H04R2225/43 , H04R2430/01
Abstract: A media system and a method of using the media system to accommodate hearing loss of a user, are described. The method includes selecting a personal level-and-frequency dependent audio filter that corresponds to a hearing loss profile of the user. The personal level-and-frequency dependent audio filter can be one of several level-and-frequency-dependent audio filters having respective average gain levels and respective gain contours. An accommodative audio output signal can be generated by applying the personal level-and-frequency dependent audio filter to an audio input signal to enhance the audio input signal based on an input level and an input frequency of the audio input signal. The audio output signal can be played by an audio output device to deliver speech or music that the user perceives clearly, despite the hearing loss of the user. Other aspects are also described and claimed.
-
公开(公告)号:US12014747B2
公开(公告)日:2024-06-18
申请号:US18308293
申请日:2023-04-27
Inventor: Markus Multrus , Christian Neukam , Markus Schnell , Benjamin Schubert
IPC: G10L19/26 , G10L19/02 , G10L19/028 , G10L19/03 , G10L19/032 , G10L19/04 , G10L19/12 , G10L19/16 , G10L21/007 , G10L21/02 , G10L21/0208 , G10L21/0324 , G10L21/038 , G10L25/15 , G10L25/18
CPC classification number: G10L19/265 , G10L19/0204 , G10L19/03 , G10L19/032 , G10L19/12 , G10L19/16 , G10L19/26 , G10L21/007 , G10L21/02 , G10L21/0208 , G10L21/0324 , G10L25/15 , G10L25/18 , G10L19/02 , G10L19/028 , G10L19/04 , G10L21/038
Abstract: An audio encoder for encoding an audio signal having a lower frequency band and an upper frequency band includes: a detector for detecting a peak spectral region in the upper frequency band of the audio signal; a shaper for shaping the lower frequency band using shaping information for the lower band and for shaping the upper frequency band using at least a portion of the shaping information for the lower band, wherein the shaper is configured to additionally attenuate spectral values in the detected peak spectral region in the upper frequency band; and a quantizer and coder stage for quantizing a shaped lower frequency band and a shaped upper frequency band and for entropy coding quantized spectral values from the shaped lower frequency band and the shaped upper frequency band.
-
公开(公告)号:US20240005943A1
公开(公告)日:2024-01-04
申请号:US18345666
申请日:2023-06-30
Applicant: Comcast Cable Communications, LLC
Inventor: Ehsan Younessian
IPC: G10L21/0324 , G10L25/51 , H04N21/439 , G06V10/40 , G06V20/00 , G06V30/19
CPC classification number: G10L21/0324 , G10L25/51 , H04N21/4394 , G06V10/40 , G06V20/00 , G06V30/19173
Abstract: The audio content (e.g., an audio track, an audio file, an audio signal, etc.) of a content item (e.g., multimedia content, a movie, streaming content, etc.) may be modified to augment and/or include one or more auditory events, such as a sound, a plurality of sounds, a sound effect(s), a voice(s), and/or music.
-
公开(公告)号:US11769517B2
公开(公告)日:2023-09-26
申请号:US17270356
申请日:2018-08-24
Applicant: NEC Corporation , NEC Platforms, Ltd.
Inventor: Akihiko Sugiyama , Ryoji Miyahara
IPC: G10L21/0324 , G10L25/93 , G10L21/0232 , G10L21/0316 , G10L21/0216 , G10L21/0208
CPC classification number: G10L21/0324 , G10L21/0208 , G10L21/0216 , G10L21/0232 , G10L21/0316 , G10L25/93 , G10L2025/935
Abstract: This invention provides a signal processing apparatus capable of obtaining an output signal of sufficiently high quality if the phase of an input signal is largely different from the phase of a true voice. The signal processing apparatus includes a voice detector that receives a mixed signal including a voice and a signal other than the voice and obtains existence of the voice as a voice flag, a corrector that receives the mixed signal and the voice flag and obtains a corrected mixed signal generated by correcting the mixed signal in accordance with a state of the voice flag, and a shaper that receives the corrected mixed signal and shapes the corrected mixed signal.
-
公开(公告)号:US11763798B2
公开(公告)日:2023-09-19
申请号:US17377347
申请日:2021-07-15
Applicant: Carnegie Mellon University
Inventor: Gierad Laput , Karan Ahuja , Mayank Goel , Christopher Harrison
IPC: G10L21/0324 , G10L13/033 , G10L15/18
CPC classification number: G10L13/033 , G10L15/18 , G10L21/0324
Abstract: Embodiments are provided to recognize features and activities from an audio signal. In one embodiment, a model is generated from sound effect data, which is augmented and projected into an audio domain to form a training dataset efficiently. Sound effect data is data that has been artificially created or from enhanced sounds or sound processes to provide a more accurate baseline of sound data than traditional training data. The sound effect data is augmented to create multiple variants to broaden the sound effect data. The augmented sound effects are projected into various audio domains, such as indoor, outdoor, urban, based on mixing background sounds consistent with these audio domains. The model is installed on any computing device, such as a laptop, smartphone, or other device. Features and activities from an audio signal are then recognized by the computing device based on the model without the need for in-situ training.
-
公开(公告)号:US11735203B2
公开(公告)日:2023-08-22
申请号:US17896785
申请日:2022-08-26
Applicant: COMCAST CABLE COMMUNICATIONS, LLC
Inventor: Ehsan Younessian
IPC: G10L21/0324 , G10L25/51 , H04N21/439 , G06V10/40 , G06V20/00 , G06V30/19
CPC classification number: G10L21/0324 , G06V10/40 , G06V20/00 , G06V30/19173 , G10L25/51 , H04N21/4394
Abstract: The audio content (e.g., an audio track, an audio file, an audio signal, etc.) of a content item (e.g., multimedia content, a movie, streaming content, etc.) may be modified to augment and/or include one or more auditory events, such as a sound, a plurality of sounds, a sound effect(s), a voice(s), and/or music.
-
公开(公告)号:US20180278225A1
公开(公告)日:2018-09-27
申请号:US15763342
申请日:2016-09-26
Applicant: Lawrence G. Ryckman , Sheldon G. Yakus , Ari Blitz
Inventor: Lawrence G. Ryckman , Sheldon G. Yakus , Ari Blitz
CPC classification number: H03G5/165 , G10L21/0324 , H03G3/3005 , H03G7/002
Abstract: A device for audio leveling and sound enhancement to overcome the lower sounding level than an audio video transmission. The device is interposed at source, such an HDMI cable and a device that can reproduce the enhanced sound such as a television, computer monitor and the like. If the device includes an audio leveler and a pc board or chip having the requisite circuitry and/or software for enhancing the sound and which is in electrical communication with the audio leveler. The housing includes an input port for receiving the initial audio signal and an output port which connects to a audio reproducing device after the inputted signal has been processed.
-
公开(公告)号:US20180269841A1
公开(公告)日:2018-09-20
申请号:US15983339
申请日:2018-05-18
Applicant: Nokia Technologies Oy
Inventor: Jukka Vesa Rauhala , Koray Ozcan
IPC: H03G3/20 , G10L21/02 , H03G3/30 , G10L25/69 , G10L21/00 , H03G7/00 , G10L21/0324 , G10L21/0364
CPC classification number: H03G3/20 , G10L21/00 , G10L21/02 , G10L21/0316 , G10L21/0324 , G10L21/034 , G10L21/0364 , G10L25/69 , H03G3/301 , H03G7/002
Abstract: An apparatus comprising at least one processor and at least one memory including computer program code. The at least one memory and the computer program code is configured to, with the at least one processor, cause the apparatus at least to determine a loudness estimate of a first audio signal, generate a parameter dependent on the loudness estimate; and control the first audio signal dependent on the parameter.
-
公开(公告)号:US20180232511A1
公开(公告)日:2018-08-16
申请号:US15950178
申请日:2018-04-11
Applicant: VocalZoom Systems Ltd.
Inventor: Tal Bakish
CPC classification number: G06F21/32 , G06F21/6218 , G07C9/00158 , G10L15/20 , G10L17/22 , G10L21/0324 , G10L2021/03646 , H04L9/3231 , H04L9/3271
Abstract: Device, system, and method of voice-based user authentication utilizing a challenge. A system includes a voice-based user-authentication unit, to authenticate a user based on a voice sample uttered by the user. A voice-related challenge generator operates to generate a voice-related challenge that induces the user to modify one or more vocal properties of the user. A reaction-to-challenge detector operates to detect a user-specific vocal modification in reaction to the voice-related challenge; by using a processor as well as an acoustic microphone, an optical microphone, or a hybrid acoustic-and-optical microphone. The voice-based user-authentication unit utilizes the user-specific vocal modification, that was detected as reaction to the voice-related challenge, as part of a user-authentication process.
-
公开(公告)号:US10020000B2
公开(公告)日:2018-07-10
申请号:US14589710
申请日:2015-01-05
Applicant: Samsung Electronics Co., Ltd.
Inventor: Hossein Najaf-Zadeh , Yeshwant Muthusamy
IPC: G10L19/00 , G10L21/00 , G10L19/02 , G10L21/0364 , G10L19/008 , G10L21/0324 , G10L25/18 , G10L25/06
CPC classification number: G10L19/0212 , G10L19/008 , G10L21/0324 , G10L21/0364 , G10L25/06 , G10L25/18
Abstract: An embodiment of this disclosure provides an audio receiver. The audio receiver includes a memory configured to store an audio signal and processing circuitry coupled to the memory. The processing circuitry is configured to receive the audio signal. The audio signal comprises a plurality of ambisonic components. The processing circuitry is also configured to separate the audio signal into a plurality of independent ambisonic subcomponents such that each of the independent ambisonic subcomponents is from a different source. The processing circuitry is also configured to decode each of the independent ambisonic subcomponents. The processing circuitry is also configured to combine each of the decoded independent ambisonic subcomponents into speaker signals.
-
-
-
-
-
-
-
-
-