Patent search ipc:"G10L21/0324" Page 1

1.

发明授权
Media system and method of accommodating hearing loss using a personalized audio filter 有权

公开(公告)号：US12058494B2

公开(公告)日：2024-08-06

申请号：US17583105

申请日：2022-01-24

Applicant: Apple Inc.

Inventor： John Woodruff , Yacine Azmi , Ian M. Fisch , Jing Xia

IPC: H04R25/00 , G10L21/0324 , G10L21/10 , H04R3/04

CPC classification number: H04R25/505 , G10L21/0324 , G10L21/10 , H04R3/04 , H04R25/70 , H04R2205/041 , H04R2225/43 , H04R2430/01

Abstract: A media system and a method of using the media system to accommodate hearing loss of a user, are described. The method includes selecting a personal level-and-frequency dependent audio filter that corresponds to a hearing loss profile of the user. The personal level-and-frequency dependent audio filter can be one of several level-and-frequency-dependent audio filters having respective average gain levels and respective gain contours. An accommodative audio output signal can be generated by applying the personal level-and-frequency dependent audio filter to an audio input signal to enhance the audio input signal based on an input level and an input frequency of the audio input signal. The audio output signal can be played by an audio output device to deliver speech or music that the user perceives clearly, despite the hearing loss of the user. Other aspects are also described and claimed.

2.

发明授权
Audio encoder for encoding an audio signal, method for encoding an audio signal and computer program under consideration of a detected peak spectral region in an upper frequency band 有权

公开(公告)号：US12014747B2

公开(公告)日：2024-06-18

申请号：US18308293

申请日：2023-04-27

Applicant: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

Inventor： Markus Multrus , Christian Neukam , Markus Schnell , Benjamin Schubert

IPC: G10L19/26 , G10L19/02 , G10L19/028 , G10L19/03 , G10L19/032 , G10L19/04 , G10L19/12 , G10L19/16 , G10L21/007 , G10L21/02 , G10L21/0208 , G10L21/0324 , G10L21/038 , G10L25/15 , G10L25/18

CPC classification number: G10L19/265 , G10L19/0204 , G10L19/03 , G10L19/032 , G10L19/12 , G10L19/16 , G10L19/26 , G10L21/007 , G10L21/02 , G10L21/0208 , G10L21/0324 , G10L25/15 , G10L25/18 , G10L19/02 , G10L19/028 , G10L19/04 , G10L21/038

Abstract: An audio encoder for encoding an audio signal having a lower frequency band and an upper frequency band includes: a detector for detecting a peak spectral region in the upper frequency band of the audio signal; a shaper for shaping the lower frequency band using shaping information for the lower band and for shaping the upper frequency band using at least a portion of the shaping information for the lower band, wherein the shaper is configured to additionally attenuate spectral values in the detected peak spectral region in the upper frequency band; and a quantizer and coder stage for quantizing a shaped lower frequency band and a shaped upper frequency band and for entropy coding quantized spectral values from the shaped lower frequency band and the shaped upper frequency band.

3.

发明公开
METHODS AND SYSTEMS FOR AUGMENTING AUDIO CONTENT 审中-公开

公开(公告)号：US20240005943A1

公开(公告)日：2024-01-04

申请号：US18345666

申请日：2023-06-30

Applicant: Comcast Cable Communications, LLC

Inventor： Ehsan Younessian

IPC: G10L21/0324 , G10L25/51 , H04N21/439 , G06V10/40 , G06V20/00 , G06V30/19

CPC classification number: G10L21/0324 , G10L25/51 , H04N21/4394 , G06V10/40 , G06V20/00 , G06V30/19173

Abstract: The audio content (e.g., an audio track, an audio file, an audio signal, etc.) of a content item (e.g., multimedia content, a movie, streaming content, etc.) may be modified to augment and/or include one or more auditory events, such as a sound, a plurality of sounds, a sound effect(s), a voice(s), and/or music.

4.

发明授权
Signal processing apparatus, signal processing method, and signal processing program 有权

公开(公告)号：US11769517B2

公开(公告)日：2023-09-26

申请号：US17270356

申请日：2018-08-24

Applicant: NEC Corporation , NEC Platforms, Ltd.

Inventor： Akihiko Sugiyama , Ryoji Miyahara

IPC: G10L21/0324 , G10L25/93 , G10L21/0232 , G10L21/0316 , G10L21/0216 , G10L21/0208

CPC classification number: G10L21/0324 , G10L21/0208 , G10L21/0216 , G10L21/0232 , G10L21/0316 , G10L25/93 , G10L2025/935

Abstract: This invention provides a signal processing apparatus capable of obtaining an output signal of sufficiently high quality if the phase of an input signal is largely different from the phase of a true voice. The signal processing apparatus includes a voice detector that receives a mixed signal including a voice and a signal other than the voice and obtains existence of the voice as a voice flag, a corrector that receives the mixed signal and the voice flag and obtains a corrected mixed signal generated by correcting the mixed signal in accordance with a state of the voice flag, and a shaper that receives the corrected mixed signal and shapes the corrected mixed signal.

5.

发明授权
System and method for acoustic activity recognition 有权

公开(公告)号：US11763798B2

公开(公告)日：2023-09-19

申请号：US17377347

申请日：2021-07-15

Applicant: Carnegie Mellon University

Inventor： Gierad Laput , Karan Ahuja , Mayank Goel , Christopher Harrison

IPC: G10L21/0324 , G10L13/033 , G10L15/18

CPC classification number: G10L13/033 , G10L15/18 , G10L21/0324

Abstract: Embodiments are provided to recognize features and activities from an audio signal. In one embodiment, a model is generated from sound effect data, which is augmented and projected into an audio domain to form a training dataset efficiently. Sound effect data is data that has been artificially created or from enhanced sounds or sound processes to provide a more accurate baseline of sound data than traditional training data. The sound effect data is augmented to create multiple variants to broaden the sound effect data. The augmented sound effects are projected into various audio domains, such as indoor, outdoor, urban, based on mixing background sounds consistent with these audio domains. The model is installed on any computing device, such as a laptop, smartphone, or other device. Features and activities from an audio signal are then recognized by the computing device based on the model without the need for in-situ training.

6.

发明授权
Methods and systems for augmenting audio content 有权

公开(公告)号：US11735203B2

公开(公告)日：2023-08-22

申请号：US17896785

申请日：2022-08-26

Applicant: COMCAST CABLE COMMUNICATIONS, LLC

Inventor： Ehsan Younessian

IPC: G10L21/0324 , G10L25/51 , H04N21/439 , G06V10/40 , G06V20/00 , G06V30/19

CPC classification number: G10L21/0324 , G06V10/40 , G06V20/00 , G06V30/19173 , G10L25/51 , H04N21/4394

Abstract: The audio content (e.g., an audio track, an audio file, an audio signal, etc.) of a content item (e.g., multimedia content, a movie, streaming content, etc.) may be modified to augment and/or include one or more auditory events, such as a sound, a plurality of sounds, a sound effect(s), a voice(s), and/or music.

7.

发明授权
Estimation of background noise in audio signals 有权

公开(公告)号：US11636865B2

公开(公告)日：2023-04-25

申请号：US17392908

申请日：2021-08-03

Applicant: Telefonaktiebolaget LM Ericsson (publ)

Inventor： Martin Sehlstedt

IPC: G10L19/02 , G10L25/78 , G10L21/0324 , G10L21/0388 , G10L19/012 , G10L25/03

Abstract: Background noise estimators and methods are disclosed for estimating background noise in an audio signal. Some methods include obtaining at least one parameter associated with an audio signal segment, such as a frame or part of a frame, based on a first linear prediction gain, calculated as a quotient between a residual signal from a 0th-order linear prediction and a residual signal from a 2nd-order linear prediction for the audio signal segment. A second linear prediction gain is calculated as a quotient between a residual signal from a 2nd-order linear prediction and a residual signal from a 16th-order linear prediction for the audio signal segment. Whether the audio signal segment comprises a pause is determined based at least on the obtained at least one parameter; and a background noise estimate is updated based on the audio signal segment when the audio signal segment comprises a pause.

8.

发明申请
METHODS AND SYSTEMS FOR AUGMENTING AUDIO CONTENT 有权

公开(公告)号：US20220415339A1

公开(公告)日：2022-12-29

申请号：US17896785

申请日：2022-08-26

Applicant: COMCAST CABLE COMMUNICATIONS, LLC

Inventor： Ehsan Younessian

IPC: G10L21/0324 , G10L25/51 , H04N21/439 , G06V10/40 , G06V20/00

Abstract: The audio content (e.g., an audio track, an audio file, an audio signal, etc.) of a content item (e.g., multimedia content, a movie, streaming content, etc.) may be modified to augment and/or include one or more auditory events, such as a sound, a plurality of sounds, a sound effect(s), a voice(s), and/or music.

9.

发明申请
COMPUTER-IMPLEMENTED METHOD FOR SPEECH SYNTHESIS, COMPUTER DEVICE, AND NON-TRANSITORY COMPUTER READABLE STORAGE MEDIUM 有权

公开(公告)号：US20220189454A1

公开(公告)日：2022-06-16

申请号：US17117148

申请日：2020-12-10

Applicant: UBTECH ROBOTICS CORP LTD

Inventor： Dongyan Huang , Leyuan Sheng , Youjun Xiong

IPC: G10L13/02 , G10L25/30 , G10L25/24 , G10L21/0324 , G06N3/08 , G06N20/10 , G06F17/14

Abstract: A computer-implemented method for speech synthesis, a computer device, and a non-transitory computer readable storage medium are provided. The method includes: obtaining a speech text to be synthesized; obtaining a Mel spectrum corresponding to the speech text to be synthesized according to the speech text to be synthesized; inputting the Mel spectrum into a complex neural network, and obtaining a complex spectrum corresponding to the speech text to be synthesized, wherein the complex spectrum comprises real component information and imaginary component information; and obtaining a synthetic speech corresponding to the speech text to be synthesized, according to the complex spectrum. The method can efficiently and simply complete speech synthesis.

10.

发明申请
SIGNAL PROCESSING APPARATUS, SIGNAL PROCESSING METHOD, AND SIGNAL PROCESSING PROGRAM 有权

公开(公告)号：US20210335379A1

公开(公告)日：2021-10-28

申请号：US17270356

申请日：2018-08-24

Applicant: NEC Corporation , NEC p|atforms, Ltd.

Inventor： Akihiko SUGIYAMA , Ryoji MIYAHARA

IPC: G10L21/0324 , G10L25/93

Abstract: This invention provides a signal processing apparatus capable of obtaining an output signal of sufficiently high quality if the phase of an input signal is largely different from the phase of a true voice. The signal processing apparatus includes a voice detector that receives a mixed signal including a voice and a signal other than the voice and obtains existence of the voice as a voice flag, a corrector that receives the mixed signal and the voice flag and obtains a corrected mixed signal generated by correcting the mixed signal in accordance with a state of the voice flag, and a shaper that receives the corrected mixed signal and shapes the corrected mixed signal.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification