Patent search ap:("SoundHound Page Inc.") AND inv:"Monika Almudafar-Depeyrot"

1.

Granted invention patent
Parametric adaptation of voice synthesis (status: active)

Publication No.: US10586079B2

Publication date: 2020-03-10

Application No.: US15406213

Application date: 2017-01-13

Applicant: SoundHound, Inc.

Inventor: Monika Almudafar-Depeyrot, Bernard Mont-Reynaud

IPC: G10L13/033, G10L13/10, G06F40/30, G10L13/00

Abstract: Software-based systems perform parametric speech synthesis. TTS voice parameters determine the generated speech audio. Voice parameters include gender, age, dialect, donor, arousal, authoritativeness, pitch, range, speech rate, volume, flutter, roughness, breath, frequencies, bandwidths, and relative amplitudes of formants and nasal sounds. The system chooses TTS parameters based on one or more of: user profile attributes including gender, age, and dialect; situational attributes such as location, noise level, and mood; natural language semantic attributes such as domain of conversation, expression type, dimensions of affect, word emphasis and sentence structure; and analysis of target speaker voices. The system chooses TTS parameters to improve listener satisfaction or other desired listener behavior. Choices may be made by specified algorithms defined by code developers, or by machine learning algorithms trained on labeled samples of system performance.

2.

Invention patent application
PARAMETRIC ADAPTATION OF VOICE SYNTHESIS (status: pending, published)

Publication No.: US20180182373A1

Publication date: 2018-06-28

Application No.: US15406213

Application date: 2017-01-13

Applicant: SoundHound, Inc.

Inventor: Monika Almudafar-Depeyrot, Bernard Mont-Reynaud

IPC: G10L13/033, G10L13/10, G10L13/047, G06F17/27

CPC classification number: G06F17/2785, G10L13/00

Abstract: Software-based systems perform parametric speech synthesis. TTS voice parameters determine the generated speech audio. Voice parameters include gender, age, dialect, donor, arousal, authoritativeness, pitch, range, speech rate, volume, flutter, roughness, breath, frequencies, bandwidths, and relative amplitudes of formants and nasal sounds. The system chooses TTS parameters based on one or more of: user profile attributes including gender, age, and dialect; situational attributes such as location, noise level, and mood; natural language semantic attributes such as domain of conversation, expression type, dimensions of affect, word emphasis and sentence structure; and analysis of target speaker voices. The system chooses TTS parameters to improve listener satisfaction or other desired listener behavior. Choices may be made by specified algorithms defined by code developers, or by machine learning algorithms trained on labeled samples of system performance.

3.

Invention patent application
VIRTUAL ASSISTANT CONFIGURED BY SELECTION OF WAKE-UP PHRASE (status: pending, published)

Publication No.: US20180108343A1

Publication date: 2018-04-19

Application No.: US15294234

Application date: 2016-10-14

Applicant: SoundHound, Inc.

Inventor: Mark Stevans, Monika Almudafar-Depeyrot, Keyvan Mohajer

IPC: G10L13/08, G10L15/18, G10L15/22, G10L15/30, G10L15/02

CPC classification number: G10L13/08, G10L13/043, G10L15/18, G10L15/22, G10L15/30, G10L2015/025, G10L2015/088, G10L2015/223

Abstract: A speech-enabled dialog system responds to a plurality of wake-up phrases. Based on which wake-up phrase is detected, the system's configuration is modified accordingly. Various configurable aspects of the system include selection and morphing of a text-to-speech voice; configuration of acoustic model, language model, vocabulary, and grammar; configuration of a graphic animation; configuration of virtual assistant personality parameters; invocation of a particular user profile; invocation of an authentication function; and configuration of an open sound. Configuration depends on a target market segment. Configuration also depends on the state of the dialog system, such as whether a previous utterance was an information query.

4.

Granted invention patent
Text-to-speech adapted by machine learning (status: active)

Publication No.: US11531819B2

Publication date: 2022-12-20

Application No.: US16742006

Application date: 2020-01-14

Applicant: SoundHound, Inc.

Inventor: Bernard Mont-Reynaud, Monika Almudafar-Depeyrot

IPC: G06F40/30, G10L13/00, G10L13/033, G10L13/04, G10L13/10

Abstract: Machine learned models take in vectors representing desired behaviors and generate voice vectors that provide the parameters for text-to-speech (TTS) synthesis. Models may be trained on behavior vectors that include user profile attributes, situational attributes, or semantic attributes. Situational attributes may include age of people present, music that is playing, location, noise, and mood. Semantic attributes may include presence of proper nouns, number of modifiers, emotional charge, and domain of discourse. TTS voice parameters may apply per utterance and per word as to enable contrastive emphasis.

5.

Invention patent application
Text-to-Speech Adapted by Machine Learning (status: active)

Publication No.: US20220148566A1

Publication date: 2022-05-12

Application No.: US17580289

Application date: 2022-01-20

Applicant: SoundHound, Inc.

Inventor: Bernard Mont-Reynaud, Monika Almudafar-Depeyrot

IPC: G10L13/10, G10L13/04, G10L13/033

Abstract: Machine learned models take in vectors representing desired behaviors and generate voice vectors that provide the parameters for text-to-speech (TTS) synthesis. Models may be trained on behavior vectors that include user profile attributes, situational attributes, or semantic attributes. Situational attributes may include age of people present, music that is playing, location, noise, and mood. Semantic attributes may include presence of proper nouns, number of modifiers, emotional charge, and domain of discourse. TTS voice parameters may apply per utterance and per word as to enable contrastive emphasis.

6.

Granted invention patent
Virtual assistant configured by selection of wake-up phrase (status: active)

Publication No.: US10217453B2

Publication date: 2019-02-26

Application No.: US15294234

Application date: 2016-10-14

Applicant: SoundHound, Inc.

Inventor: Mark Stevans, Monika Almudafar-Depeyrot, Keyvan Mohajer

IPC: G10L13/04, G10L13/08, G10L15/02, G10L15/08, G10L15/18, G10L15/22, G10L15/30

Abstract: A speech-enabled dialog system responds to a plurality of wake-up phrases. Based on which wake-up phrase is detected, the system's configuration is modified accordingly. Various configurable aspects of the system include selection and morphing of a text-to-speech voice; configuration of acoustic model, language model, vocabulary, and grammar; configuration of a graphic animation; configuration of virtual assistant personality parameters; invocation of a particular user profile; invocation of an authentication function; and configuration of an open sound. Configuration depends on a target market segment. Configuration also depends on the state of the dialog system, such as whether a previous utterance was an information query.

7.

Granted invention patent
Integration of third party virtual assistants (status: active)

Publication No.: US10783872B2

Publication date: 2020-09-22

Application No.: US16246543

Application date: 2019-01-13

Applicant: SoundHound, Inc.

Inventor: Monika Almudafar-Depeyrot, Keyvan Mohajer, Mark Stevans

IPC: G10L13/08, G10L15/18, G10L15/22, G10L13/04, G10L15/26, G10L15/30, G10L15/08, G10L15/02

Abstract: A speech-enabled dialog system responds to a plurality of wake-up phrases. Based on which wake-up phrase is detected, the system's configuration is modified accordingly. Various configurable aspects of the system include selection and morphing of a text-to-speech voice; configuration of acoustic model, language model, vocabulary, and grammar; configuration of a graphic animation; configuration of virtual assistant personality parameters; invocation of a particular user profile; invocation of an authentication function; and configuration of an open sound. Configuration depends on a target market segment. Configuration also depends on the state of the dialog system, such as whether a previous utterance was an information query.

8.

Invention patent application
Text-to-Speech Adapted by Machine Learning (status: pending, published)

Publication No.: US20200151394A1

Publication date: 2020-05-14

Application No.: US16742006

Application date: 2020-01-14

Applicant: SoundHound, Inc.

Inventor: Bernard Mont-Reynaud, Monika Almudafar-Depeyrot

IPC: G06F40/30, G10L13/00

Abstract: Machine learned models take in vectors representing desired behaviors and generate voice vectors that provide the parameters for text-to-speech (TTS) synthesis. Models may be trained on behavior vectors that include user profile attributes, situational attributes, or semantic attributes. Situational attributes may include age of people present, music that is playing, location, noise, and mood. Semantic attributes may include presence of proper nouns, number of modifiers, emotional charge, and domain of discourse. TTS voice parameters may apply per utterance and per word as to enable contrastive emphasis.

9.

Invention patent application
INTEGRATION OF THIRD PARTY VIRTUAL ASSISTANTS (status: pending, published)

Publication No.: US20190147850A1

Publication date: 2019-05-16

Application No.: US16246543

Application date: 2019-01-13

Applicant: SoundHound, Inc.

Inventor: Monika Almudafar-Depeyrot, Keyvan Mohajer, Mark Stevans

IPC: G10L13/08, G10L15/18, G10L15/22, G10L13/04

Abstract: A virtual assistant device recognizes multiple wake-up phrases. In response to a particular wake-up phrase the device sends speech audio to either a default or a third party virtual assistant server. A virtual assistant server can receive speech audio and an indication of which of multiple wake-up phrases was used and, accordingly, send the speech audio, or text recognized from the speech audio using automatic speech recognition, to a third party server. A response from the third party server can be voice audio or text for the virtual assistant server to synthesize distinctively corresponding to the wake-up phrase.
