-
Publication No.: US20220122607A1
Publication date: 2022-04-21
Application No.: US17562891
Application date: 2021-12-27
Applicant: SoundHound, Inc.
Inventor: Scott Halstvedt , Keyvan Mohajer , Bernard Mont-Reynaud
Abstract: A method of controlling an engagement state of an agent during a human-machine dialog is provided. The method can include receiving a spoken request that is a conditional locking request, wherein the conditional locking request uses a natural language expression to explicitly specify a locking condition, which is a predicate, storing the predicate in a format that can be evaluated when needed by the agent, entering a conditionally locked state in response to the conditional locking request, in the conditionally locked state, receiving a multiplicity of requests without a need for a wakeup indicator, and for a request from the multiplicity of requests evaluating the predicate upon receiving the request, and processing the request if the predicate is true.
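Note: The flow in this abstract (store a predicate, enter a conditionally locked state, then evaluate the predicate on each incoming request) can be illustrated with a minimal sketch. This is not the patented implementation. The names `Request`, `Agent`, `lock`, and `receive` are hypothetical, the predicate is supplied directly as a Python callable rather than parsed from a natural language expression, and the wakeword fallback in the unlocked state is an assumption.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class Request:
    """Hypothetical request record; fields are illustrative assumptions."""
    text: str
    speaker: str
    has_wakeword: bool = False


Predicate = Callable[[Request], bool]


@dataclass
class Agent:
    # None means the agent is not in the conditionally locked state.
    predicate: Optional[Predicate] = None
    processed: List[str] = field(default_factory=list)

    def lock(self, predicate: Predicate) -> None:
        """Store the predicate and enter the conditionally locked state."""
        self.predicate = predicate

    def receive(self, request: Request) -> bool:
        """Process the request if allowed; return True when it was processed."""
        if self.predicate is not None:
            # Locked: no wakeup indicator is needed, but the predicate must hold.
            accepted = self.predicate(request)
        else:
            # Assumption: outside the locked state a wakeword is required.
            accepted = request.has_wakeword
        if accepted:
            self.processed.append(request.text)
        return accepted


agent = Agent()
# Stands in for a spoken request such as "listen only to Alice".
agent.lock(lambda r: r.speaker == "alice")
results = [
    agent.receive(Request("what time is it", "alice")),
    agent.receive(Request("play music", "bob")),
    agent.receive(Request("set a timer", "alice")),
]
```

In this sketch only the two requests from the matching speaker are processed, and none of them needs a wakeword.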
-
Publication No.: US11250844B2
Publication date: 2022-02-15
Application No.: US15881553
Application date: 2018-01-26
Applicant: SoundHound, Inc.
Inventor: Bernard Mont-Reynaud , Scott Halstvedt , Keyvan Mohajer
IPC: G10L15/22 , G10L17/22 , G10L17/04 , G10L15/08 , G10L17/06 , G06F3/16 , G06F21/32 , G06K9/00 , G10L17/00
Abstract: Agents engage and disengage with users intelligently. Users can tell agents to remain engaged without requiring a wakeword. Engaged states can support modal dialogs and barge-in. Users can cause disengagement explicitly. Disengagement can be conditional based on timeout, change of user, or environmental conditions. Engagement can be one-time or recurrent. Recurrent states can be attentive or locked. Locked states can be unconditional or conditional, including being reserved to support user continuity. User continuity can be tested by matching parameters or tracking user by many modalities including microphone arrays, cameras, and other sensors.
-
Publication No.: US11113473B2
Publication date: 2021-09-07
Application No.: US15942875
Application date: 2018-04-02
Applicant: SoundHound, Inc.
Inventor: Christopher S. Wilson , Keyvan Mohajer , Bernard Mont-Reynaud
Abstract: The present invention extends to methods, systems, and computer program products for interpreting expressions having potentially ambiguous meanings in different domains. Multi-domain natural language understanding systems can support a variety of different types of clients. Expressions can be interpreted across multiple domains. Weights can be assigned to domains. Weights can be client specific or expression specific so that a chosen interpretation is more likely correct for the type of client or for its context. Stored weight sets can be chosen according to identifying information carried as metadata with expressions or weight sets carried directly as metadata. Domains can additionally or alternatively be ranked in ordered lists or comparative domain pairs to favor some domains over others as appropriate for client type or client context.
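Note: The idea of client-specific domain weights can be sketched as follows. This is an illustration under stated assumptions, not the method claimed in the patent: the function `choose_interpretation`, the `WEIGHT_SETS` table, the client names, and the rule of multiplying each domain's interpretation score by its weight are all hypothetical.

```python
from typing import Dict


def choose_interpretation(scores: Dict[str, float],
                          weights: Dict[str, float]) -> str:
    """Return the domain with the highest weighted score.

    Domains without an explicit weight default to 1.0 (an assumption).
    """
    return max(scores, key=lambda d: scores[d] * weights.get(d, 1.0))


# Hypothetical stored weight sets, selected by client type metadata.
WEIGHT_SETS = {
    "car": {"navigation": 2.0, "music": 1.0},
    "speaker": {"navigation": 0.5, "music": 2.0},
}

# Hypothetical per-domain scores for one ambiguous expression.
scores = {"navigation": 0.6, "music": 0.5}

car_choice = choose_interpretation(scores, WEIGHT_SETS["car"])
speaker_choice = choose_interpretation(scores, WEIGHT_SETS["speaker"])
```

With these made-up numbers the same expression resolves to the navigation domain for the car client and to the music domain for the speaker client.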
-
Publication No.: US20200312329A1
Publication date: 2020-10-01
Application No.: US16900857
Application date: 2020-06-12
Applicant: SoundHound, Inc.
Inventor: Keyvan Mohajer , Timothy Stonehocker , Bernard Mont-Reynaud
Abstract: A method of a local recognition system controlling a host device to perform one or more operations is provided. The method includes receiving, by the local recognition system, a query, performing speech recognition on the received query by implementing, by the local recognition system, a local language context comprising a set of words comprising descriptions in terms of components smaller than the words, and performing speech recognition, using the local language context, to create a transcribed query. Further, the method includes controlling the host device in dependence upon the speech recognition performed on the transcribed query.
-
Publication No.: US10657174B2
Publication date: 2020-05-19
Application No.: US16044331
Application date: 2018-07-24
Applicant: SoundHound, Inc.
Inventor: Aaron Master , Bernard Mont-Reynaud , Keyvan Mohajer , Timothy Stonehocker
IPC: G06F16/683 , G06F16/68 , G06F16/432 , G06F16/638
Abstract: The present invention relates to providing identification information in response to an audio segment using a first mode of operation including receiving an audio segment and sending the audio segment to a remote server and receiving, from the remote server, identification information relating to the audio segment, and a second mode of operation of receiving an audio segment and using stored information to obtain identification information relating to the received audio segment, without sending the audio segment to the remote server. The present invention further includes using identification information from the remote server and using local identification information and selecting either identification information from the remote server or local identification information based on selection criteria, and generating an output based on the selected identification information.
-
Publication No.: US10217453B2
Publication date: 2019-02-26
Application No.: US15294234
Application date: 2016-10-14
Applicant: SoundHound, Inc.
Inventor: Mark Stevans , Monika Almudafar-Depeyrot , Keyvan Mohajer
Abstract: A speech-enabled dialog system responds to a plurality of wake-up phrases. Based on which wake-up phrase is detected, the system's configuration is modified accordingly. Various configurable aspects of the system include selection and morphing of a text-to-speech voice; configuration of acoustic model, language model, vocabulary, and grammar; configuration of a graphic animation; configuration of virtual assistant personality parameters; invocation of a particular user profile; invocation of an authentication function; and configuration of an open sound. Configuration depends on a target market segment. Configuration also depends on the state of the dialog system, such as whether a previous utterance was an information query.
-
Publication No.: US20180301151A1
Publication date: 2018-10-18
Application No.: US15881553
Application date: 2018-01-26
Applicant: SoundHound, Inc.
Inventor: Bernard Mont-Reynaud , Scott Halstvedt , Keyvan Mohajer
Abstract: Agents engage and disengage with users intelligently. Users can tell agents to remain engaged without requiring a wakeword. Engaged states can support modal dialogs and barge-in. Users can cause disengagement explicitly. Disengagement can be conditional based on timeout, change of user, or environmental conditions. Engagement can be one-time or recurrent. Recurrent states can be attentive or locked. Locked states can be unconditional or conditional, including being reserved to support user continuity. User continuity can be tested by matching parameters or tracking user by many modalities including microphone arrays, cameras, and other sensors.
-
Publication No.: US10102201B2
Publication date: 2018-10-16
Application No.: US14954810
Application date: 2015-11-30
Applicant: SoundHound Inc.
Inventor: Keyvan Mohajer , Kamyar Mohajer , Bernard Mont-Reynaud , Pranav Singh
Abstract: The present invention extends to methods, systems, and computer program products for a natural language module store. In general, the invention can be used to manage natural language modules offered through a natural language module store. Natural language module (NLM) developers can post NLMs at a NLM store to make the NLMs available for use by others. Developers can select NLMs for inclusion in natural language interpreters (NLIs) containing (and possibly integrating the functionality of) one or more NLMs. Prior to selecting a NLM, a developer can search or browse NLMs to identify an appropriate NLM. Optionally, a developer can test a NLM in the NLM store prior to inclusion in an NLI. For example, multiple NLMs purporting to provide the same specified natural language functionality can be tested relative to one another prior to selection of one of the NLMs for inclusion in an NLI.
-
Publication No.: US20180108050A1
Publication date: 2018-04-19
Application No.: US15293931
Application date: 2016-10-14
Applicant: SoundHound, Inc.
Inventor: Scott Halstvedt , Keyvan Mohajer
CPC classification number: G06Q30/0275 , G06F17/2775 , G06F17/2785
Abstract: An ad processor evaluates bid functions that are based on concepts that might be generated from interpretations of natural language expressions. Ad buyers provide the functions with corresponding ads to ad processors. Bid functions are further based on the values of semantic information referenced by expressions. Bid functions are further based on environmental information. Ad buyers are able to modify bid functions. Ads may be provided in the form of questions, and may be indicated by an identifying sound. Upon finding no expression concepts within a bid function, the set of expression concepts is expanded according to strengths of connections between concepts in a concept graph.
-
Publication No.: US09633371B1
Publication date: 2017-04-25
Application No.: US14696308
Application date: 2015-04-24
Applicant: SoundHound, Inc.
Inventor: Keyvan Mohajer , Aaron Master
CPC classification number: G06Q30/0261 , G06Q30/0251 , G06Q30/0275
Abstract: The present disclosure relates to systems and methods that recognize audio queries and select related information to return in response to recognition of the audio queries. The technology disclosed facilitates easy designation of aggregate user experience categories and custom audio references to be recognized. It facilitates linking and returning of selected information in response to recognition of audio queries that match the designated aggregate user experience categories or custom audio references to be recognized.
-