-
Publication number: US20180182398A1
Publication date: 2018-06-28
Application number: US15389122
Filing date: 2016-12-22
Applicant: SoundHound, Inc.
Inventor: Scott Halstvedt, Bernard Mont-Reynaud, Kazi Asif Wadud
CPC classification number: G10L15/30, G06F17/279, G10L15/22, G10L15/222, G10L2015/223
Abstract: A query-processing system processes an input audio stream that represents a succession of queries spoken by a user. The query-processing system listens continuously to the input audio stream, parses queries and takes appropriate actions in mid-stream. In some embodiments, the system processes queries in parallel, limited by serial constraints. In some embodiments, the system parses and executes queries while a previous query's execution is still in progress. To accommodate users who tend to speak slowly and express a thought in separate parts, the query-processing system halts the outputting of results corresponding to a previous query if it detects that a new speech utterance modifies the meaning of the previous query.
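The halting behavior described in this abstract can be illustrated with a small sketch. It assumes utterances arrive already transcribed, and it uses a placeholder keyword heuristic (`CONTINUATION_WORDS`, `modifies_previous`) to decide whether a new utterance modifies the previous query; the abstract does not say how that detection is done, so these names and the heuristic are hypothetical.

```python
# Hypothetical heuristic: an utterance that opens with one of these words is
# treated as a continuation of the previous query. This is an assumption for
# illustration only, not the detection method of the patent.
CONTINUATION_WORDS = ("in", "for", "with", "and", "but", "near")


def modifies_previous(utterance):
    """Return True if the utterance looks like a modifier of the prior query."""
    words = utterance.lower().split()
    return bool(words) and words[0] in CONTINUATION_WORDS


def process_stream(utterances):
    """Process utterances in order.

    Returns (answered, halted): queries whose results are output, and
    queries whose output was halted because a later utterance modified them.
    """
    answered, halted = [], []
    for utt in utterances:
        if answered and modifies_previous(utt):
            # Halt the previous query's output and replace it with the merged query.
            previous = answered.pop()
            halted.append(previous)
            answered.append(f"{previous} {utt}")
        else:
            answered.append(utt)
    return answered, halted


answered, halted = process_stream(
    ["find italian restaurants", "in san francisco", "what time is it"]
)
print(answered)
print(halted)
```

A real system would overlap recognition and execution in time; this sequential version only shows the decision to halt and merge.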
-
Publication number: US20180121508A1
Publication date: 2018-05-03
Application number: US15342970
Filing date: 2016-11-03
Applicant: SoundHound, Inc.
Inventor: Scott Halstvedt
IPC: G06F17/30
Abstract: A virtual assistant receives natural language interpretation hypotheses for user queries, determines entities and attributes from the interpretations, and requests data from appropriate data sources. A cost function estimates the cost of each data source request. Cost functions include factors such as contract pricing, access latency, and data quality. Based on the estimated cost, the virtual assistant sends requests to a plurality of data sources, each of which might be able to provide data necessary to answer the user query. By including user credits in the cost function, the virtual assistant provides better quality of results and answer latency for paying users. The virtual assistant minimizes latency by answering using data from the first responding data source or provides a latency guarantee by answering with the most accurate data received by a deadline. The virtual assistant measures data source response latency and caches responses for expensive requests.
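As a rough illustration of the cost-based source selection in this abstract, the sketch below combines price, latency, and quality into one cost, lets user credits offset the price term, and shows the two answering strategies (first responder versus best answer by a deadline). The `Source` fields, the linear weighting, the `budget` threshold, and all function names are assumptions made for the example; the abstract does not give a formula.

```python
from dataclasses import dataclass


@dataclass
class Source:
    name: str
    price: float      # contract price per request (hypothetical units)
    latency_s: float  # measured response latency in seconds
    quality: float    # data quality in [0, 1]; higher is better


def request_cost(src, user_credits, w_price=1.0, w_latency=1.0, w_quality=1.0):
    """Estimated cost of one request. Credits offset the price term (assumed form)."""
    effective_price = max(0.0, src.price - user_credits)
    return (w_price * effective_price
            + w_latency * src.latency_s
            + w_quality * (1.0 - src.quality))


def choose_sources(sources, user_credits, budget):
    """Send requests to every source whose estimated cost fits the budget."""
    return [s for s in sources if request_cost(s, user_credits) <= budget]


def answer_first(sources):
    """Minimize latency: answer with the first source to respond."""
    return min(sources, key=lambda s: s.latency_s)


def answer_by_deadline(sources, deadline_s):
    """Latency guarantee: best-quality response among those arriving in time."""
    arrived = [s for s in sources if s.latency_s <= deadline_s]
    return max(arrived, key=lambda s: s.quality) if arrived else None


sources = [
    Source("fast", price=0.0, latency_s=0.1, quality=0.5),
    Source("premium", price=2.0, latency_s=0.8, quality=0.95),
    Source("slow", price=0.5, latency_s=3.0, quality=0.99),
]
print([s.name for s in choose_sources(sources, user_credits=0.0, budget=1.0)])
print([s.name for s in choose_sources(sources, user_credits=2.0, budget=1.0)])
print(answer_by_deadline(sources, deadline_s=1.0).name)
```

With these made-up numbers, a user with credits reaches the higher-quality paid source while a user without credits does not, which is the effect the abstract attributes to including credits in the cost function.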
-
Publication number: US12125484B2
Publication date: 2024-10-22
Application number: US17562891
Filing date: 2021-12-27
Applicant: SoundHound, Inc.
Inventor: Scott Halstvedt, Keyvan Mohajer, Bernard Mont-Reynaud
IPC: G10L15/22, G06F3/16, G06F21/32, G06V40/16, G10L15/08, G10L17/00, G10L17/04, G10L17/06, G10L17/22
CPC classification number: G10L15/22, G06F3/167, G06F21/32, G10L15/08, G10L17/04, G10L17/06, G10L17/22, G06V40/16, G06V40/166, G10L2015/088, G10L2015/223, G10L17/00
Abstract: A method of controlling an engagement state of an agent during a human-machine dialog is provided. The method can include receiving a spoken request that is a conditional locking request, wherein the conditional locking request uses a natural language expression to explicitly specify a locking condition, which is a predicate, storing the predicate in a format that can be evaluated when needed by the agent, entering a conditionally locked state in response to the conditional locking request, in the conditionally locked state, receiving a multiplicity of requests without a need for a wakeup indicator, and for a request from the multiplicity of requests evaluating the predicate upon receiving the request, and processing the request if the predicate is true.
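The conditional locking flow in this abstract can be sketched as follows. The example assumes the natural-language locking condition has already been converted into a Python callable; that conversion is outside the sketch. `Request`, `Agent`, and the speaker-based predicate are hypothetical names, and ignoring a request whose predicate is false is also an assumption, since the abstract only says what happens when the predicate is true.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Request:
    speaker: str
    text: str
    has_wakeword: bool = False


class Agent:
    def __init__(self):
        # The stored predicate; None means the agent is not conditionally locked.
        self.lock_predicate: Optional[Callable[[Request], bool]] = None
        self.processed = []

    def lock(self, predicate):
        """Enter the conditionally locked state with a stored predicate."""
        self.lock_predicate = predicate

    def unlock(self):
        self.lock_predicate = None

    def receive(self, req):
        """Process the request if allowed; return True when it was processed."""
        if self.lock_predicate is not None:
            # Locked: no wakeup indicator needed, the predicate decides.
            accepted = self.lock_predicate(req)
        else:
            # Not locked: fall back to requiring a wakeup indicator.
            accepted = req.has_wakeword
        if accepted:
            self.processed.append(req.text)
        return accepted


agent = Agent()
# Stand-in for a spoken request such as "listen only to me", already parsed.
agent.lock(lambda r: r.speaker == "alice")
print(agent.receive(Request("alice", "play music")))
print(agent.receive(Request("bob", "stop")))
```

Storing the condition as a callable keeps evaluation cheap at request time, which matches the abstract's requirement that the predicate be stored "in a format that can be evaluated when needed".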
-
Publication number: US10699713B2
Publication date: 2020-06-30
Application number: US16388526
Filing date: 2019-04-18
Applicant: SoundHound, Inc.
Inventor: Scott Halstvedt, Bernard Mont-Reynaud, Kazi Asif Wadud
Abstract: A server receives a user audio stream, the stream comprising multiple utterances. A query-processing module of the server continuously listens to and processes the utterances. The processing includes parsing successive utterances and recognizing corresponding queries, taking appropriate actions while the utterances are being received. In some embodiments, a query may be parsed and executed before the previous query's execution is complete.
-
Publication number: US20200013094A1
Publication date: 2020-01-09
Application number: US16572179
Filing date: 2019-09-16
Applicant: SoundHound, Inc.
Inventor: Keyvan Mohajer, Scott Halstvedt
Abstract: Original concepts obtained from a query may be augmented with additional concepts connected to the original concepts in a concept graph in response to determining that the original concepts did not match a sufficient number of bid functions. The augmented set of concepts may then be evaluated with respect to the bid functions to identify matching ad functions. This process may be repeated until a sufficient number of matching ad functions are found. A bid amount of the matching bid functions may be calculated, such as based on semantic information obtained as a result of the query. The bid amounts may further be based on environmental information. A bid function is selected based on the bid amounts and the content associated with the bid function is provided to the source of the query. The content may be selected based on the semantic information.
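The repeated concept augmentation described in this abstract might look like the sketch below. It assumes the concept graph is an adjacency mapping and that a bid function "matches" when its trigger concepts overlap the current concept set; both representations, along with `matching_bids`, `expand_until_matched`, `min_matches`, and `max_rounds`, are illustrative assumptions. Bid amount calculation and selection are left out.

```python
def matching_bids(concepts, bid_functions):
    """Names of bid functions whose trigger concepts overlap the concept set."""
    return [name for name, triggers in bid_functions.items() if triggers & concepts]


def expand_until_matched(concepts, graph, bid_functions, min_matches=1, max_rounds=3):
    """Augment concepts with graph neighbors until enough bid functions match.

    Returns (final concept set, matching bid function names).
    """
    concepts = set(concepts)
    for _ in range(max_rounds):
        matches = matching_bids(concepts, bid_functions)
        if len(matches) >= min_matches:
            return concepts, matches
        # Collect concepts one edge away that are not already in the set.
        neighbors = {n for c in concepts for n in graph.get(c, ())} - concepts
        if not neighbors:
            break  # nothing left to add
        concepts |= neighbors
    return concepts, matching_bids(concepts, bid_functions)


graph = {"espresso": ["coffee"], "coffee": ["cafe", "breakfast"]}
bid_functions = {"cafe_ad": {"cafe"}, "tea_ad": {"tea"}}
concepts, matches = expand_until_matched({"espresso"}, graph, bid_functions)
print(sorted(concepts), matches)
```

The `max_rounds` cap is a design choice of this sketch to keep expansion from drifting to loosely related concepts; the abstract itself only says the process may repeat until enough matches are found.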
-
Publication number: US20220122607A1
Publication date: 2022-04-21
Application number: US17562891
Filing date: 2021-12-27
Applicant: SoundHound, Inc.
Inventor: Scott Halstvedt, Keyvan Mohajer, Bernard Mont-Reynaud
Abstract: A method of controlling an engagement state of an agent during a human-machine dialog is provided. The method can include receiving a spoken request that is a conditional locking request, wherein the conditional locking request uses a natural language expression to explicitly specify a locking condition, which is a predicate, storing the predicate in a format that can be evaluated when needed by the agent, entering a conditionally locked state in response to the conditional locking request, in the conditionally locked state, receiving a multiplicity of requests without a need for a wakeup indicator, and for a request from the multiplicity of requests evaluating the predicate upon receiving the request, and processing the request if the predicate is true.
-
Publication number: US11250844B2
Publication date: 2022-02-15
Application number: US15881553
Filing date: 2018-01-26
Applicant: SoundHound, Inc.
Inventor: Bernard Mont-Reynaud, Scott Halstvedt, Keyvan Mohajer
IPC: G10L15/22, G10L17/22, G10L17/04, G10L15/08, G10L17/06, G06F3/16, G06F21/32, G06K9/00, G10L17/00
Abstract: Agents engage and disengage with users intelligently. Users can tell agents to remain engaged without requiring a wakeword. Engaged states can support modal dialogs and barge-in. Users can cause disengagement explicitly. Disengagement can be conditional based on timeout, change of user, or environmental conditions. Engagement can be one-time or recurrent. Recurrent states can be attentive or locked. Locked states can be unconditional or conditional, including being reserved to support user continuity. User continuity can be tested by matching parameters or tracking user by many modalities including microphone arrays, cameras, and other sensors.
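One way to picture the engagement states named in this abstract is a small state machine. The sketch below is an interpretation, not the patent's design: it assumes a wakeword gives one-time engagement, the hypothetical commands "stay engaged", "lock", and "disengage" switch states, and the timeout applies only to the attentive state. Timestamps are passed in explicitly so the behavior is easy to follow.

```python
from enum import Enum


class State(Enum):
    DORMANT = "dormant"
    ONE_TIME = "one_time"
    ATTENTIVE = "attentive"  # recurrent, ends on timeout (assumed)
    LOCKED = "locked"        # recurrent, ends only on explicit disengagement (assumed)


class EngagementAgent:
    def __init__(self, attentive_timeout_s=10.0):
        self.state = State.DORMANT
        self.timeout = attentive_timeout_s
        self.last_request_at = None

    def handle(self, command, now, wakeword=False):
        """Handle one request at time `now`; return True if it was processed."""
        # Conditional disengagement: the attentive state expires after a timeout.
        if self.state is State.ATTENTIVE and now - self.last_request_at > self.timeout:
            self.state = State.DORMANT
        if self.state is State.DORMANT:
            if not wakeword:
                return False  # not engaged and no wakeword: ignore
            self.state = State.ONE_TIME
        self.last_request_at = now
        if command == "stay engaged":
            self.state = State.ATTENTIVE
        elif command == "lock":
            self.state = State.LOCKED
        elif command == "disengage":
            self.state = State.DORMANT
        elif self.state is State.ONE_TIME:
            self.state = State.DORMANT  # one-time engagement is used up
        return True


agent = EngagementAgent(attentive_timeout_s=10.0)
agent.handle("stay engaged", now=0.0, wakeword=True)
print(agent.handle("what's the news", now=5.0))   # within timeout, no wakeword needed
print(agent.handle("what's the news", now=20.0))  # timed out, ignored
```

User-continuity checks (matching voice parameters or tracking with cameras and microphone arrays) would slot in as extra conditions before a request is accepted; they are omitted here.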
-
Publication number: US10585891B2
Publication date: 2020-03-10
Application number: US15342970
Filing date: 2016-11-03
Applicant: SoundHound, Inc.
Inventor: Scott Halstvedt
IPC: G06F16/2453, G06F16/33
Abstract: A virtual assistant receives natural language interpretation hypotheses for user queries, determines entities and attributes from the interpretations, and requests data from appropriate data sources. A cost function estimates the cost of each data source request. Cost functions include factors such as contract pricing, access latency, and data quality. Based on the estimated cost, the virtual assistant sends requests to a plurality of data sources, each of which might be able to provide data necessary to answer the user query. By including user credits in the cost function, the virtual assistant provides better quality of results and answer latency for paying users. The virtual assistant minimizes latency by answering using data from the first responding data source or provides a latency guarantee by answering with the most accurate data received by a deadline. The virtual assistant measures data source response latency and caches responses for expensive requests.
-
Publication number: US20180301151A1
Publication date: 2018-10-18
Application number: US15881553
Filing date: 2018-01-26
Applicant: SoundHound, Inc.
Inventor: Bernard Mont-Reynaud, Scott Halstvedt, Keyvan Mohajer
Abstract: Agents engage and disengage with users intelligently. Users can tell agents to remain engaged without requiring a wakeword. Engaged states can support modal dialogs and barge-in. Users can cause disengagement explicitly. Disengagement can be conditional based on timeout, change of user, or environmental conditions. Engagement can be one-time or recurrent. Recurrent states can be attentive or locked. Locked states can be unconditional or conditional, including being reserved to support user continuity. User continuity can be tested by matching parameters or tracking user by many modalities including microphone arrays, cameras, and other sensors.
-
Publication number: US20180108050A1
Publication date: 2018-04-19
Application number: US15293931
Filing date: 2016-10-14
Applicant: SoundHound, Inc.
Inventor: Scott Halstvedt, Keyvan Mohajer
CPC classification number: G06Q30/0275, G06F17/2775, G06F17/2785
Abstract: An ad processor evaluates bid functions that are based on concepts that might be generated from interpretations of natural language expressions. Ad buyers provide the functions with corresponding ads to ad processors. Bid functions are further based on the values of semantic information referenced by expressions. Bid functions are further based on environmental information. Ad buyers are able to modify bid functions. Ads may be provided in the form of questions, and may be indicated by an identifying sound. Upon finding no expression concepts within a bid function, the set of expression concepts is expanded according to strengths of connections between concepts in a concept graph.