-
公开(公告)号:US20190163743A1
公开(公告)日:2019-05-30
申请号:US16244039
申请日:2019-01-09
Applicant: SoundHound, Inc.
Inventor: Kheng KHOV , Pranav SINGH , Bernard MONT-REYNAUD , Jonah PROBELL
IPC: G06F17/27 , G06F16/00 , G06F16/9537 , G06F16/29 , G06Q30/02
Abstract: A method of determining a count of occurrences of concepts within regions is provided. The method includes receiving natural language expressions, each expression being uttered by a person located at a different geolocation, receiving geolocation information of each person having uttered the natural language expressions and associating the geolocation information of each person with a corresponding natural language expression and for each natural language expression: parsing the natural language expression to create an interpretation, deriving concepts, and recording, in a database, concepts, geolocation, and associations of the concepts and geolocations; and accumulating, for each region, a count of occurrences of each concept having an associated geolocation within the region.
-
公开(公告)号:US20220189464A1
公开(公告)日:2022-06-16
申请号:US17653365
申请日:2022-03-03
Applicant: SoundHound, Inc.
Inventor: Sudharsan KRISHNASWAMY , Maisy WIEMAN , Jonah PROBELL
IPC: G10L15/06 , G10L15/16 , G10L15/18 , G10L13/02 , G10L15/197 , G10L15/22 , G10L15/187
Abstract: A system and method invoke virtual assistant action, which may comprise an argument. From audio, a probability of an intent is inferred. A probability of a domain and a plurality of variable values may also be inferred. Invoking the action is in response to the intent probability exceeding a threshold. Invoking the action may also be in response to the domain probability exceeding a threshold, a variable value probability exceeding a threshold, detecting an end of utterance, and a specific amount of time having elapsed. The intent probability may increase when the audio includes speech of words with the same meaning in multiple natural languages. Invoking the action may also be conditional on the variable value exceeding its threshold within a certain period of time of the intent probability exceeding its threshold.
-
公开(公告)号:US20200219513A1
公开(公告)日:2020-07-09
申请号:US16824308
申请日:2020-03-19
Applicant: SoundHound, Inc.
Inventor: Patricia Pozon AGUAYO , Jennifer Hee Young ZHANG , Jonah PROBELL
Abstract: A processing system detects a period of non-voice activity and compares its duration to a cutoff period. The system adapts the cutoff period based on parsing previously-recognized speech to determine, according to a model, such as a machine-learned model, the probability that the speech recognized so far is a prefix to a longer complete utterance. The cutoff period is longer when a parse of previously recognized speech has a high probability of being a prefix of a longer utterance.
-
公开(公告)号:US20190198012A1
公开(公告)日:2019-06-27
申请号:US15855908
申请日:2017-12-27
Applicant: SoundHound, Inc.
Inventor: Jennifer Hee Young ZHANG , Patricia Pozon AGUAYO , Jonah PROBELL
CPC classification number: G10L15/05 , G10L15/1822 , G10L15/22 , G10L15/30 , G10L25/78 , G10L2015/088
Abstract: A speech-based human-machine interface that parses words spoken to detect a complete parse and, responsive to so detecting, computes a hypothesis as to whether the words are a prefix to another complete parse. The duration of no voice activity period to determine an end of a sentence depends on the prefix hypothesis. The user's typical speech speed profile and a short-term measure of speech speed also scale the period. Speech speed is measured by the time between words, and the period scaling uses a continuously adaptive algorithm. The system uses a longer cut-off period after a system wake-up event but before it detects any voice activity.
-
公开(公告)号:US20220075956A1
公开(公告)日:2022-03-10
申请号:US17527154
申请日:2021-11-15
Applicant: SoundHound, Inc.
Inventor: Bernard MONT-REYNAUD , Jonah PROBELL , Pranav SINGH , Kheng KHOV
IPC: G06F40/30 , G06Q30/02 , G06F16/00 , G06F16/29 , G06F16/9537 , G06F40/289
Abstract: A method of providing relevant messages to an automotive virtual assistant is provided. The method includes receiving a spoken utterance and corresponding first geolocation information detected by a subsystem of a first automobile, parsing the spoken utterance to determine concepts and storing the concepts in a concept database indexed by the corresponding first geolocation information. The method further includes receiving second geolocation information detected by a subsystem of a second automobile, searching the concept database for an index based on the second geolocation information to find a stored concept of the stored concepts, searching a natural language expression database using the stored concept as an index to find an assistive natural language expression, wherein the assistive natural language expression includes a constituent part, and sending the assistive natural language expression to the second automobile with the stored concept in place of the constituent part.
-
公开(公告)号:US20240135927A1
公开(公告)日:2024-04-25
申请号:US18401770
申请日:2024-01-02
Applicant: SoundHound, Inc.
Inventor: Patricia Pozon AGUAYO , Jennifer Hee Young ZHANG , Jonah PROBELL
CPC classification number: G10L15/22 , G10L15/05 , G10L15/1822 , G10L25/78 , G10L2015/088
Abstract: A system detects a period of non-voice activity and compares its duration to a cutoff period. The system adapts the cutoff period based on parsing previously-recognized speech of a user that is stored on a user's device or the system, which detects the voice activity, to determine according to a model, such as a machine-learned model, the probability that the speech recognized so far is a prefix to a longer complete utterance. The cutoff period is longer when a parse of previously recognized speech, which is based on the user profile, has a high probability of being a prefix of a longer utterance.
-
公开(公告)号:US20220405797A1
公开(公告)日:2022-12-22
申请号:US17820858
申请日:2022-08-18
Applicant: SoundHound, Inc.
Inventor: Jonah PROBELL
Abstract: Ads are generated based on product info and consumer profiles. A discriminator evaluates probabilities of ads being effective at causing consumer engagement. A decoder extracts product info from generated ads. Based on the probabilities of ads being effective and similarity of extracted and source product info, generated ads are labeled as examples. The examples are used in training an improved ad generator. Ads may be visual and/or audio containing speech. Ads may even contain humor, as recognized by mismatches between source and decoded product info.
-
公开(公告)号:US20220208192A1
公开(公告)日:2022-06-30
申请号:US17698623
申请日:2022-03-18
Applicant: SoundHound, Inc.
Inventor: Patricia Pozon AGUAYO , Jennifer Hee Young ZHANG , Jonah PROBELL
Abstract: A processing system detects a period of non-voice activity and compares its duration to a cutoff period. The system adapts the cutoff period based on parsing previously-recognized speech to determine, according to a model, such as a machine-learned model, the probability that the speech recognized so far is a prefix to a longer complete utterance. The cutoff period is longer when a parse of previously recognized speech has a high probability of being a prefix of a longer utterance.
-
公开(公告)号:US20210182661A1
公开(公告)日:2021-06-17
申请号:US16716497
申请日:2019-12-17
Applicant: SoundHound, Inc.
Inventor: Zili LI , Asif AMIRGULIYEV , Jonah PROBELL
Abstract: Training and enhancement of neural network models, such as from private data, are described. A slave device receives a version of a neural network model from a master. The slave accesses a local and/or private data source and uses the data to perform optimization of the neural network model. This can be done such as by computing gradients or performing knowledge distillation to locally train an enhanced second version of the model. The slave sends the gradients or enhanced neural network model to a master. The master may use the gradient or second version of the model to improve a master model.
-
公开(公告)号:US20190138602A1
公开(公告)日:2019-05-09
申请号:US16238445
申请日:2019-01-02
Applicant: SoundHound, Inc.
Inventor: Kheng KHOV , Pranav SINGH , Bernard MONT-REYNAUD , Jonah PROBELL
IPC: G06F17/27 , G06F16/9537 , G06F16/29 , G06Q30/02 , G06F16/00
Abstract: A method of predicting a person's interests is provided. The method includes receiving geolocation information about a user location, reading, from a database of interpretations, at least one interpretation of an expression made in close proximity to the location, reading, from a database of ad bids, a plurality of ad bids comprising interpretations, comparing the interpretation from the database to the interpretations of the ad bids to select a most valuable ad bid having an interpretation that matches the interpretation of an expression made in close proximity to the location, and presenting an ad associated with the most valuable ad bid, wherein the interpretation is from a natural language expression.
-
-
-
-
-
-
-
-
-