-
Publication No.: US12175964B2
Publication Date: 2024-12-24
Application No.: US17325114
Application Date: 2021-05-19
Applicant: SoundHound, Inc.
Inventor: Kiran Garaga Lokeswarappa, Joel Gedalius, Bernard Mont-Reynaud, Jun Huang
IPC: G10L15/00, G06F40/205, G06F40/211, G06F40/253, G06N20/00, G06Q30/0241, G06Q30/0251, G10L15/02, G10L15/06, G10L15/18, G10L25/90, H04L67/306, G10L15/22, G10L15/26, G10L25/51
Abstract: A computer-implemented method is provided. The method includes receiving speech audio of dictation associated with a user ID, deriving acoustic features from the speech audio, storing the derived acoustic features in a user profile associated with the user ID, receiving a request for acoustic features through an application programming interface (API), the request including the user ID, and sending the derived acoustic features through the API.
-
Publication No.: US20220223155A1
Publication Date: 2022-07-14
Application No.: US17709131
Application Date: 2022-03-30
Applicant: SoundHound, Inc.
Inventor: Bernard Mont-Reynaud
Abstract: A system and method are disclosed capable of parsing a spoken utterance into a natural language request and a speech audio segment, where the natural language request directs the system to use the speech audio segment as a new wakeword. In response to this wakeword assignment directive, the system and method are further capable of immediately building a new wakeword spotter to activate the device upon matching the new wakeword in the input audio. Different approaches to promptly building a new wakeword spotter are described. Variations of wakeword assignment directives can make the new wakeword public or private. They can also add the new wakeword to earlier wakewords, or replace earlier wakewords.
-
Publication No.: US11144731B2
Publication Date: 2021-10-12
Application No.: US16128227
Application Date: 2018-09-11
Applicant: SoundHound, Inc.
Inventor: Pranav Singh, Keyvan Mohajer, Kamyar Mohajer, Bernard Mont-Reynaud
IPC: G06F40/40, G10L15/30, G06Q30/02, G06Q20/10, G06F40/211
Abstract: A platform enables developers of applications with natural language interfaces, such as devices, to configure the availability of vertical domain modules in those applications. Modules can include grammars for parsing natural language expressions and interfaces to data sources. Third-party developers can create modules with pricing models for their usage or for access to their data. Device developers can browse or search available modules and test their performance on specific queries. The platform lets device users access the chosen modules as configured by device developers, and it handles charging and payment between users, application developers, and module developers.
-
Publication No.: US11138205B1
Publication Date: 2021-10-05
Application No.: US16292190
Application Date: 2019-03-04
Applicant: SoundHound, Inc.
Inventor: Keyvan Mohajer, Bernard Mont-Reynaud, Philipp Hubert
IPC: G06F16/00, G06F16/2457, G06F16/2455, G06F40/40
Abstract: A query-processing server provides natural language services to applications. More specifically, the query-processing server receives and stores domain knowledge information from application developers, the domain knowledge information comprising a linguistic description of the natural language user queries that application developers wish their applications to support. A first portion of the domain knowledge information is applied to transform a natural language query received from an application to an ordered sequence of question elements. A second portion of the domain knowledge information is applied to group the ordered sequence of question elements into a plurality of distinct structured questions posed by the natural language query. The distinct structured questions may then be provided to the application, which may then execute them and obtain the corresponding data referenced by the questions.
-
Publication No.: US20210224043A1
Publication Date: 2021-07-22
Application No.: US17225997
Application Date: 2021-04-08
Applicant: SoundHound, Inc.
Inventor: Bernard Mont-Reynaud, Seyed M. Emami, Chris Wilson, Keyvan Mohajer
Abstract: A method of building a natural language understanding application is provided. The method includes receiving at least one electronic record containing programming code and creating executable code from the programming code. The executable code, when executed by a processor, causes the processor to create a parse and an interpretation of a sequence of input tokens. The programming code includes an interpret-block, and the interpret-block includes an interpret-statement. The interpret-statement includes a pattern expression and an action statement.
-
Publication No.: US10996931B1
Publication Date: 2021-05-04
Application No.: US16209854
Application Date: 2018-12-04
Applicant: SoundHound, Inc.
Inventor: Keyvan Mohajer, Seyed M. Emami, Chris Wilson, Bernard Mont-Reynaud
Abstract: The technology disclosed relates to authoring of vertical applications of natural language understanding (NLU), which analyze text or utterances and construct their meaning. In particular, it relates to new programming constructs and tools and data structures implementing those new applications.
-
Publication No.: US10418032B1
Publication Date: 2019-09-17
Application No.: US15163485
Application Date: 2016-05-24
Applicant: SoundHound, Inc.
Inventor: Keyvan Mohajer, Christopher Wilson, Bernard Mont-Reynaud, Regina Collecchia
IPC: G10L15/22, G06F17/30, G10L15/19, G06F16/25, G06F16/2452
Abstract: A dialog with a conversational virtual assistant includes a sequence of user queries and system responses. Queries are received and interpreted by a natural language understanding system. Dialog context information gathered from user queries and system responses is stored in a layered context data structure. Incomplete queries, which do not have sufficient information to result in an actionable interpretation, become actionable with use of context data. The system recognizes the need to access context data, and retrieves from context layers the information required to transform the query into an executable one. The system may then act on the query and provide an appropriate response to the user. Context data buffers forget information, perhaps selectively, with the passage of time, and after a sufficient number and type of intervening queries.
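Illustrative sketch (not taken from the patent): the layered context idea in this abstract can be approximated as a stack of per-turn slot dictionaries that are searched from newest to oldest, with entries ignored once they are too old in time or in intervening turns. The class names, the slot representation, and the thresholds `max_age_s` and `max_turns` are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class ContextEntry:
    value: str
    timestamp: float  # seconds, when the entry was stored
    turn: int         # dialog turn at which the entry was stored


@dataclass
class LayeredContext:
    max_age_s: float = 300.0  # assumed time-based forgetting threshold
    max_turns: int = 5        # assumed turn-based forgetting threshold
    layers: list = field(default_factory=list)  # newest layer last
    turn: int = 0

    def push_layer(self, slots: dict, now: float) -> None:
        """Store the slots gathered from one query/response pair as a new layer."""
        self.turn += 1
        self.layers.append(
            {name: ContextEntry(value, now, self.turn) for name, value in slots.items()}
        )

    def lookup(self, slot: str, now: float) -> Optional[str]:
        """Search layers from newest to oldest, skipping forgotten entries."""
        for layer in reversed(self.layers):
            entry = layer.get(slot)
            if entry is None:
                continue
            too_old = now - entry.timestamp > self.max_age_s
            too_many_turns = self.turn - entry.turn >= self.max_turns
            if not (too_old or too_many_turns):
                return entry.value
        return None

    def complete(self, query: dict, required: list, now: float) -> Optional[dict]:
        """Fill missing required slots from context; None if still not actionable."""
        filled = dict(query)
        for slot in required:
            if slot not in filled:
                value = self.lookup(slot, now)
                if value is None:
                    return None
                filled[slot] = value
        return filled
```

For example, after a weather query about Paris is stored with `push_layer`, a follow-up query that carries only a date can be completed by `complete` with the remembered intent and city, while the same follow-up arriving after `max_age_s` seconds stays incomplete.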
-
Publication No.: US20180329991A1
Publication Date: 2018-11-15
Application No.: US16044331
Application Date: 2018-07-24
Applicant: SoundHound, Inc.
Inventor: Aaron Master, Bernard Mont-Reynaud, Keyvan Mohajer, Timothy Stonehocker
IPC: G06F17/30
CPC classification number: G06F17/30743, G06F17/30026, G06F17/30749, G06F17/30772
Abstract: The present invention relates to providing identification information in response to an audio segment. A first mode of operation includes receiving an audio segment, sending the audio segment to a remote server, and receiving, from the remote server, identification information relating to the audio segment. A second mode of operation includes receiving an audio segment and using stored information to obtain identification information relating to the received audio segment, without sending the audio segment to the remote server. The present invention further includes using identification information from the remote server and local identification information, selecting between the two based on selection criteria, and generating an output based on the selected identification information.
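Illustrative sketch (not taken from the patent): the two modes and the selection step described in this abstract could look roughly like the following. The abstract does not say what the selection criteria are, so picking the highest-confidence candidate above a threshold is an assumption, as are the `Identification` fields, the function name `identify`, and the two lookup callables.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Identification:
    title: str
    confidence: float  # assumed score in [0, 1]
    source: str        # "local" or "remote"


# Hypothetical lookup signature: audio bytes in, optional result out.
Lookup = Callable[[bytes], Optional[Identification]]


def identify(
    audio: bytes,
    local_lookup: Lookup,
    remote_lookup: Optional[Lookup] = None,
    min_confidence: float = 0.5,
) -> Optional[Identification]:
    """Return the selected identification, or None if nothing qualifies.

    With remote_lookup=None the audio never leaves the device (local-only mode).
    Otherwise both results are gathered and one is selected.
    """
    candidates = []
    local = local_lookup(audio)
    if local is not None:
        candidates.append(local)
    if remote_lookup is not None:
        remote = remote_lookup(audio)
        if remote is not None:
            candidates.append(remote)
    # Assumed selection criterion: highest confidence above a threshold.
    candidates = [c for c in candidates if c.confidence >= min_confidence]
    if not candidates:
        return None
    return max(candidates, key=lambda c: c.confidence)
```

Passing only `local_lookup` corresponds to the mode that avoids sending audio to the server; passing both lets the caller compare the two answers before generating output.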
-
Publication No.: US20180182398A1
Publication Date: 2018-06-28
Application No.: US15389122
Application Date: 2016-12-22
Applicant: SoundHound, Inc.
Inventor: Scott Halstvedt, Bernard Mont-Reynaud, Kazi Asif Wadud
CPC classification number: G10L15/30, G06F17/279, G10L15/22, G10L15/222, G10L2015/223
Abstract: A query-processing system processes an input audio stream that represents a succession of queries spoken by a user. The query-processing system listens continuously to the input audio stream, parses queries and takes appropriate actions in mid-stream. In some embodiments, the system processes queries in parallel, limited by serial constraints. In some embodiments, the system parses and executes queries while a previous query's execution is still in progress. To accommodate users who tend to speak slowly and express a thought in separate parts, the query-processing system halts the outputting of results corresponding to a previous query if it detects that a new speech utterance modifies the meaning of the previous query.
-
Publication No.: US11776533B2
Publication Date: 2023-10-03
Application No.: US17225997
Application Date: 2021-04-08
Applicant: SoundHound, Inc.
Inventor: Bernard Mont-Reynaud, Seyed M. Emami, Chris Wilson, Keyvan Mohajer
CPC classification number: G10L15/18, G06F8/31, G06F40/205, G10L15/06, G10L15/22, H04M3/4938
Abstract: A method of building a natural language understanding application is provided. The method includes receiving at least one electronic record containing programming code and creating executable code from the programming code. The executable code, when executed by a processor, causes the processor to create a parse and an interpretation of a sequence of input tokens. The programming code includes an interpret-block, and the interpret-block includes an interpret-statement. The interpret-statement includes a pattern expression and an action statement.