-
Publication number: US20230386458A1
Publication date: 2023-11-30
Application number: US17804544
Application date: 2022-05-27
Applicant: SoundHound, Inc.
Inventor: Karl STAHL, Bernard MONT-REYNAUD
CPC classification number: G10L15/22, G10L15/08, G10L25/93, G10L2015/088
Abstract: Methods and systems for pre-wakeword speech processing are disclosed. Speech audio, comprising command speech spoken before a wakeword, may be stored in a buffer in oldest to newest order. Upon detection of the wakeword, reverse acoustic models and language models, such as reverse automatic speech recognition (R-ASR) can be applied to the buffered audio, in newest to oldest order, starting from before the wakeword. The speech is converted into a sequence of words. Natural language grammar models, such as natural language understanding (NLU), can be applied to match the sequence of words to a complete command, the complete command being associated with invoking a computer operation.
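The buffering order described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the class name `PreWakewordBuffer`, its methods, and the `wakeword_frames` parameter are hypothetical, and the reverse acoustic and language models themselves are out of scope. It only shows frames being stored oldest to newest and then read newest to oldest, starting just before the wakeword.

```python
from collections import deque


class PreWakewordBuffer:
    """Hypothetical fixed-size buffer of audio frames, kept oldest to newest."""

    def __init__(self, max_frames):
        # A bounded deque drops the oldest frame once max_frames is reached.
        self._frames = deque(maxlen=max_frames)

    def push(self, frame):
        """Append the newest frame at the end of the buffer."""
        self._frames.append(frame)

    def frames_before_wakeword(self, wakeword_frames):
        """Yield frames newest to oldest, skipping the wakeword itself.

        wakeword_frames is the assumed number of trailing frames that the
        detected wakeword occupies at the end of the buffer.
        """
        frames = list(self._frames)
        end = len(frames) - wakeword_frames
        for i in range(end - 1, -1, -1):
            yield frames[i]
```

A reverse recognizer would consume the frames yielded by `frames_before_wakeword` in that order; here plain integers can stand in for audio frames.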
-
Publication number: US20220383869A1
Publication date: 2022-12-01
Application number: US17332927
Application date: 2021-05-27
Applicant: SoundHound, Inc.
Inventor: Utku YABAS, Philipp HUBERT, Karl STAHL
IPC: G10L15/22, G10L15/26, G06F40/211, G06F40/284, G10L15/183
Abstract: A user specifies a natural language command to a device. Software on the device generates contextual metadata about the user interface of the device, such as data about all visible elements of the user interface, and sends the contextual metadata along with the natural language command to a natural language understanding engine. The natural language understanding engine parses the natural language query using a stored grammar (e.g., a grammar provided by a maker of the device) and as a result of the parsing identifies information about the command (e.g., the user interface elements referenced by the command) and provides that information to the device. The device uses that provided information to respond to the command.
-
Publication number: US20220076678A1
Publication date: 2022-03-10
Application number: US17531371
Application date: 2021-11-19
Applicant: SoundHound, Inc.
Inventor: Irina A. SPIRIDONOVA, Karl STAHL, Mara SELVAGGI
Abstract: A computer-implemented method is provided. The method includes receiving commands to store memos, identifying subjects related to the memos, storing, in a database, the memos, their related subjects, and associated time information, receiving a natural language request to retrieve a memo, the request having query information, identifying a subject related to the request, responsive to the request, querying the database for memos related to the subject, identifying multiple memos in response to the database query, identifying a memo, from the multiple identified memos, that has the most recent associated time information and providing a response in dependence on the identified memo.
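The store-and-retrieve flow in this abstract can be sketched as follows. This is an illustrative sketch only: `Memo`, `MemoStore`, and the keyword-based subject identification are assumptions made for the example, an in-memory list stands in for the database, and the abstract does not say how subjects are actually identified.

```python
from dataclasses import dataclass


@dataclass
class Memo:
    text: str
    subjects: frozenset
    timestamp: float


class MemoStore:
    """Hypothetical in-memory stand-in for the memo database."""

    def __init__(self, known_subjects):
        self._known = {s.lower() for s in known_subjects}
        self._memos = []

    def _subjects_in(self, text):
        # Assumed subject identification: match words against a known list.
        words = {w.strip(".,?!").lower() for w in text.split()}
        return frozenset(words & self._known)

    def store(self, text, timestamp):
        """Store a memo with its related subjects and time information."""
        self._memos.append(Memo(text, self._subjects_in(text), timestamp))

    def retrieve(self, request):
        """Return the most recent memo sharing a subject with the request."""
        wanted = self._subjects_in(request)
        matches = [m for m in self._memos if m.subjects & wanted]
        if not matches:
            return None
        return max(matches, key=lambda m: m.timestamp)
```

When several memos share the requested subject, `retrieve` picks the one with the latest timestamp, which mirrors the "most recent associated time information" step.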
-
Publication number: US20200184958A1
Publication date: 2020-06-11
Application number: US16212695
Application date: 2018-12-07
Applicant: SoundHound, Inc.
Inventor: Katayoun NOROUZI, Karl STAHL
Abstract: A system and method are disclosed for capturing a segment of speech audio, performing phoneme recognition on the segment of speech audio to produce a segmented phoneme sequence, comparing the segmented phoneme sequence to stored phoneme sequences that represent incorrect pronunciations of words to determine if there is a match, and identifying an incorrect pronunciation for a word in the segment of speech audio. The system builds a library based on the data collected for the incorrect pronunciations.
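The comparison step in this abstract can be sketched as a lookup of known incorrect phoneme sequences inside a recognized phoneme sequence. The function name, the dictionary layout of the library, and the use of exact matching are assumptions for illustration; the abstract does not specify the matching method, and the phoneme symbols below are only example labels.

```python
def find_mispronunciations(phonemes, incorrect_library):
    """Return (word, start_index) for each stored incorrect pronunciation
    found in the phoneme sequence.

    incorrect_library is an assumed mapping from a word to a list of phoneme
    sequences that represent incorrect pronunciations of that word.
    """
    hits = []
    for word, variants in incorrect_library.items():
        for variant in variants:
            n = len(variant)
            if n == 0:
                continue  # an empty variant would match everywhere
            # Slide a window of the variant's length over the input.
            for start in range(len(phonemes) - n + 1):
                if tuple(phonemes[start:start + n]) == tuple(variant):
                    hits.append((word, start))
    return hits
```

Each hit identifies the word that was mispronounced and where the matching phonemes begin, which is the kind of data a library of incorrect pronunciations could be built from.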
-