Patent search ap:("SoundHound Page Inc.") AND inv:"Karl STAHL"

1.

发明公开
PRE-WAKEWORD SPEECH PROCESSING 审中-公开

公开(公告)号：US20230386458A1

公开(公告)日：2023-11-30

申请号：US17804544

申请日：2022-05-27

Applicant: SoundHound, Inc.

Inventor： Karl STAHL , Bernard MONT-REYNAUD

IPC: G10L15/22 , G10L15/08 , G10L25/93

CPC classification number: G10L15/22 , G10L15/08 , G10L25/93 , G10L2015/088

Abstract: Methods and systems for pre-wakeword speech processing are disclosed. Speech audio, comprising command speech spoken before a wakeword, may be stored in a buffer in oldest to newest order. Upon detection of the wakeword, reverse acoustic models and language models, such as reverse automatic speech recognition (R-ASR) can be applied to the buffered audio, in newest to oldest order, starting from before the wakeword. The speech is converted into a sequence of words. Natural language grammar models, such as natural language understanding (NLU), can be applied to match the sequence of words to a complete command, the complete command being associated with invoking a computer operation.

2.

发明申请
ENABLING NATURAL LANGUAGE INTERACTIONS WITH USER INTERFACES FOR USERS OF A SOFTWARE APPLICATION 有权

公开(公告)号：US20220383869A1

公开(公告)日：2022-12-01

申请号：US17332927

申请日：2021-05-27

Applicant: SoundHound, Inc.

Inventor： Utku YABAS , Philipp HUBERT , Karl STAHL

IPC: G10L15/22 , G10L15/26 , G06F40/211 , G06F40/284 , G10L15/183

Abstract: A user specifies a natural language command to a device. Software on the device generates contextual metadata about the user interface of the device, such as data about all visible elements of the user interface, and sends the contextual metadata along with the natural language command to a natural language understanding engine. The natural language understanding engine parses the natural language query using a stored grammar (e.g., a grammar provided by a maker of the device) and as a result of the parsing identifies information about the command (e.g., the user interface elements referenced by the command) and provides that information to the device. The device uses that provided information to respond to the command.

3.

发明申请
RECEIVING A NATURAL LANGUAGE REQUEST AND RETRIEVING A PERSONAL VOICE MEMO 有权

公开(公告)号：US20220076678A1

公开(公告)日：2022-03-10

申请号：US17531371

申请日：2021-11-19

Applicant: SoundHound, Inc.

Inventor： Irina A. SPIRIDONOVA , Karl STAHL , Mara SELVAGGI

IPC: G10L15/22 , G10L15/18 , G10L15/19 , G10L15/30

Abstract: A computer-implemented method is provided. The method includes receiving commands to store memos, identifying subjects related to the memos, storing, in a database, the memos, their related subjects, and associated time information, receiving a natural language request to retrieve a memo, the request having query information, identifying a subject related to the request, responsive to the request, querying the database for memos related to the subject, identifying multiple memos in response to the database query, identifying a memo, from the multiple identified memos, that has the most recent associated time information and providing a response in dependence on the identified memo.

4.

发明申请
SYSTEM AND METHOD FOR DETECTION AND CORRECTION OF INCORRECTLY PRONOUNCED WORDS 审中-公开

公开(公告)号：US20200184958A1

公开(公告)日：2020-06-11

申请号：US16212695

申请日：2018-12-07

Applicant: SoundHound, Inc.

Inventor： Katayoun NOROUZI , Karl STAHL

IPC: G10L15/187 , G10L15/04 , G10L15/22 , G06F3/16 , G09B19/04 , G10L13/00

Abstract: A system and method are disclosed for capturing a segment of speech audio, performing phoneme recognition on the segment of speech audio to produce a segmented phoneme sequence, comparing the segmented phoneme sequence to stored phoneme sequences that represent incorrect pronunciations of words to determine if there is a match, and identifying an incorrect pronunciation for a word in the segment of speech audio. The system builds a library based on the data collected for the incorrect pronunciations.

Patent Agency Ranking