Patent search ap:("SoundHound Page Inc.") AND inv:"Keyvan Mohajer"

21.

发明申请
VIRTUAL ASSISTANT CONFIGURED BY SELECTION OF WAKE-UP PHRASE 审中-公开

公开(公告)号：US20180108343A1

公开(公告)日：2018-04-19

申请号：US15294234

申请日：2016-10-14

Applicant: SoundHound, Inc.

Inventor： Mark Stevans , Monika Almudafar-Depeyrot , Keyvan Mohajer

IPC: G10L13/08 , G10L15/18 , G10L15/22 , G10L15/30 , G10L15/02

CPC classification number: G10L13/08 , G10L13/043 , G10L15/18 , G10L15/22 , G10L15/30 , G10L2015/025 , G10L2015/088 , G10L2015/223

Abstract: A speech-enabled dialog system responds to a plurality of wake-up phrases. Based on which wake-up phrase is detected, the system's configuration is modified accordingly. Various configurable aspects of the system include selection and morphing of a text-to-speech voice; configuration of acoustic model, language model, vocabulary, and grammar; configuration of a graphic animation; configuration of virtual assistant personality parameters; invocation of a particular user profile; invocation of an authentication function; and configuration of an open sound. Configuration depends on a target market segment. Configuration also depends on the state of the dialog system, such as whether a previous utterance was an information query.

22.

发明授权
System and method for performing dual mode speech recognition 有权

公开(公告)号：US09691390B2

公开(公告)日：2017-06-27

申请号：US15085944

申请日：2016-03-30

Applicant: SoundHound, Inc.

Inventor： Timothy Stonehocker , Keyvan Mohajer , Bernard Mont-Reynaud

IPC: G10L15/04 , G10L15/00 , G10L15/30 , G10L15/08 , G10L15/26 , G10L15/06 , G10L17/06 , G10L15/34

CPC classification number: G10L15/30 , G10L15/04 , G10L15/063 , G10L15/08 , G10L15/265 , G10L15/34 , G10L17/06 , G10L2015/0635 , G10L2015/081

Abstract: A system and method is presented for performing dual mode speech recognition, employing a local recognition module on a mobile device and a remote recognition engine on a server device. The system accepts a spoken query from a user, and both the local recognition module and the remote recognition engine perform speech recognition operations on the query, returning a transcription and confidence score, subject to a latency cutoff time. If both sources successfully transcribe the query, then the system accepts the result having the higher confidence score. If only one source succeeds, then that result is accepted. In either case, if the remote recognition engine does succeed in transcribing the query, then a client vocabulary is updated if the remote system result includes information not present in the client vocabulary.

23.

发明申请
Natural Language Module Store 审中-公开

公开(公告)号：US20170154628A1

公开(公告)日：2017-06-01

申请号：US14954810

申请日：2015-11-30

Applicant: SoundHound Inc.

Inventor： Keyvan Mohajer , Kamyar Mohajer , Bernard Mont-Reynaud , Pranav Singh

IPC: G10L15/30 , G06Q30/02 , G06Q20/10 , G06F17/28

CPC classification number: G06F17/28 , G06F17/271 , G06Q30/0283 , G10L15/30

Abstract: The present invention extends to methods, systems, and computer program products for a natural language module store. In general, the invention can be used to manage natural language modules offered through a natural language module store. Natural language module (NLM) developers can post NLMs at a NLM store to make the NLMs available for use by others. Developers can select NLMs for inclusion in natural language interpreters (NLIs) containing (and possibly integrating the functionality of) one or more NLMs. Prior to selecting a NLM, a developer can search or browse NLMs to identify an appropriate NLM. Optionally, a developer can test a NLM in the NLM store prior to inclusion in an NLI. For example, multiple NLMs purporting to provide the same specified natural language functionality can be tested relative to one another prior to selection of one of the NLMs for inclusion in an NLI.

24.

发明授权
System and method for matching a query against a broadcast stream 有权
Title translation: 用于将查询与广播流匹配的系统和方法

公开(公告)号：US09563699B1

公开(公告)日：2017-02-07

申请号：US14692310

申请日：2015-04-21

Applicant: SoundHound Inc.

Inventor： Keyvan Mohajer , Bernard Mont-Reynaud , Joe Kyaw Soe Aung

IPC: G06F17/30 , H04H60/35 , H04H60/27 , H04H60/13 , G10L19/018 , H04H60/44 , H04H60/43 , H04H20/14

CPC classification number: G06F17/30752 , G06F17/30743 , G06F17/30772 , G10L19/018 , H04H20/14 , H04H60/13 , H04H60/27 , H04H60/35 , H04H60/372 , H04H60/43 , H04H60/44 , H04H60/58

Abstract: A method for matching a query against a broadcast stream includes receiving one or more broadcast streams, from which it generates and stores an audio fingerprint of a selected portion of each received broadcast stream. A query is received from which the method generates an audio fingerprint. From that point, the method continues by identifying audio content from the query, using the query audio fingerprint and a database of indexed audio content. The method concludes by identifying the source of the query using the query audio fingerprint and the stored audio fingerprints. Embodiments of the method further include predictively caching audio fingerprint sequences and corresponding audio item identifiers from a server after storing audio fingerprints extracted from the broadcast stream; and using the predictively cached audio fingerprint sequences to identify an audio item within the audio signal based on at least some additional audio fingerprints of the audio signal.

Abstract translation: 用于将查询与广播流匹配的方法包括接收一个或多个广播流，从而生成并存储每个接收的广播流的所选部分的音频指纹。接收到该方法生成音频指纹的查询。从那时起，该方法通过使用查询音频指纹和索引的音频内容的数据库识别来自查询的音频内容来继续。该方法通过使用查询音频指纹和存储的音频指纹识别查询的来源得出结论。该方法的实施例还包括在存储从广播流中提取的音频指纹之后，从服务器预测性地缓存音频指纹序列和对应的音频项目标识符; 以及使用所述预测缓存的音频指纹序列基于所述音频信号的至少一些附加音频指纹来识别所述音频信号内的音频项目。

25.

发明授权
System and method for performing dual mode speech recognition 有权
Title translation: 用于执行双模式语音识别的系统和方法

公开(公告)号：US09330669B2

公开(公告)日：2016-05-03

申请号：US14621024

申请日：2015-02-12

Applicant: SoundHound, Inc.

Inventor： Timothy P. Stonehocker , Keyvan Mohajer , Bernard Mont-Reynaud

IPC: G10L15/04 , G10L15/00 , G10L15/30 , G10L15/08 , G10L15/26 , G10L15/34

CPC classification number: G10L15/30 , G10L15/04 , G10L15/063 , G10L15/08 , G10L15/265 , G10L15/34 , G10L17/06 , G10L2015/0635 , G10L2015/081

Abstract: A system and method is presented for performing dual mode speech recognition, employing a local recognition module on a mobile device and a remote recognition engine on a server device. The system accepts a spoken query from a user, and both the local recognition module and the remote recognition engine perform speech recognition operations on the query, returning a transcription and confidence score, subject to a latency cutoff time. If both sources successfully transcribe the query, then the system accepts the result having the higher confidence score. If only one source succeeds, then that result is accepted. In either case, if the remote recognition engine does succeed in transcribing the query, then a client vocabulary is updated if the remote system result includes information not present in the client vocabulary.

Abstract translation: 提出了一种用于执行双模式语音识别的系统和方法，在移动设备上使用本地识别模块和在服务器设备上使用远程识别引擎。该系统接受来自用户的口语查询，并且本地识别模块和远程识别引擎都对查询执行语音识别操作，返回转录和置信度得分，并受到延迟截止时间的限制。如果两个来源成功地转录查询，则系统接受具有较高置信度得分的结果。如果只有一个源成功，则该结果被接受。在任一情况下，如果远程识别引擎确实成功地转录查询，则如果远程系统结果包括客户端词汇中不存在的信息，则更新客户词汇。

26.

发明授权
Method for embedding voice mail in a spoken utterance using a natural language processing computer system 有权
Title translation: 使用自然语言处理计算机系统以语音发音嵌入语音邮件的方法

公开(公告)号：US09292488B2

公开(公告)日：2016-03-22

申请号：US14170574

申请日：2014-02-01

Applicant: SOUNDHOUND, INC.

Inventor： Keyvan Mohajer , Bernard Mont-Reynaud

IPC: G06F17/27 , G10L15/02 , H04M1/65 , H04L12/58

CPC classification number: G10L15/22 , G06F17/27 , G06Q10/00 , G10L15/02 , G10L15/18 , G10L15/265 , G10L25/48 , G10L2015/223 , H04L51/066 , H04L51/22 , H04L51/32 , H04M1/6505 , H04M1/7255

Abstract: A method for processing a voice message in a computerized system. The method receives and records a speech utterance including a message portion and a communication portion. The method proceeds to parse the input to identify and separate the message portion and the communication portion. It then identifies communication parameters, including one or more destination mailboxes, from the communication portion, and it transmits the message portion to the destination mailbox as a voice message.

Abstract translation: 一种在计算机化系统中处理语音消息的方法。该方法接收并记录包括消息部分和通信部分的语音话语。该方法继续解析输入以识别和分离消息部分和通信部分。然后，它从通信部分识别包括一个或多个目的地邮箱的通信参数，并且将消息部分作为语音消息发送到目的地邮箱。

27.

发明申请
SYSTEM AND METHOD FOR PERFORMING DUAL MODE SPEECH RECOGNITION 有权
Title translation: 用于执行双模式语音识别的系统和方法

公开(公告)号：US20150154959A1

公开(公告)日：2015-06-04

申请号：US14621024

申请日：2015-02-12

Applicant: SoundHound, Inc.

Inventor： Timothy P. Stonehocker , Keyvan Mohajer , Bernard Mont-Reynaud

IPC: G10L15/30 , G10L15/08 , G10L15/26

CPC classification number: G10L15/30 , G10L15/04 , G10L15/063 , G10L15/08 , G10L15/265 , G10L15/34 , G10L17/06 , G10L2015/0635 , G10L2015/081

Abstract: A system and method is presented for performing dual mode speech recognition, employing a local recognition module on a mobile device and a remote recognition engine on a server device. The system accepts a spoken query from a user, and both the local recognition module and the remote recognition engine perform speech recognition operations on the query, returning a transcription and confidence score, subject to a latency cutoff time. If both sources successfully transcribe the query, then the system accepts the result having the higher confidence score. If only one source succeeds, then that result is accepted. In either case, if the remote recognition engine does succeed in transcribing the query, then a client vocabulary is updated if the remote system result includes information not present in the client vocabulary.

Abstract translation: 提出了一种用于执行双模式语音识别的系统和方法，在移动设备上使用本地识别模块和在服务器设备上使用远程识别引擎。该系统接受来自用户的口语查询，并且本地识别模块和远程识别引擎都对查询执行语音识别操作，返回转录和置信度得分，并受到延迟截止时间的限制。如果两个来源成功地转录查询，则系统接受具有较高置信度得分的结果。如果只有一个源成功，则该结果被接受。在任一情况下，如果远程识别引擎确实成功地转录查询，则如果远程系统结果包括客户端词汇中不存在的信息，则更新客户词汇。

28.

发明授权
Performing speech recognition using a local language context including a set of words with descriptions in terms of components smaller than the words 有权

公开(公告)号：US12223963B2

公开(公告)日：2025-02-11

申请号：US16900857

申请日：2020-06-12

Applicant: SoundHound, Inc.

Inventor： Keyvan Mohajer , Timothy Stonehocker , Bernard Mont-Reynaud

IPC: G10L15/30 , G10L15/04 , G10L15/06 , G10L15/08 , G10L15/26 , G10L15/34 , G10L17/06

Abstract: A method of a local recognition system controlling a host device to perform one or more operations is provided. The method includes receiving, by the local recognition system, a query, performing speech recognition on the received query by implementing, by the local recognition system, a local language context comprising a set of words comprising descriptions in terms of components smaller than the words, and performing speech recognition, using the local language context, to create a transcribed query. Further, the method includes controlling the host device in dependence upon the speech recognition performed on the transcribed query.

29.

发明公开
SYSTEMS AND METHODS FOR GENERATING AND USING SHARED NATURAL LANGUAGE LIBRARIES 审中-公开

公开(公告)号：US20230325358A1

公开(公告)日：2023-10-12

申请号：US18206567

申请日：2023-06-06

Applicant: SoundHound, Inc.

Inventor： Keyvan Mohajer

IPC: G06F16/33 , G10L15/06 , G10L15/183 , G06F16/174

CPC classification number: G06F16/1748 , G06F16/3334 , G06F16/3344 , G10L15/063 , G10L15/183 , G10L2015/0635

Abstract: Systems and methods for searching databases by sound data input are provided herein. A service provider may have a need to make their database(s) searchable through search technology. However, the service provider may not have the resources to implement such search technology. The search technology may allow for search queries using sound data input. The technology described herein provides a solution addressing the service provider’s need, by giving a search technology that furnishes search results in a fast, accurate manner. In further embodiments, systems and methods to monetize those search results are also described herein.

30.

发明授权
Method and system for acoustic model conditioning on non-phoneme information features 有权

公开(公告)号：US11741943B2

公开(公告)日：2023-08-29

申请号：US17224967

申请日：2021-04-07

Applicant: SoundHound, Inc.

Inventor： Zizu Gowayyed , Keyvan Mohajer

IPC: G10L15/22 , G10L15/02 , G10L15/04

CPC classification number: G10L15/02 , G10L15/04 , G10L15/22 , G10L2015/025

Abstract: A method and system for acoustic model conditioning on non-phoneme information features for optimized automatic speech recognition is provided. The method includes using an encoder model to encode sound embedding from a known key phrase of speech and conditioning an acoustic model with the sound embedding to optimize its performance in inferring the probabilities of phonemes in the speech. The sound embedding can comprise non-phoneme information related to the key phrase and the following utterance. Further, the encoder model and the acoustic model can be neural networks that are jointly trained with audio data.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification