Abstract:
A system and method is presented for performing dual mode speech recognition, employing a local recognition module on a mobile device and a remote recognition engine on a server device. The system accepts a spoken query from a user, and both the local recognition module and the remote recognition engine perform speech recognition operations on the query, returning a transcription and confidence score, subject to a latency cutoff time. If both sources successfully transcribe the query, then the system accepts the result having the higher confidence score. If only one source succeeds, then that result is accepted. In either case, if the remote recognition engine does succeed in transcribing the query, then a client vocabulary is updated if the remote system result includes information not present in the client vocabulary.
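To fix ideas, a minimal Python sketch of the arbitration logic described in this abstract is given below. The recognizer objects, their assumed `recognize()` return value of `(transcription, confidence)`, the word-level vocabulary update, and the thread-pool timeout handling are illustrative assumptions rather than the claimed implementation.

```python
import concurrent.futures as cf

def dual_mode_recognize(audio, local_recognizer, remote_engine,
                        client_vocabulary, latency_cutoff=2.0):
    """Return the accepted transcription, or None if neither source succeeds."""
    pool = cf.ThreadPoolExecutor(max_workers=2)
    futures = {
        "local": pool.submit(local_recognizer.recognize, audio),
        "remote": pool.submit(remote_engine.recognize, audio),
    }
    # Both recognizers share a single latency cutoff; late results are ignored.
    cf.wait(futures.values(), timeout=latency_cutoff)

    results = {}
    for name, future in futures.items():
        if future.done() and future.exception() is None:
            results[name] = future.result()   # assumed to be (transcription, confidence)
    pool.shutdown(wait=False)

    # When the remote engine succeeds, fold unseen words into the client vocabulary.
    if "remote" in results:
        transcription, _ = results["remote"]
        client_vocabulary.update(w for w in transcription.split()
                                 if w not in client_vocabulary)

    # Accept the higher-confidence result when both succeed; otherwise the lone success.
    if not results:
        return None
    return max(results.values(), key=lambda r: r[1])[0]
```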
Abstract:
A method for processing a voice message in a computerized system. The method receives and records a speech utterance including a message portion and a communication portion. The method proceeds to parse the input to identify and separate the message portion and the communication portion. It then identifies communication parameters, including one or more destination mailboxes, from the communication portion, and it transmits the message portion to each destination mailbox as a voice message.
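As one way to picture the parse-and-route step, here is a small Python sketch. It assumes the utterance has already been transcribed, that a cue phrase such as "send this to" separates the message portion from the communication portion, and that mailbox identifiers are separated by commas or "and"; the cue phrase and helper names are assumptions for illustration, not details taken from the abstract.

```python
import re
from dataclasses import dataclass

@dataclass
class ParsedVoiceMessage:
    message_portion: str          # the content to deliver as a voice message
    destination_mailboxes: list   # mailbox identifiers from the communication portion

def parse_voice_message(transcript: str) -> ParsedVoiceMessage:
    # Split on the (assumed) cue phrase separating message and communication portions.
    match = re.search(r"\bsend (?:this|it) to\b", transcript, re.IGNORECASE)
    if match is None:
        return ParsedVoiceMessage(transcript.strip(), [])
    message = transcript[:match.start()].strip()
    communication = transcript[match.end():]
    # Treat commas and "and" in the communication portion as separating mailboxes.
    mailboxes = [m.strip() for m in re.split(r",|\band\b", communication) if m.strip()]
    return ParsedVoiceMessage(message, mailboxes)

# Example: parse_voice_message("Meeting moved to three, send this to Alice and Bob")
# -> message portion "Meeting moved to three,", mailboxes ["Alice", "Bob"]
```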
Abstract:
A method for processing a natural language input to a computerized system. The method parses the input to identify a query portion and a communication portion of the input. The system then determines an answer to the query portion, including identifying communication parameters from the communication portion. Upon determining the answer, the system prepares an answer to the communication and transmits that answer. If the answer requires information from a remote source, the system creates a subsidiary query to obtain that information and then submits the subsidiary query to the remote source. The response to the subsidiary query is then used to compose the answer to the query. If the system concludes that the query portion does not require information from a remote source, it analyzes and answers the query locally.
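The overall control flow can be sketched in a few lines of Python. The cue phrase used to split the input, the dictionary lookup standing in for local analysis, and the callable stand-ins for the remote source and the transmitter are hypothetical placeholders, not the method itself.

```python
def process_input(nl_input, remote_source, send_message, local_facts):
    # Split the input into a query portion and a communication portion
    # (the cue phrase is an assumption for illustration).
    query_portion, _, communication_portion = nl_input.partition(" and send it to ")
    destination = communication_portion.strip() or None

    if query_portion in local_facts:
        # The query can be analyzed and answered locally.
        answer = local_facts[query_portion]
    else:
        # Otherwise, create a subsidiary query, submit it to the remote source,
        # and compose the answer from the subsidiary response.
        subsidiary_query = {"question": query_portion}
        answer = remote_source(subsidiary_query)

    # Transmit the prepared answer according to the communication parameters.
    send_message(answer, destination)
    return answer

# Example usage with toy stand-ins for the remote source and the transmitter:
facts = {"what time is it": "3 pm"}
remote = lambda q: f"remote answer to {q['question']!r}"
send = lambda answer, dest: print(f"to {dest}: {answer}")
process_input("what's the weather tomorrow and send it to Bob", remote, send, facts)
```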
Abstract:
A method and system for assisting a traveler. The method initiates a travel segment and interfaces with providers of data regarding geolocation, points of interest, or traffic. During execution of the method, the system monitors geolocation data. The system can predict a route of travel, basing that prediction on past geolocation data or directions from a geolocation provider. Also during execution of the method, the system can identify data of interest to a user, based on input initiated by the user. Further, the system can solicit input from the traveler, or it can perform tasks based on predefined criteria, such as locating and identifying points of interest between two geographic locations. During operation, the system can provide information to the traveler relating to data of interest. The system includes a client unit and a server unit, the client unit being mounted in an automotive vehicle or in a communications device.
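One of the tasks mentioned, locating points of interest between two geographic locations, can be illustrated with a short self-contained sketch. Treating "between" as lying within a fixed corridor around the straight segment joining the endpoints, the flat-earth projection, the 2 km threshold, and the made-up coordinates are all assumptions chosen only to keep the example small.

```python
import math

def pois_between(start, end, pois, max_km=2.0):
    """start/end are (lat, lon); pois is a list of (name, lat, lon)."""
    def to_xy(lat, lon):
        # Equirectangular projection around the start point (fine for short segments).
        k = 111.32  # approximate km per degree of latitude
        return ((lon - start[1]) * k * math.cos(math.radians(start[0])),
                (lat - start[0]) * k)

    ax, ay = to_xy(*start)
    bx, by = to_xy(*end)

    def dist_to_segment(px, py):
        dx, dy = bx - ax, by - ay
        if dx == dy == 0:
            return math.hypot(px - ax, py - ay)
        t = max(0.0, min(1.0, ((px - ax) * dx + (py - ay) * dy) / (dx * dx + dy * dy)))
        return math.hypot(px - (ax + t * dx), py - (ay + t * dy))

    return [name for name, lat, lon in pois
            if dist_to_segment(*to_xy(lat, lon)) <= max_km]

# Example with invented coordinates: only "Cafe A" lies near the route.
print(pois_between((37.77, -122.42), (37.80, -122.27),
                   [("Cafe A", 37.78, -122.38), ("Cafe B", 37.60, -122.00)]))
```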
Abstract:
A method of a local recognition system controlling a host device to perform one or more operations is provided. The method includes receiving, by the local recognition system, a query; implementing, by the local recognition system, a local language context comprising a set of words, each described in terms of components smaller than the words; and performing speech recognition on the received query, using the local language context, to create a transcribed query. Further, the method includes controlling the host device in dependence upon the transcribed query.
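A toy illustration of such a local language context is sketched below, with each word described by phoneme-like units smaller than the word. The vocabulary, the phoneme strings, the greedy matcher, and the dictionary-of-callables host interface are assumptions made for illustration; they are not the recognizer described in the abstract.

```python
# Local language context: each word is described by components smaller than the word.
LOCAL_CONTEXT = {
    "play":  ["P", "L", "EY"],
    "pause": ["P", "AO", "Z"],
    "next":  ["N", "EH", "K", "S", "T"],
}

def transcribe(phonemes, context=LOCAL_CONTEXT):
    """Greedily map a recognized phoneme sequence onto words in the local context."""
    words, i = [], 0
    while i < len(phonemes):
        for word, parts in context.items():
            if phonemes[i:i + len(parts)] == parts:
                words.append(word)
                i += len(parts)
                break
        else:
            i += 1  # skip phonemes that match no word in the local context
    return " ".join(words)

def control_host(phonemes, host):
    # Control the host device in dependence upon the transcribed query.
    command = transcribe(phonemes)
    host.get(command, lambda: None)()

# Example: a host device exposing callable operations keyed by command word.
control_host(["P", "L", "EY"], {"play": lambda: print("playing")})
```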
Abstract:
A system and method for masking an identity of a speaker of natural language speech, such as speech clips to be labeled by humans in a system generating voice transcriptions for training an automatic speech recognition model. The natural language speech is morphed prior to being presented to the human for labeling. In one embodiment, morphing comprises pitch shifting the speech randomly either up or down, then frequency shifting the speech, then pitch shifting the speech in a direction opposite the first pitch shift. Labeling the morphed speech comprises one or more of transcribing the morphed speech, identifying a gender of the speaker, identifying an accent of the speaker, and identifying a noise type of the morphed speech.
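In rough Python terms, the embodiment's morphing pipeline might look like the sketch below, using librosa for pitch shifting and an analytic-signal frequency shift from scipy. The shift amounts and the library choices are assumptions for illustration, not parameters taken from the disclosure.

```python
import numpy as np
import librosa
from scipy.signal import hilbert

def morph_for_labeling(y, sr, semitones=3.0, freq_shift_hz=150.0, rng=None):
    rng = rng or np.random.default_rng()
    direction = rng.choice([-1.0, 1.0])   # pitch shift randomly either up or down

    # 1) Pitch shift in a random direction.
    y = librosa.effects.pitch_shift(y, sr=sr, n_steps=direction * semitones)

    # 2) Frequency shift via the analytic signal (moves all components by a fixed Hz).
    t = np.arange(len(y)) / sr
    y = np.real(hilbert(y) * np.exp(2j * np.pi * freq_shift_hz * t)).astype(np.float32)

    # 3) Pitch shift in the direction opposite the first shift.
    y = librosa.effects.pitch_shift(y, sr=sr, n_steps=-direction * semitones)
    return y

# Usage: y, sr = librosa.load("clip.wav", sr=16000); morphed = morph_for_labeling(y, sr)
```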
Abstract:
A virtual assistant processes natural language expressions according to grammar rules created by domain providers. The virtual assistant uniquely identifies each of a multiplicity of users and stores values of grammar slots filled by natural language expressions from each user. The virtual assistant stores histories of slot values and computes statistics from the history. The virtual assistant provider, or a classification client, provides values of attributes of users as labels for a machine learning classification algorithm. The algorithm processes the grammar slot values and labels to compute probability distributions for unknown attribute values of users. A network effect of users and domain grammars makes the virtual assistant useful and provides increasing amounts of data that improve classification accuracy and usefulness.
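As a concrete, if toy, illustration of the classification step, the sketch below counts grammar slot values per user as features and trains a scikit-learn classifier on provider-supplied labels; predict_proba then yields a probability distribution over an unknown user's attribute value. The slot names, the commuting-mode labels, and the choice of logistic regression are invented for the example.

```python
from collections import Counter
from sklearn.feature_extraction import DictVectorizer
from sklearn.linear_model import LogisticRegression

# Histories of (slot, value) pairs filled by each user's natural language expressions.
histories = {
    "user1": [("cuisine", "sushi"), ("city", "tokyo"), ("cuisine", "sushi")],
    "user2": [("cuisine", "bbq"), ("city", "austin")],
    "user3": [("cuisine", "sushi"), ("city", "tokyo")],
}
labels = {"user1": "train", "user2": "car"}   # known values of a commuting-mode attribute

def slot_features(history):
    # Statistics computed from the slot-value history: here, simple value counts.
    return Counter(f"{slot}={value}" for slot, value in history)

vec = DictVectorizer()
known_users = [u for u in histories if u in labels]
X = vec.fit_transform([slot_features(histories[u]) for u in known_users])
clf = LogisticRegression().fit(X, [labels[u] for u in known_users])

# Probability distribution over the unknown attribute value for user3.
X_unknown = vec.transform([slot_features(histories["user3"])])
print(dict(zip(clf.classes_, clf.predict_proba(X_unknown)[0])))
```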
Abstract:
The application provides an apparatus, platform, method and medium for intention importance inference. The apparatus includes an interface configured to receive user-related information; and a processor coupled to the interface and configured to: extract data related to different aspects of a user from the user-related information; generate a plurality of intention probes based on the data related to different aspects of the user, each intention probe comprising an intention and associated data items; infer an importance of each intention probe by calculating a score of each associated data item of the intention probe based on the data related to different aspects of the user; and provide information associated with the intention probe having the highest importance.
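A small sketch of the inference step is given below: each intention probe carries associated data items, each item is scored against the data extracted about the user, and the probe with the highest aggregate importance is surfaced. The overlap-based scoring rule and the example data are illustrative assumptions, not the apparatus's actual scoring.

```python
from dataclasses import dataclass, field

@dataclass
class IntentionProbe:
    intention: str
    data_items: dict = field(default_factory=dict)   # item name -> item value

def importance(probe, user_data):
    # Score each associated data item by whether it is supported by the data
    # extracted about the user, then aggregate into the probe's importance.
    return sum(1.0 for k, v in probe.data_items.items() if user_data.get(k) == v)

def most_important(probes, user_data):
    return max(probes, key=lambda p: importance(p, user_data))

user_data = {"location": "airport", "time": "morning", "calendar": "flight at 9"}
probes = [
    IntentionProbe("order coffee", {"time": "morning"}),
    IntentionProbe("check in for flight", {"location": "airport", "calendar": "flight at 9"}),
]
print(most_important(probes, user_data).intention)   # -> "check in for flight"
```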
Abstract:
A method and system for acoustic model conditioning on non-phoneme information features for optimized automatic speech recognition is provided. The method includes using an encoder model to encode a sound embedding from a known key phrase of speech and conditioning an acoustic model with the sound embedding to optimize its performance in inferring the probabilities of phonemes in the speech. The sound embedding can comprise non-phoneme information related to the key phrase and the following utterance. Further, the encoder model and the acoustic model can be neural networks that are jointly trained with audio data.
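A minimal PyTorch sketch of the idea follows: an encoder turns the key-phrase features into a sound embedding, and the acoustic model is conditioned on that embedding (here by concatenating it to every frame) when predicting per-frame phoneme probabilities, with both networks trained jointly on the same loss. The layer sizes, the conditioning-by-concatenation, and the toy training step are assumptions, not the disclosed architecture.

```python
import torch
import torch.nn as nn

class SoundEmbeddingEncoder(nn.Module):
    def __init__(self, feat_dim=40, embed_dim=64):
        super().__init__()
        self.rnn = nn.GRU(feat_dim, embed_dim, batch_first=True)

    def forward(self, key_phrase_feats):          # (batch, frames, feat_dim)
        _, h = self.rnn(key_phrase_feats)
        return h[-1]                              # (batch, embed_dim) sound embedding

class ConditionedAcousticModel(nn.Module):
    def __init__(self, feat_dim=40, embed_dim=64, n_phonemes=44):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(feat_dim + embed_dim, 256), nn.ReLU(),
            nn.Linear(256, n_phonemes),
        )

    def forward(self, utterance_feats, sound_embedding):
        # Broadcast the embedding across the utterance frames and condition on it.
        cond = sound_embedding.unsqueeze(1).expand(-1, utterance_feats.size(1), -1)
        logits = self.net(torch.cat([utterance_feats, cond], dim=-1))
        return logits.log_softmax(dim=-1)         # per-frame phoneme log-probabilities

# Joint training step on random stand-in data: both networks share one loss.
encoder, acoustic = SoundEmbeddingEncoder(), ConditionedAcousticModel()
opt = torch.optim.Adam(list(encoder.parameters()) + list(acoustic.parameters()), lr=1e-3)
key_phrase, utterance = torch.randn(8, 30, 40), torch.randn(8, 200, 40)
targets = torch.randint(0, 44, (8, 200))
log_probs = acoustic(utterance, encoder(key_phrase))
loss = nn.NLLLoss()(log_probs.transpose(1, 2), targets)
opt.zero_grad(); loss.backward(); opt.step()
```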