Abstract:
A dual mode speech recognition system sends speech to two or more speech recognizers. If a first recognition result is received whose recognition score exceeds a high threshold, the first result is selected without waiting for another result. If the score is below a low threshold, the first result is ignored. At intermediate recognition scores, a timeout duration is dynamically determined as a function of the recognition score; the timeout duration determines how long the system will wait for another result. Many functions of the recognition score are possible, but timeout durations generally decrease as scores increase. If a second recognition result is received before the timeout occurs, a comparison based on recognition scores determines whether the first result or the second result is the basis for creating a response.
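For illustration, a minimal Python sketch of the arbitration logic described above; the threshold values, the linear timeout function, and the wait_for_second interface are assumptions, not part of the disclosure:

```python
# Illustrative thresholds and maximum wait; the actual values and the
# shape of the timeout function are assumptions for this sketch.
HIGH_THRESHOLD = 0.90
LOW_THRESHOLD = 0.40
MAX_TIMEOUT_S = 2.0

def timeout_for_score(score):
    """Timeout duration decreases as the recognition score increases."""
    span = HIGH_THRESHOLD - LOW_THRESHOLD
    return MAX_TIMEOUT_S * (HIGH_THRESHOLD - score) / span

def arbitrate(first, wait_for_second):
    """first: (transcript, score);
    wait_for_second(timeout) -> (transcript, score) or None on timeout."""
    transcript, score = first
    if score >= HIGH_THRESHOLD:
        return transcript                  # select immediately, no waiting
    if score < LOW_THRESHOLD:
        first = None                       # ignore the low-score result
    timeout = timeout_for_score(max(score, LOW_THRESHOLD))
    second = wait_for_second(timeout)      # wait up to the dynamic timeout
    if second is None:
        return transcript if first else None
    if first is None or second[1] > score:
        return second[0]                   # second result wins the comparison
    return transcript
```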
Abstract:
Speech synthesis chooses pronunciations of words with multiple acceptable pronunciations based on an indication of a personal, class-based, or global preference or an intended non-preferred pronunciation. A speaker's words can be parroted back on personal devices using preferred pronunciations for accent training. Degrees of pronunciation error are computed and indicated to the user in a visual transcription or audibly as word emphasis in parroted speech. Systems can use sets of phonemes extended beyond those generally recognized for a language. Speakers are classified in order to choose specific phonetic dictionaries or adapt global ones. User profiles maintain lists of which pronunciations are preferred among ones acceptable for words with multiple recognized pronunciations. Systems use multiple correlations of word preferences across users to predict user preferences for unlisted words. Speaker-preferred pronunciations are used to weight the scores of transcription hypotheses based on phoneme sequence hypotheses in speech engines.
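As a sketch of how a user profile of preferred pronunciations might weight transcription hypotheses, assuming ARPAbet-style phoneme labels, illustrative profile contents, and an assumed multiplicative boost:

```python
# Per-user list of preferred pronunciations among acceptable alternatives.
# Words, phoneme labels, and the boost factor are illustrative assumptions.
user_profile = {
    "tomato": ("T", "AH", "M", "EY", "T", "OW"),
    "either": ("IY", "DH", "ER"),
}

PREFERENCE_BOOST = 1.2  # assumed multiplicative weight

def weight_hypothesis(word, phonemes, base_score):
    """Boost a transcription hypothesis whose phoneme sequence matches the
    speaker's preferred pronunciation of the word."""
    preferred = user_profile.get(word)
    if preferred is not None and tuple(phonemes) == preferred:
        return base_score * PREFERENCE_BOOST
    return base_score
```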
Abstract:
A system and method are presented for performing dual mode speech recognition, employing a local recognition module on a mobile device and a remote recognition engine on a server device. The system accepts a spoken query from a user, and both the local recognition module and the remote recognition engine perform speech recognition operations on the query, returning a transcription and confidence score, subject to a latency cutoff time. If both sources successfully transcribe the query, then the system accepts the result having the higher confidence score. If only one source succeeds, then that result is accepted. In either case, if the remote recognition engine succeeds in transcribing the query, the client vocabulary is updated with any information in the remote result that it does not already contain.
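A minimal sketch of the selection and vocabulary-update flow, assuming each recognizer is a callable returning a (transcription, confidence) pair or None when it misses the latency cutoff, and that the client vocabulary is a set of words:

```python
def dual_mode_recognize(query_audio, local, remote, vocabulary):
    """local/remote: callables returning (transcription, confidence) or None;
    vocabulary: a set of words known to the client (an assumed representation)."""
    local_result = local(query_audio)
    remote_result = remote(query_audio)

    if remote_result is not None:
        # Update the client vocabulary with any words it lacks.
        for word in remote_result[0].split():
            vocabulary.add(word)

    if local_result and remote_result:
        # Both succeeded: accept the higher-confidence result.
        return max(local_result, remote_result, key=lambda r: r[1])[0]
    chosen = local_result or remote_result
    return chosen[0] if chosen else None
```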
Abstract:
In one implementation, a method is described for retrying matching of an audio query against audio references. The method includes receiving a follow-up query that requests a retry at matching a previously submitted audio query. In some implementations, this follow-up query is received without any recognition hint that suggests how to retry matching. The follow-up query includes the audio query or a reference to the audio query to be used in the retry. The method further includes retrying matching of the audio query using retry matching resources that include an expanded group of audio references, identifying at least one match, and transmitting a report of the match. Optionally, the method includes storing data that correlates the follow-up query, the audio query or the reference to the audio query, and the match after retrying.
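A hypothetical sketch of the retry flow; the dictionary keys, the match_fn and fetch_audio interfaces, and the in-memory correlation log are all assumptions:

```python
retry_log = []  # optionally correlates follow-up query, audio, and match

def handle_follow_up(follow_up, expanded_refs, match_fn, fetch_audio):
    """follow_up carries the audio query itself or a reference to it."""
    audio = follow_up.get("audio") or fetch_audio(follow_up["audio_ref"])
    # Retry against the expanded group of audio references.
    matches = match_fn(audio, expanded_refs)
    report = {"query_id": follow_up["id"],
              "match": matches[0] if matches else None}
    retry_log.append(report)  # store the correlation after retrying
    return report             # transmit a report of the match
```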
Abstract:
A client, such as a mobile phone, receives an audio signal from a microphone; the sound originates from a broadcast, such as a radio or television program. The client sends a segment of audio data from the broadcast program to a detection system, such as a server. A broadcast monitoring system receives many broadcast audio signals and encodes their fingerprints in a database for matching. The detection system compares fingerprints of the client's audio data to the content fingerprints to identify which broadcast station broadcast the signal containing the sampled content. This information enables the client to resume the experience of the broadcast from one of a number of possible media sources.
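For illustration, a sketch of the server-side lookup, treating fingerprints as hashable tokens and using a simple voting scheme; both choices are assumptions for the sketch:

```python
from collections import defaultdict

station_index = defaultdict(set)  # fingerprint -> stations that emitted it

def index_broadcast(station_id, fingerprints):
    """Broadcast monitoring: encode each station's fingerprints for matching."""
    for fp in fingerprints:
        station_index[fp].add(station_id)

def identify_station(client_fingerprints):
    """Return the station whose content best matches the client's segment."""
    votes = defaultdict(int)
    for fp in client_fingerprints:
        for station in station_index.get(fp, ()):
            votes[station] += 1
    return max(votes, key=votes.get) if votes else None
```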
Abstract:
The present invention relates to continuous monitoring of an audio signal and identification of audio items within the signal. The technology disclosed uses predictive caching of fingerprints to improve efficiency. Fingerprints are cached both for tracking an audio signal with known alignment and for watching an audio signal without known alignment, based on fingerprints already identified in the signal. Software running on a smartphone or other battery-powered device cooperates with software running on an audio identification server.
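A minimal sketch of client-side predictive caching, assuming a server interface with fingerprints_after, likely_next, and fingerprints methods; all of these names are hypothetical:

```python
class FingerprintCache:
    """Caches fingerprints for 'tracking' (known alignment within an
    identified item) and 'watching' (no known alignment)."""

    def __init__(self, server):
        self.server = server
        self.tracking = {}   # item_id -> fingerprints aligned to a position
        self.watching = {}   # item_id -> fingerprints with no alignment

    def on_identified(self, item_id, position):
        # Prefetch fingerprints expected next in the identified item...
        self.tracking[item_id] = self.server.fingerprints_after(item_id, position)
        # ...and fingerprints of items predicted to follow or co-occur, so the
        # battery-powered device can match locally without a server round trip.
        for related in self.server.likely_next(item_id):
            self.watching[related] = self.server.fingerprints(related)
```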
Abstract:
Methods and systems for correcting a likely erroneous word in a speech transcription are disclosed. By evaluating token confidence scores of individual words or phrases, the automatic speech recognition system can replace a low-confidence word with a substitute word or phrase. Among various approaches, neural network models can be used to generate individual confidence scores. Such word substitution enables the speech recognition system to automatically detect and correct likely transcription errors. Furthermore, the system can indicate the token confidence scores on a graphical user interface for labeling and dictionary enhancement.
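A hypothetical sketch of the substitution step, assuming per-token confidence scores are available and a candidates_fn (e.g. backed by a neural model) proposes a replacement together with its own score; the threshold is illustrative:

```python
CONFIDENCE_FLOOR = 0.5  # illustrative threshold for "likely erroneous"

def correct_transcript(tokens, scores, candidates_fn):
    """tokens: transcript words; scores: per-token confidences;
    candidates_fn(context, i) -> (substitute_word, substitute_score)."""
    corrected = list(tokens)
    for i, (word, score) in enumerate(zip(tokens, scores)):
        if score < CONFIDENCE_FLOOR:
            substitute, sub_score = candidates_fn(corrected, i)
            if sub_score > score:
                corrected[i] = substitute  # replace the low-confidence word
    return corrected
```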
Abstract:
A method of controlling an engagement state of an agent during a human-machine dialog is provided. The method can include receiving a spoken request that is a conditional locking request, wherein the conditional locking request uses a natural language expression to explicitly specify a locking condition, which is a predicate. The predicate is stored in a format that the agent can evaluate when needed, and the agent enters a conditionally locked state in response to the conditional locking request. In the conditionally locked state, the agent receives a multiplicity of requests without a need for a wakeup indicator; for each request from the multiplicity of requests, the agent evaluates the predicate upon receiving the request and processes the request if the predicate is true.
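For illustration, a minimal sketch of the conditionally locked state, assuming the natural language locking condition has already been parsed into a zero-argument callable predicate (the parsing step is out of scope here):

```python
class Agent:
    def __init__(self):
        self.predicate = None  # locking condition stored in evaluatable form

    def handle_lock_request(self, predicate):
        """e.g. 'only listen while the timer is running' parsed into a
        zero-argument callable; entering the conditionally locked state."""
        self.predicate = predicate

    def handle_request(self, request, process):
        # No wakeup indicator is required; the predicate is evaluated
        # upon receiving each request.
        if self.predicate is not None and self.predicate():
            return process(request)
        return None  # predicate false: the request is not processed
```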
Abstract:
A machine learning system for a digital assistant is described, together with a method of training such a system. The machine learning system is based on an encoder-decoder sequence-to-sequence neural network architecture trained to map input sequence data to output sequence data, where the input sequence data relates to an initial query and the output sequence data represents a canonical data representation of the query. The method of training involves generating a training dataset for the machine learning system by clustering vector representations of the query data samples to generate canonical-query and original-query pairs.
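As a sketch of the training-pair generation under stated assumptions: k-means over precomputed query embeddings, with each cluster's medoid taken as the canonical query; scikit-learn is used for clustering, and the choice of medoid is an assumption:

```python
import numpy as np
from sklearn.cluster import KMeans

def make_training_pairs(queries, embeddings, n_clusters=10):
    """queries: list of strings; embeddings: (n_queries, dim) array.
    Returns (original query, canonical query) pairs for seq2seq training."""
    km = KMeans(n_clusters=n_clusters, n_init=10).fit(embeddings)
    pairs = []
    for label in range(n_clusters):
        members = [i for i, l in enumerate(km.labels_) if l == label]
        if not members:
            continue
        center = km.cluster_centers_[label]
        # The member closest to the centroid serves as the canonical query.
        canonical = min(members,
                        key=lambda i: np.linalg.norm(embeddings[i] - center))
        pairs.extend((queries[i], queries[canonical]) for i in members)
    return pairs
```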
Abstract:
A server supports multiple virtual assistants. It receives requests that include wake phrase audio and an identification of the source of the request, such as a virtual assistant device. Based on the identification, the server searches a database for a wake phrase detector appropriate for the identified source. The server then applies the wake phrase detector to the received wake phrase audio. If the wake phrase audio triggers the wake phrase detector, the server provides an appropriate response to the source.
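A minimal sketch of the per-source dispatch, assuming a dict-like detector database and a detector object with a triggered_by method; both interfaces are hypothetical:

```python
def handle_wake_request(source_id, wake_audio, detector_db, respond):
    """source_id identifies the requesting source, e.g. a virtual
    assistant device."""
    detector = detector_db.get(source_id)  # look up the source's detector
    if detector is None:
        return None                        # no detector for this source
    if detector.triggered_by(wake_audio):  # apply detector to the audio
        return respond(source_id)          # provide an appropriate response
    return None
```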