Abstract:
A system and method are presented for performing dual-mode speech recognition, employing a local recognition module on a mobile device and a remote recognition engine on a server device. The system accepts a spoken query from a user, and both the local recognition module and the remote recognition engine perform speech recognition on the query, each returning a transcription and a confidence score, subject to a latency cutoff time. If both sources successfully transcribe the query, the system accepts the result with the higher confidence score. If only one source succeeds, that result is accepted. In either case, if the remote recognition engine succeeds in transcribing the query, the client vocabulary is updated with any information in the remote result that is not already present in the client vocabulary.
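The arbitration logic described above can be sketched as follows. This is an illustrative outline, not the patented implementation; the names `RecognitionResult`, `select_result`, and `update_vocabulary` are assumptions for the sketch.

```python
from dataclasses import dataclass
from typing import Optional, Set

@dataclass
class RecognitionResult:
    transcription: str
    confidence: float

def select_result(local: Optional[RecognitionResult],
                  remote: Optional[RecognitionResult]) -> Optional[RecognitionResult]:
    """Both sources succeeded: take the higher confidence score.
    One succeeded: take that one. Neither: no result."""
    if local and remote:
        return remote if remote.confidence > local.confidence else local
    return local or remote

def update_vocabulary(vocabulary: Set[str],
                      remote: Optional[RecognitionResult]) -> Set[str]:
    """If the remote engine succeeded, add any words from its result
    that are not already in the client vocabulary."""
    if remote is not None:
        vocabulary |= set(remote.transcription.lower().split())
    return vocabulary
```

In this sketch a result of `None` stands for a source that failed to transcribe within the latency cutoff.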
Abstract:
In one implementation, a method is described for retrying the matching of an audio query against audio references. The method includes receiving a follow-up query that requests a retry at matching a previously submitted audio query. In some implementations, this follow-up query is received without any recognition hint suggesting how to retry the matching. The follow-up query includes the audio query, or a reference to the audio query, to be used in the retry. The method further includes retrying the matching of the audio query using retry matching resources that include an expanded group of audio references, identifying at least one match, and transmitting a report of the match. Optionally, the method includes storing data that correlates the follow-up query, the audio query or the reference to it, and the match found after retrying.
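A minimal sketch of the retry step might look like the following. Set-overlap scoring is an illustrative stand-in for a real fingerprint matcher, and the function names and the 0.5 threshold are assumptions.

```python
def retry_match(query_fp, expanded_refs, min_overlap=0.5):
    """Match the query's fingerprint codes against an expanded group of
    references; return (ref_id, score) pairs sorted by descending score."""
    q = set(query_fp)
    matches = []
    for ref_id, ref_fp in expanded_refs.items():
        overlap = len(q & set(ref_fp)) / max(len(q), 1)
        if overlap >= min_overlap:
            matches.append((ref_id, overlap))
    return sorted(matches, key=lambda m: -m[1])

def record_retry(log, follow_up_id, query_ref, matches):
    """Optionally correlate the follow-up query, the audio query
    reference, and the match found after retrying."""
    if matches:
        log.append((follow_up_id, query_ref, matches[0][0]))
    return log
```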
Abstract:
A client, such as a mobile phone, receives an audio signal from a microphone; the sound comes from a broadcast source such as a radio or television program. The client sends a segment of audio data from the broadcast program to a detection system, such as a server. A broadcast monitoring system receives many broadcast audio signals and encodes their fingerprints in a database for matching. The detection system compares fingerprints of the client's audio data to the content fingerprints to identify which broadcast station broadcast the signal containing the sampled content. This information enables the client to resume the experience of the broadcast from one of a number of possible media sources.
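The station-identification step can be sketched as a fingerprint lookup. This is an assumed, simplified scheme: real systems match robust audio fingerprints, not plain integer codes.

```python
def identify_station(sample_fps, station_db):
    """station_db maps a broadcast station to the set of content
    fingerprints the monitoring system has encoded for it. Return the
    station whose fingerprints best overlap the client's sample,
    or None if nothing matches."""
    sample = set(sample_fps)
    best, best_score = None, 0
    for station, fps in station_db.items():
        score = len(sample & fps)
        if score > best_score:
            best, best_score = station, score
    return best
```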
Abstract:
The present invention relates to the continuous monitoring of an audio signal and identification of audio items within an audio signal. The technology disclosed utilizes predictive caching of fingerprints to improve efficiency. Fingerprints are cached for tracking an audio signal with known alignment and for watching an audio signal without known alignment, based on already identified fingerprints extracted from the audio signal. Software running on a smart phone or other battery-powered device cooperates with software running on an audio identification server.
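The two caching roles described above can be sketched as follows. The catalog layout, window size, and function name are assumptions made for illustration.

```python
def prefetch_fingerprints(identified_item, position, catalog, window=3):
    """After an audio item has been identified at `position` in the
    stream, cache the fingerprints expected next for tracking it (known
    alignment), plus opening fingerprints of other items for watching
    the signal (no known alignment). `catalog` maps each item to an
    ordered list of fingerprint codes."""
    return {
        "tracking": catalog[identified_item][position:position + window],
        "watching": {item: fps[:1] for item, fps in catalog.items()
                     if item != identified_item},
    }
```

On a battery-powered device, holding only this small cache locally lets most comparisons run on the phone, deferring to the audio identification server when the cache misses.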
Abstract:
A method of controlling an engagement state of an agent during a human-machine dialog is provided. The method can include: receiving a spoken request that is a conditional locking request, wherein the conditional locking request uses a natural language expression to explicitly specify a locking condition, which is a predicate; storing the predicate in a format that can be evaluated when needed by the agent; entering a conditionally locked state in response to the conditional locking request; in the conditionally locked state, receiving a multiplicity of requests without a need for a wakeup indicator; and, for a request from the multiplicity of requests, evaluating the predicate upon receiving the request and processing the request if the predicate is true.
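The state machine above can be sketched minimally. The class and method names are illustrative, and the predicate is represented as a callable over a request context rather than a stored natural-language form.

```python
class Agent:
    def __init__(self):
        self.locking_predicate = None  # None = normal (wakeup indicator required)

    def conditional_lock(self, predicate):
        """Store the predicate extracted from the conditional locking
        request and enter the conditionally locked state."""
        self.locking_predicate = predicate

    def handle(self, request, context):
        """In the conditionally locked state no wakeup indicator is
        needed: the stored predicate is evaluated for each incoming
        request, which is processed only when the predicate is true."""
        if self.locking_predicate is None:
            return None  # normal state: a wakeup indicator would be required
        if self.locking_predicate(context):
            return f"processed: {request}"
        return None  # predicate false: the request is not processed
```

For example, "only listen to Alice" would be stored as a predicate testing the speaker field of the request context.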
Abstract:
A method is described that includes processing text and speech from an input utterance using local overrides of default dictionary pronunciations. Applying this method, the word-level grammar used to process the utterance's tokens specifies at least one local word phonetic variant that applies within a specific production rule; within the local context of that production rule, the local word phonetic variant overrides one or more default dictionary phonetic versions of the word. This method can be applied to parsing utterances in which the pronunciation of some words depends on their syntactic or semantic context.
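The override lookup can be sketched as follows. The ARPAbet-style pronunciations and the rule and dictionary names are illustrative assumptions; the point is only the precedence of a rule-local variant over the dictionary.

```python
# Default dictionary: "read" is ambiguous between present and past tense.
DEFAULT_DICTIONARY = {
    "read": ["R IY D", "R EH D"],
    "the": ["DH AH", "DH IY"],
}

# Hypothetical word-level grammar: a production rule may carry local
# phonetic variants that apply only within that rule's context.
GRAMMAR_RULES = {
    "PAST_TENSE_VP": {"read": ["R EH D"]},
}

def pronunciations(word, rule):
    """Within the local context of `rule`, a local phonetic variant
    overrides the default dictionary entries; elsewhere the default
    dictionary pronunciations apply."""
    local = GRAMMAR_RULES.get(rule, {})
    if word in local:
        return local[word]
    return DEFAULT_DICTIONARY.get(word, [])
```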
Abstract:
A method is provided for advertisement selection. The method includes recognizing words from a user's speech over a large number of interactions, computing the number of unique words uttered during the interactions, classifying the user by that number, and selecting an advertisement targeted to users of that classification.
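The classification step can be sketched directly. The bucket names and threshold values are assumptions for illustration only.

```python
def classify_by_vocabulary(transcripts, thresholds=(50, 200)):
    """Compute the number of unique words across the user's recognized
    interactions and bucket the user by that count."""
    unique_words = set()
    for transcript in transcripts:
        unique_words.update(transcript.lower().split())
    count = len(unique_words)
    low, high = thresholds
    if count < low:
        return "small-vocabulary"
    if count < high:
        return "medium-vocabulary"
    return "large-vocabulary"
```

An advertisement inventory could then be keyed by these classes, so that selection is a lookup on the user's class.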
Abstract:
The technology disclosed relates to computer-implemented conversational agents, and particularly to detecting a point in the dialog (end of turn, or end of utterance) at which the agent can start responding to the user. The technology disclosed provides a method of incrementally parsing an input utterance with multiple parses operating in parallel. It includes detecting an interjection point in the input utterance when a pause exceeds a high threshold, or when a pause exceeds a low threshold and at least one of the parallel parses is determined to be interruptible by matching a complete sentence according to the grammar. The conversational agent starts responding to the user at a detected interjection point.
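The two-threshold decision can be sketched as a small predicate. The threshold values are illustrative assumptions, as is representing each parallel parse by a boolean interruptibility flag.

```python
def is_interjection_point(pause_ms, parses, low_ms=300, high_ms=1200):
    """Interject when the pause exceeds the high threshold, or when it
    exceeds the low threshold and at least one parallel parse is
    interruptible, i.e. has matched a complete sentence in the grammar.
    `parses` is a list of booleans (True = interruptible)."""
    if pause_ms > high_ms:
        return True
    return pause_ms > low_ms and any(parses)
```

The low threshold lets the agent respond quickly when a parse already forms a complete sentence, while the high threshold guarantees a response even when no parse is complete.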
Abstract:
Systems and methods are provided for presenting relevant information in response to natural language expressions. The expressions may be part of a spoken conversation between people who are either together or in separate locations. The information may be provided visually. Whether a piece of information is relevant enough to display can be conditioned on a model of the speaker's interests. The interest model can be based on a history of the speaker's expressions and on information from a user profile. The display of information can also be conditioned on the current conversation topic and on whether the same information has been displayed recently.
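The display-gating conditions can be combined in a single check, sketched below. The interest threshold, cooldown period, and data layout are all assumptions for illustration.

```python
def should_display(item, interest_model, current_topic, last_shown, now_s,
                   min_interest=0.5, cooldown_s=300):
    """Display only if the item matches the current conversation topic,
    the speaker's modeled interest in that topic is high enough, and the
    same item has not been displayed recently. `last_shown` maps item id
    to the time (seconds) it was last displayed."""
    if item["topic"] != current_topic:
        return False
    if interest_model.get(item["topic"], 0.0) < min_interest:
        return False
    shown_at = last_shown.get(item["id"])
    if shown_at is not None and now_s - shown_at < cooldown_s:
        return False
    return True
```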
Abstract:
A server receives a user audio stream, the stream comprising multiple utterances. A query-processing module of the server continuously listens to and processes the utterances. The processing includes parsing successive utterances and recognizing corresponding queries, taking appropriate actions while the utterances are being received. In some embodiments, a query may be parsed and executed before the previous query's execution is complete.
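The overlapped execution described above can be sketched with one worker thread per recognized query, so a query may run before the previous query's execution completes. The function name is an assumption, and `str.strip` stands in for real parsing and recognition.

```python
import threading

def process_stream(utterances, execute):
    """Parse successive utterances as they arrive and launch each
    recognized query on its own worker thread; results are collected
    in arrival order once all queries finish."""
    results = [None] * len(utterances)
    threads = []
    for i, utterance in enumerate(utterances):
        query = utterance.strip()  # stand-in for parsing the utterance

        def run(i=i, query=query):
            results[i] = execute(query)

        t = threading.Thread(target=run)
        t.start()  # execution may overlap with handling of later utterances
        threads.append(t)
    for t in threads:
        t.join()
    return results
```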