Patent search ap:("SoundHound Page Inc.") AND inv:"Timothy P. Stonehocker"

1.

发明申请
Systems and Methods for Sound Recognition 审中-公开

公开(公告)号：US20160275184A1

公开(公告)日：2016-09-22

申请号：US15012741

申请日：2016-02-01

Applicant: SoundHound, Inc.

Inventor： Aaron Steven Master , Timothy P. Stonehocker , Benjamin John Levitt , Jun Huang , Keyvan Mohajer

IPC: G06F17/30 , G10L25/51

CPC classification number: G06F16/683 , G06F16/634 , G06F16/68 , G10L25/51

Abstract: Systems and methods for recognizing sounds are provided herein. User input relating to one or more sounds is received from a computing device. Instructions, which are stored in memory, are executed by a processor to discriminate the one or more sounds, extract music features from the one or more sounds, analyze the music features using one or more databases, and obtain information regarding the music features based on the analysis. Further, information regarding the music features of the one or more sounds may be transmitted to display on the computing device.

2.

发明申请
System and Methods for Continuous Audio Matching 审中-公开
Title translation: 用于连续音频匹配的系统和方法

公开(公告)号：US20160292266A1

公开(公告)日：2016-10-06

申请号：US15182300

申请日：2016-06-14

Applicant: SoundHound, Inc.

Inventor： Bernard Mont-Reynaud , Aaron Master , Timothy P. Stonehocker , Keyvan Mohajer

IPC: G06F17/30

CPC classification number: G06F17/30743 , G06F17/30026 , G06F17/30749 , G06F17/30772

Abstract: The present invention relates to the continuous monitoring of an audio signal and identification of audio items within an audio signal. The technology disclosed utilizes predictive caching of fingerprints to improve efficiency. Fingerprints are cached for tracking an audio signal with known alignment and for watching an audio signal without known alignment, based on already identified fingerprints extracted from the audio signal. Software running on a smart phone or other battery-powered device cooperates with software running on an audio identification server.

Abstract translation: 本发明涉及音频信号的连续监视和音频信号内的音频项目的识别。所公开的技术利用指纹的预测性缓存来提高效率。基于从音频信号提取的已经识别的指纹，缓存指纹用于跟踪具有已知对准的音频信号并且用于观看没有已知对准的音频信号。在智能手机或其他电池供电设备上运行的软件与在音频识别服务器上运行的软件配合使用。

3.

发明授权
System and method for performing dual mode speech recognition 有权
Title translation: 用于执行双模式语音识别的系统和方法

公开(公告)号：US09330669B2

公开(公告)日：2016-05-03

申请号：US14621024

申请日：2015-02-12

Applicant: SoundHound, Inc.

Inventor： Timothy P. Stonehocker , Keyvan Mohajer , Bernard Mont-Reynaud

IPC: G10L15/04 , G10L15/00 , G10L15/30 , G10L15/08 , G10L15/26 , G10L15/34

CPC classification number: G10L15/30 , G10L15/04 , G10L15/063 , G10L15/08 , G10L15/265 , G10L15/34 , G10L17/06 , G10L2015/0635 , G10L2015/081

Abstract: A system and method is presented for performing dual mode speech recognition, employing a local recognition module on a mobile device and a remote recognition engine on a server device. The system accepts a spoken query from a user, and both the local recognition module and the remote recognition engine perform speech recognition operations on the query, returning a transcription and confidence score, subject to a latency cutoff time. If both sources successfully transcribe the query, then the system accepts the result having the higher confidence score. If only one source succeeds, then that result is accepted. In either case, if the remote recognition engine does succeed in transcribing the query, then a client vocabulary is updated if the remote system result includes information not present in the client vocabulary.

Abstract translation: 提出了一种用于执行双模式语音识别的系统和方法，在移动设备上使用本地识别模块和在服务器设备上使用远程识别引擎。该系统接受来自用户的口语查询，并且本地识别模块和远程识别引擎都对查询执行语音识别操作，返回转录和置信度得分，并受到延迟截止时间的限制。如果两个来源成功地转录查询，则系统接受具有较高置信度得分的结果。如果只有一个源成功，则该结果被接受。在任一情况下，如果远程识别引擎确实成功地转录查询，则如果远程系统结果包括客户端词汇中不存在的信息，则更新客户词汇。

4.

发明申请
SYSTEM AND METHOD FOR PERFORMING DUAL MODE SPEECH RECOGNITION 有权
Title translation: 用于执行双模式语音识别的系统和方法

公开(公告)号：US20150154959A1

公开(公告)日：2015-06-04

申请号：US14621024

申请日：2015-02-12

Applicant: SoundHound, Inc.

Inventor： Timothy P. Stonehocker , Keyvan Mohajer , Bernard Mont-Reynaud

IPC: G10L15/30 , G10L15/08 , G10L15/26

CPC classification number: G10L15/30 , G10L15/04 , G10L15/063 , G10L15/08 , G10L15/265 , G10L15/34 , G10L17/06 , G10L2015/0635 , G10L2015/081

Abstract: A system and method is presented for performing dual mode speech recognition, employing a local recognition module on a mobile device and a remote recognition engine on a server device. The system accepts a spoken query from a user, and both the local recognition module and the remote recognition engine perform speech recognition operations on the query, returning a transcription and confidence score, subject to a latency cutoff time. If both sources successfully transcribe the query, then the system accepts the result having the higher confidence score. If only one source succeeds, then that result is accepted. In either case, if the remote recognition engine does succeed in transcribing the query, then a client vocabulary is updated if the remote system result includes information not present in the client vocabulary.

Abstract translation: 提出了一种用于执行双模式语音识别的系统和方法，在移动设备上使用本地识别模块和在服务器设备上使用远程识别引擎。该系统接受来自用户的口语查询，并且本地识别模块和远程识别引擎都对查询执行语音识别操作，返回转录和置信度得分，并受到延迟截止时间的限制。如果两个来源成功地转录查询，则系统接受具有较高置信度得分的结果。如果只有一个源成功，则该结果被接受。在任一情况下，如果远程识别引擎确实成功地转录查询，则如果远程系统结果包括客户端词汇中不存在的信息，则更新客户词汇。

5.

发明授权
Multiple service levels for automatic speech recognition 有权

公开(公告)号：US11978454B2

公开(公告)日：2024-05-07

申请号：US17447823

申请日：2021-09-16

Applicant: SoundHound, Inc.

Inventor： Timothy P. Stonehocker , Zizu Gowayyed , Matthias Eichstaedt , Seyed Majid Emami , Evelyn Jiang , Ryan Berryhill , Mathieu Ramona , Neil Veira

IPC: G10L15/30 , G10L15/16 , G10L15/26

CPC classification number: G10L15/30 , G10L15/16 , G10L15/26

Abstract: A system for performing automated speech recognition (ASR) on audio data includes a queue manager to receive a request to perform ASR on audio data, add the request to a queue of incoming requests, and determine a queue depth representing a number of requests in the queue at a given time. The system also includes a load supervisor to receive the request and the queue depth from the queue manager and assign a service level for the request based on the queue depth. In addition, the system includes a speech-to-text converter to receive the assigned service level for the request from the load supervisor, select an ASR model for the request based on the received service level, receive the audio data associated with the request, and perform ASR on the audio data using the selected ASR model.

6.

发明申请
SYSTEM AND METHOD FOR CONTROLLING AN APPLICATION USING NATURAL LANGUAGE COMMUNICATION 审中-公开

公开(公告)号：US20200335101A1

公开(公告)日：2020-10-22

申请号：US16388867

申请日：2019-04-19

Applicant: SoundHound, Inc.

Inventor： Kathleen Worthington McMahon , Timothy P. Stonehocker

IPC: G10L15/22 , G10L15/193 , G10L15/06 , G06F3/16

Abstract: A system and method are disclosed for setting up a communication link between a device or application and a system with a controller. The controller can collect and send information to the application. A user interfaces with the controller to access the functionality of the application through providing commands to the controller. The system allows the user to interface with multiple applications.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification