System and methods for offline audio recognition

    公开(公告)号:US09619560B1

    公开(公告)日:2017-04-11

    申请号:US14884650

    申请日:2015-10-15

    CPC classification number: G06F17/30743 G10L15/08 G10L25/54

    Abstract: In one implementation, a method is described of retrying matching of an audio query against audio references. The method includes receiving a follow-up query that requests a retry at matching a previously submitted audio query. In some implementations, this follow-up query is received without any recognition hint that suggests how to retry matching. The follow-up query includes the audio query or a reference to the audio query to be used in the retry. The method further includes retrying matching the audio query using retry matching resources that include an expanded group of audio references, identifying at least one match and transmitting a report of the match. Optionally, the method includes storing data that correlates the follow-up query, the audio query or the reference to the audio query, and the match after retrying.

    System and Methods for Continuous Audio Matching
    24.
    发明申请
    System and Methods for Continuous Audio Matching 审中-公开
    用于连续音频匹配的系统和方法

    公开(公告)号:US20160292266A1

    公开(公告)日:2016-10-06

    申请号:US15182300

    申请日:2016-06-14

    Abstract: The present invention relates to the continuous monitoring of an audio signal and identification of audio items within an audio signal. The technology disclosed utilizes predictive caching of fingerprints to improve efficiency. Fingerprints are cached for tracking an audio signal with known alignment and for watching an audio signal without known alignment, based on already identified fingerprints extracted from the audio signal. Software running on a smart phone or other battery-powered device cooperates with software running on an audio identification server.

    Abstract translation: 本发明涉及音频信号的连续监视和音频信号内的音频项目的识别。 所公开的技术利用指纹的预测性缓存来提高效率。 基于从音频信号提取的已经识别的指纹,缓存指纹用于跟踪具有已知对准的音频信号并且用于观看没有已知对准的音频信号。 在智能手机或其他电池供电设备上运行的软件与在音频识别服务器上运行的软件配合使用。

    Using phonetic variants in a local context to improve natural language understanding

    公开(公告)号:US11295730B1

    公开(公告)日:2022-04-05

    申请号:US16529689

    申请日:2019-08-01

    Abstract: A method is described that includes processing text and speech from an input utterance using local overrides of default dictionary pronunciations. Applying this method, a word-level grammar used to process the tokens specifies at least one local word phonetic variant that applies within a specific production rule and, within a local context of the specific production rule, the local word phonetic variant overrides one or more default dictionary phonetic versions of the word. This method can be applied to parsing utterances where the pronunciation of some words depends on their syntactic or semantic context.

    Parsing to determine interruptible state in an utterance by detecting pause duration and complete sentences

    公开(公告)号:US10832005B1

    公开(公告)日:2020-11-10

    申请号:US16243920

    申请日:2019-01-09

    Abstract: The technology disclosed relates to computer-implemented conversational agents and particularly to detecting a point in the dialog (end of turn, or end of utterance) at which the agent can start responding to the user. The technology disclosed provides a method of incrementally parsing an input utterance with multiple parses operating in parallel. The technology disclosed includes detecting an interjection point in the input utterance when a pause exceeds a high threshold, or detecting an interjection point in the input utterance when a pause exceeds a low threshold and at least one of the parallel parses is determined to be interruptible by matching a complete sentence according to the grammar. The conversational agents start responding to the user at a detected interjection point.

    Information Retrieval According To A User Interest Model

    公开(公告)号:US20200219490A1

    公开(公告)日:2020-07-09

    申请号:US16822933

    申请日:2020-03-18

    Abstract: Systems and methods are provided for providing relevant information in response to natural language expressions. The expressions may be part of a spoken conversation between people either together or remotely. The information may be provided visually. Whether a piece of information is relevant to display can be conditioned by a model of the interest of the speaker. The interest model can be based on a history of expressions by the speaker and information from a user profile. The display of information can also be conditioned on a current conversation topic and on whether the same information has been displayed recently.

Patent Agency Ranking