Segment-based speaker verification using dynamically generated phrases
    112.
    发明授权
    Segment-based speaker verification using dynamically generated phrases 有权
    使用动态生成的短语进行基于段的演讲者验证

    公开(公告)号:US09424846B2

    公开(公告)日:2016-08-23

    申请号:US14447115

    申请日:2014-07-30

    Applicant: Google Inc.

    CPC classification number: G10L17/24 G10L15/02 G10L17/04 G10L2015/025

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for verifying an identity of a user. The methods, systems, and apparatus include actions of receiving a request for a verification phrase for verifying an identity of a user. Additional actions include, in response to receiving the request for the verification phrase for verifying the identity of the user, identifying subwords to be included in the verification phrase and in response to identifying the subwords to be included in the verification phrase, obtaining a candidate phrase that includes at least some of the identified subwords as the verification phrase. Further actions include providing the verification phrase as a response to the request for the verification phrase for verifying the identity of the user.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于验证用户的身份。 方法,系统和装置包括接收用于验证用户身份的验证短语的请求的动作。 附加动作包括响应于接收到用于验证用户身份的验证短语的请求,识别要包括在验证短语中的子词,并且响应于识别要包括在验证短语中的子词,获得候选短语 其包括至少一些所识别的子词作为验证短语。 进一步的操作包括提供验证短语作为对用于验证用户身份的验证短语的请求的响应。

    Associating audio tracks of an album with video content
    113.
    发明授权
    Associating audio tracks of an album with video content 有权
    将相册的音轨与视频内容相关联

    公开(公告)号:US09344759B2

    公开(公告)日:2016-05-17

    申请号:US13786132

    申请日:2013-03-05

    Applicant: Google Inc.

    Inventor: Matthew Sharifi

    Abstract: An example method comprises determining, by a computing device, an indication of video content, determining, by the computing device and based at least in part on the indication, one or more candidate albums, selecting, by the computing device, a particular candidate album of the one or more candidate albums based at least in part on a match between an audio fingerprint of an audio track included in the video content and an audio fingerprint of an audio track included in the particular candidate album, and sending, by the computing device, a message that associates the video content with the particular candidate album.

    Abstract translation: 一个示例性方法包括由计算设备确定视频内容的指示,由计算设备确定并至少部分地基于该指示,一个或多个候选专辑,由计算设备选择特定候选专辑 至少部分地基于视频内容中包括的音频轨道的音频指纹与包含在该特定候选相册中的音轨的音频指纹之间的匹配,以及由计算设备发送一个或多个候选相册 ,将视频内容与特定候选专辑相关联的消息。

    Hotword detection on multiple devices

    公开(公告)号:US09318107B1

    公开(公告)日:2016-04-19

    申请号:US14675932

    申请日:2015-04-01

    Applicant: Google Inc.

    Inventor: Matthew Sharifi

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a method includes the actions of receiving, by a first computing device, audio data that corresponds to an utterance. The actions further include determining a first value corresponding to a likelihood that the utterance includes a hotword. The actions further include receiving a second value corresponding to a likelihood that the utterance includes the hotword, the second value being determined by a second computing device. The actions further include comparing the first value and the second value. The actions further include based on comparing the first value to the second value, initiating speech recognition processing on the audio data.

    Promoting voice actions to hotwords
    116.
    发明授权
    Promoting voice actions to hotwords 有权
    促进语音动作到热门词汇

    公开(公告)号:US09263035B2

    公开(公告)日:2016-02-16

    申请号:US14221520

    申请日:2014-03-21

    Applicant: Google Inc.

    Inventor: Matthew Sharifi

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for designating certain voice commands as hotwords. The methods, systems, and apparatus include actions of receiving a hotword followed by a voice command. Additional actions include determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, where a voice command that is designated as a hotword is treated as a voice input regardless of whether the voice command is preceded by another hotword. Further actions include, in response to determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, designating the voice command as a hotword.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于将某些语音命令指定为热词。 方法,系统和装置包括接收随后的语音命令的热门词汇的动作。 附加动作包括确定语音命令满足与指定语音命令相关联的一个或多个预定准则作为热门词,其中指定为热门词汇的语音命令被视为语音输入,而不管语音命令是否在另一个之前 热门词 响应于确定语音命令满足与指定语音命令相关联的一个或多个预定标准作为热门词,响应于指定语音命令作为热门词语。

    Unified recognition of speech and music
    118.
    发明授权
    Unified recognition of speech and music 有权
    语音和音乐的统一认可

    公开(公告)号:US09224385B1

    公开(公告)日:2015-12-29

    申请号:US13919170

    申请日:2013-06-17

    Applicant: Google Inc.

    Abstract: Methods, systems, and computer programs are presented for unified recognition of speech and music. One method includes an operation for starting an audio recognition mode by a computing device while receiving an audio stream. Segments of the audio stream are analyzed as the audio stream is received, where the analysis includes simultaneous checking for speech and music. Further, the method includes an operation for determining a first confidence score for speech and a second confidence score for music. As the audio stream is received, additional segments are analyzed until the end of the audio stream or until the first and second confidence scores indicate that the audio stream has been identified as speech or music. Further, results are presented on a display based on the identification of the audio stream, including text entered if the audio stream was speech or song information if the audio stream was music.

    Abstract translation: 提出方法,系统和计算机程序,用于统一识别语音和音乐。 一种方法包括在接收音频流的同时由计算设备启动音频识别模式的操作。 当接收到音频流时,分析音频流的分段,其中分析包括语音和音乐的同时检查。 此外,该方法包括用于确定用于语音的第一可信度得分和用于音乐的第二可信度得分的操作。 当音频流被接收时,分析附加段直到音频流的结束,或者直到第一和第二置信度得分指示音频流已经被识别为语音或音乐。 此外,如果音频流是音乐,则在显示器上显示结果,该显示器基于音频流的标识,包括输入的文本,如果音频流是语音或歌曲信息。

    IDF weighting of LSH bands for live reference ingestion
    119.
    发明授权
    IDF weighting of LSH bands for live reference ingestion 有权
    用于实时参考摄取的LSH带的IDF加权

    公开(公告)号:US09208154B1

    公开(公告)日:2015-12-08

    申请号:US14458387

    申请日:2014-08-13

    Applicant: Google Inc.

    Abstract: Down scoring overcrowded bands via IDF weighting scores provides a soft way to reduce the effect of common bands from Locality Sensitive Hashing (LSH) processes. An index component indexes live video references of a live streaming infrastructure pathway process in a reference index. A scoring component scores a set of bands with a set of inverse document frequency (IDF) weighting scores in the reference index. A high score is generated for bands that are featured in a small number of references and a low score is generated for bands featured in a high number of references.

    Abstract translation: 通过IDF加权分数的下划线过度拥挤的频带提供了一种柔性的方法来减少局部敏感哈希(LSH)过程中常用频带的影响。 索引组件在参考索引中索引实况流基础设施路径进程的实时视频参考。 评分组件在参考指标中以一组逆文档频率(IDF)加权分数对一组频带进行评分。 对于以少量参考为特征的频带,产生高分,并且对于大量参考中的频带生成低分数。

    Classifying music by genre using discrete cosine transforms
    120.
    发明授权
    Classifying music by genre using discrete cosine transforms 有权
    使用离散余弦变换对流派进行音乐分类

    公开(公告)号:US09055376B1

    公开(公告)日:2015-06-09

    申请号:US13791131

    申请日:2013-03-08

    Applicant: Google Inc.

    CPC classification number: H04R3/00 G06F17/30743 H04R2430/03

    Abstract: Systems and methods are provided herein relating to audio classification. Genres of music can be identified by detecting unique spectral features inherent to those genres. One example genre detected is techno music. Two dimensional discrete cosine transforms can be generated for consecutive windows of the spectrogram or chromagram. A max value of the energy of portions of the two dimensional discrete cosine transforms can be determined. The max value can be normalized and aggregated with max values related to neighboring windows. If the aggregate scores meet a genre threshold, the audio sample, or portions thereof, can be associated with a genre of music.

    Abstract translation: 本文提供了与音频分类有关的系统和方法。 可以通过检测这些类型固有的独特光谱特征来识别音乐类型。 检测到的一个例子是技术音乐。 可以为光谱图或色谱图的连续窗口生成二维离散余弦变换。 可以确定二维离散余弦变换的部分的能量的最大值。 最大值可以与相邻窗口相关的最大值进行归一化和聚合。 如果聚合分数满足类型阈值,则音频样本或其部分可以与音乐类型相关联。

Patent Agency Ranking