摘要:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for obtaining audio data corresponding to an utterance; transmitting the audio data corresponding to the utterance; receiving an indication that that utterance likely includes a communication-related voice command; in response to receiving the indication that the utterance likely includes the communication-related voice command, applying at least a language model to a representation of the audio data corresponding to the utterance, to identify data referencing a contact; and transmitting the data referencing the contact.
摘要:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, receiving audio data; determining that an initial portion of the audio data corresponds to an initial portion of a hotword; in response to determining that the initial portion of the audio data corresponds to the initial portion of the hotword, selecting, from among a set of one or more actions that are performed when the entire hotword is detected, a subset of the one or more actions; and causing one or more actions of the subset to be performed.
摘要:
Methods, systems, and apparatus for receiving, from a user, a request that includes an entity identifier associated with an entity that is referenced by one or more query terms of a search query, determining that the entity is identified in a media consumption database as a media item that has been indicated as consumed by the user or that the entity is associated with a media item that is identified in the media consumption database as a media item that has been indicated as consumed by the user, and based on the determination, providing a response to the request, the response including data indicating that the entity is a media item that has been indicated as consumed by the user or that the entity is associated with a media item that has been indicated as consumed by the user.
摘要:
Methods, systems, and apparatus for receiving a natural language query of a user, and environmental data, identifying a media item based on the environmental data, determining an entity type based on the natural language query, selecting an entity associated with the media item that matches the entity type, selecting, from a media consumption database that identifies media items that have been indicated as consumed by the user, one or more media items that have been indicated as consumed by the user and that are associated with the selected entity, and providing a response to the query based on selecting the one or more media items that have been indicated as consumed by the user and that are associated with the selected entity.
摘要:
Systems and methods are disclosed for providing device-specific instructions in response to a perception of a media content segment. In one implementation, a processing device captures, at a user device, one or more media content segments. The processing device provides the one or more media content segments to a remote device. The processing device receives one or more instructions, each of the one or more instructions being associated with at least one of the one or more media content segments and corresponding to one or more operations. The processing device initiates execution of at least one of the one or more instructions.
摘要:
An example method comprises determining, by a computing device, an indication of video content, determining, by the computing device and based at least in part on the indication, one or more candidate albums, selecting, by the computing device, a particular candidate album of the one or more candidate albums based at least in part on a match between an audio fingerprint of an audio track included in the video content and an audio fingerprint of an audio track included in the particular candidate album, and sending, by the computing device, a message that associates the video content with the particular candidate album.
摘要:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for verifying an identity of a user. The methods, systems, and apparatus include actions of receiving a request for a verification phrase for verifying an identity of a user. Additional actions include, in response to receiving the request for the verification phrase for verifying the identity of the user, identifying subwords to be included in the verification phrase and in response to identifying the subwords to be included in the verification phrase, obtaining a candidate phrase that includes at least some of the identified subwords as the verification phrase. Further actions include providing the verification phrase as a response to the request for the verification phrase for verifying the identity of the user.
摘要:
Systems, methods, routines and/or techniques for synchronized content playback related to content recognition are described. A software program may cause a video to play synchronously with a song, for example, a song that is playing in an ambient environment such as a café or bar. In some embodiments, a client device may sense a song and the client device may communicate audio data related to the song to a remote server, and the remote server may identify a song that is related to the audio data. The remote server may also identify one or more videos (e.g., in a video database) that relate to the song. The remote server may communicate one or more of the videos (e.g., a link/URL) back to the client device such that the client device can play one of the videos synchronously with the song, even if playback of the video is delayed.
摘要:
Systems and methods for matching live media content are disclosed. At a server, obtaining first media content from a client device, herein the first media content corresponds to a portion of media content being played on the client device, and the first media content is associated with a predefined expiration time; obtaining second media content from one or more content feeds, wherein the second media content also corresponds to a portion of the media content being played on the client device; in accordance with a determination that the second media content corresponds to a portion of the media content that has been played on the client device: before the predefined expiration time, obtaining third media content corresponding to the media content being played on the client device, from the one or more content feeds; and comparing the first media content with the third media content.
摘要:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving (i) audio data that encodes a spoken natural language query, and (ii) environmental audio data, obtaining a transcription of the spoken natural language query, determining a particular content type associated with one or more keywords in the transcription, providing at least a portion of the environmental audio data to a content recognition engine, and identifying a content item that has been output by the content recognition engine, and that matches the particular content type.