-
Publication No.: US20210286844A1
Publication Date: 2021-09-16
Application No.: US17327184
Application Date: 2021-05-21
Applicant: Google LLC
Inventor: Alexander Collins , Ian James Leader , Yunkai Zhou , Gaurav Bhaya , Robert Stets
IPC: G06F16/632 , G10L15/08 , G10L25/54 , G06F16/951 , G06F16/9032 , G06F16/9532
Abstract: Routing packetized actions in a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signal to identify a trigger keyword and a request, and generate an action data structure. The action data structure can include digital components and entity-action pairs.
-
Publication No.: US11093692B2
Publication Date: 2021-08-17
Application No.: US15638304
Application Date: 2017-06-29
Applicant: GOOGLE LLC
Inventor: Boon-Lock Yeo , Xuemei Gu , Gangjiang Li , Gaurav Bhaya , Robert Stets
IPC: G06F40/134 , G06K9/00 , G06F16/432 , G06F16/583 , G06F40/279 , G06K9/46 , G06K9/62
Abstract: Systems and methods for extracting audiovisual features from images and other digital components. A data processing system can extract image data and image features from an input image. The data processing system can match the image features to the image features of a plurality of images to identify candidate images. A second image can be selected from the candidate images based on a request that the data processing system received with the input image.
-
Publication No.: US10956485B2
Publication Date: 2021-03-23
Application No.: US15590861
Application Date: 2017-05-09
Applicant: GOOGLE LLC
Inventor: Wei-Hsin Lee , Jacob D. Schonberg , Chiu Wah Kelvin So , Jianfeng Shen , Gaurav Bhaya , Robert Stets
IPC: G06F16/00 , G06F16/48 , G06F16/432
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for retargeting content in a search environment. A method can include receiving a request for a content item to be provided with a search results page and determining that one or more retargeted content items are eligible for presentation with the search results page. Each retargeted content item is a content item that is eligible for presentation with the search results page based on: (1) the search query matching a targeting keyword for the retargeted content item, and (2) the user identifier matching a retargeted identifier that is included in a retargeting set for the retargeted content item. A responsive content item to be presented with the search results page is selected, based at least in part on bids that are associated with the retargeted content items, and data specifying the responsive content item are provided.
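The two eligibility conditions and the bid-based selection in this abstract can be sketched as follows. This is a minimal illustration, not the patented method: the `RetargetedItem` type, the function names, the token-overlap notion of "matching", and the highest-bid rule are all assumptions, since the abstract only says selection is based "at least in part" on bids.

```python
from dataclasses import dataclass
from typing import FrozenSet, Iterable, Optional


@dataclass(frozen=True)
class RetargetedItem:
    """Hypothetical record for a retargeted content item."""
    item_id: str
    targeting_keywords: FrozenSet[str]  # assumed to be stored in lowercase
    retargeting_set: FrozenSet[str]     # user identifiers eligible for retargeting
    bid: float


def is_eligible(item: RetargetedItem, query: str, user_id: str) -> bool:
    """Both conditions must hold: (1) keyword match and (2) user in the retargeting set."""
    # Assumption: "matching" means any whitespace-separated query term equals a keyword.
    query_terms = set(query.lower().split())
    keyword_match = bool(query_terms & item.targeting_keywords)
    return keyword_match and user_id in item.retargeting_set


def select_responsive_item(
    items: Iterable[RetargetedItem], query: str, user_id: str
) -> Optional[RetargetedItem]:
    """Return the eligible item with the highest bid, or None if none is eligible."""
    eligible = [item for item in items if is_eligible(item, query, user_id)]
    # Assumption: the highest bid wins; a real selector would likely weigh other signals.
    return max(eligible, key=lambda item: item.bid, default=None)
```

An item with a large bid is still skipped when either condition fails, which is the point of the two-part eligibility test.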
-
Publication No.: US20200327121A1
Publication Date: 2020-10-15
Application No.: US16915231
Application Date: 2020-06-29
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert Stets
IPC: G06F16/2455 , G10L15/18 , G10L15/30 , G10L15/22 , G06F16/242
Abstract: Systems and methods of voice activated thread management in a voice activated data packet based environment are provided. A natural language processor (“NLP”) component can receive and parse data packets comprising a first input audio signal to identify a first request and a first trigger keyword. A direct action application programming interface (“API”) can generate a first action data structure with a parameter defining a first action. The NLP component can receive and parse a second input audio signal to identify a second request and a second trigger keyword, and can generate a second action data structure with a parameter defining a second action. A pooling component can generate the first and second action data structures into a pooled data structure, and can transmit the pooled data structure to a service provider computing device to cause the service provider computing device to perform an operation defined by the pooled data structure.
-
Publication No.: US20200322396A1
Publication Date: 2020-10-08
Application No.: US16909375
Application Date: 2020-06-23
Applicant: Google LLC
Inventor: Justin Lewis , Richard Rapp , Gaurav Bhaya , Robert Stets
IPC: H04L29/06 , G06F9/50 , G06F9/451 , G10L15/18 , H04L12/721
Abstract: A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify first and second candidate interfaces, and respective resource utilization values. The interface management component can select, based on the resource utilization values, the first candidate interface to present the content item. The interface management component can provide the first action data structure to the client computing device for rendering as audio output, and can transmit the content item converted for a first modality to deliver the content item for rendering from the selected interface.
-
Publication No.: US10719515B2
Publication Date: 2020-07-21
Application No.: US16546623
Application Date: 2019-08-21
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert Stets
IPC: G10L15/00 , G06F16/2455 , G10L15/18 , G10L15/30 , G10L15/22 , G06F16/242 , G10L15/08
Abstract: Systems and methods of voice activated thread management in a voice activated data packet based environment are provided. A natural language processor (“NLP”) component can receive and parse data packets comprising a first input audio signal to identify a first request and a first trigger keyword. A direct action application programming interface (“API”) can generate a first action data structure with a parameter defining a first action. The NLP component can receive and parse a second input audio signal to identify a second request and a second trigger keyword, and can generate a second action data structure with a parameter defining a second action. A pooling component can generate the first and second action data structures into a pooled data structure, and can transmit the pooled data structure to a service provider computing device to cause the service provider computing device to perform an operation defined by the pooled data structure.
-
Publication No.: US20180322878A1
Publication Date: 2018-11-08
Application No.: US16039202
Application Date: 2018-07-18
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert Stets
CPC classification number: G10L15/22 , G06F3/165 , G10L15/14 , G10L15/1822 , G10L15/26 , G10L15/30 , G10L2015/088 , H04L47/25
Abstract: A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify first and second candidate interfaces, and respective resource utilization values. The interface management component can select, based on the resource utilization values, the first candidate interface to present the content item. The interface management component can provide the first action data structure to the client computing device for rendering as audio output, and can transmit the content item converted for a first modality to deliver the content item for rendering from the selected interface.
-
Publication No.: US20180191713A1
Publication Date: 2018-07-05
Application No.: US15863042
Application Date: 2018-01-05
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert Stets
IPC: H04L29/06 , H04W4/02 , H04L29/08 , G10L17/24 , G06F21/34 , G10L15/18 , G10L25/51 , G06F21/32 , G10L17/02 , G10L15/08
Abstract: The present disclosure is generally directed to a data processing system for authenticating packetized audio signals in a voice activated computer network environment. The data processing system can improve the efficiency and effectiveness of auditory data packet transmission over one or more computer networks by, for example, disabling malicious transmissions prior to their transmission across the network. The present solution can also improve computational efficiency by disabling remote computer processes possibly affected by or caused by the malicious audio signal transmissions. By disabling the transmission of malicious audio signals, the system can reduce bandwidth utilization by not transmitting the data packets carrying the malicious audio signal across the networks.
-
Publication No.: US20180096284A1
Publication Date: 2018-04-05
Application No.: US15815368
Application Date: 2017-11-16
Applicant: Google LLC
Inventor: Robert Stets , Valerie Nygaard , Bogdan Caprita , Bradley M. Abrams , Jason Brant Douglas
CPC classification number: G06Q10/063112 , G06F9/46 , G06F16/951 , G10L15/22 , G10L2015/223
Abstract: An example method includes receiving, by one or more processors, a representation of an utterance spoken at a computing device; identifying, by a first computational agent from a plurality of computational agents and based on the utterance, a multi-element task to be performed, wherein the plurality of computational agents includes one or more first party computational agents and a plurality of third-party computational agents; and performing, by the first computational agent, a first sub-set of elements of the multi-element task, wherein performing the first sub-set of elements comprises selecting a second computational agent from the plurality of computational agents to perform a second sub-set of elements of the multi-element task.
-
Publication No.: US20180096283A1
Publication Date: 2018-04-05
Application No.: US15815353
Application Date: 2017-11-16
Applicant: Google LLC
Inventor: Bo Wang , Lei Zhong , Barnaby John James , Saisuresh Krishnakumaran , Robert Stets , Bogdan Caprita , Valerie Nygaard
CPC classification number: G06Q10/063112 , G06F16/951 , G10L13/00 , G10L15/08 , G10L15/22 , G10L2015/088 , G10L2015/223
Abstract: An example method includes receiving, by a computational assistant executing at one or more processors, a representation of an utterance spoken at a computing device; identifying, based on the utterance, a task to be performed; determining a capability level of a first party (1P) agent to perform the task; determining capability levels of respective third party (3P) agents of a plurality of 3P agents to perform the task; responsive to determining that the capability level of the 1P agent does not satisfy a threshold capability level, that a capability level of a particular 3P agent of the plurality of 3P agents is a greatest of the determined capability levels, and that the capability level of the particular 3P agent satisfies the threshold capability level, selecting the particular 3P agent to perform the task; and performing one or more actions determined by the selected agent to perform the task.
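The agent-selection condition in this abstract can be illustrated with a short sketch. The `Agent` type, the numeric capability scores, and the reading of "satisfies a threshold" as greater-than-or-equal are assumptions for illustration. The abstract does not say what happens when the 1P agent does satisfy the threshold; returning the 1P agent in that case is also an assumption.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Agent:
    """Hypothetical agent record; higher capability means better suited to the task."""
    name: str
    capability: float


def select_agent(
    first_party: Agent, third_party: Sequence[Agent], threshold: float
) -> Optional[Agent]:
    """Pick an agent for a task following the conditions described in the abstract."""
    # Assumption: if the 1P agent satisfies the threshold, it handles the task itself.
    if first_party.capability >= threshold:
        return first_party
    if not third_party:
        return None
    # The candidate 3P agent is the one with the greatest determined capability level.
    best = max(third_party, key=lambda agent: agent.capability)
    # It is selected only if it also satisfies the threshold.
    return best if best.capability >= threshold else None
```

Returning `None` when no agent qualifies leaves the fallback behaviour to the caller, which the abstract does not specify.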