-
公开(公告)号:US10748541B2
公开(公告)日:2020-08-18
申请号:US16666780
申请日:2019-10-29
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert Stets , Umesh Patil
Abstract: A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify first and second candidate interfaces, and respective resource utilization values. The interface management component can select, based on the resource utilization values, the first candidate interface to present the content item. The interface management component can provide the first action data structure to the client computing device for rendering as audio output, and can transmit the content item converted for a first modality to deliver the content item for rendering from the selected interface.
-
公开(公告)号:US11663535B2
公开(公告)日:2023-05-30
申请号:US15815368
申请日:2017-11-16
Applicant: Google LLC
Inventor: Robert Stets , Valerie Nygaard , Bogdan Caprita , Bradley M. Abrams , Jason Brant Douglas
IPC: G06Q10/0631 , G10L15/22 , G06F16/951
CPC classification number: G06Q10/063112 , G10L15/22 , G06F16/951 , G10L2015/223
Abstract: An example method includes receiving, by one or more processors, a representation of an utterance spoken at a computing device; identifying, by a first computational agent from a plurality of computational agents and based on the utterance, a multi-element task to be performed, wherein the plurality of computational agents includes one or more first party computational agents and a plurality of third-party computational agents; and performing, by the first computational agent, a first sub-set of elements of the multi-element task, wherein performing the first sub-set of elements comprises selecting a second computational agent from the plurality of computational agents to perform a second sub-set of elements of the multi-element task.
-
公开(公告)号:US11482216B2
公开(公告)日:2022-10-25
申请号:US16447718
申请日:2019-06-20
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert Stets
IPC: G10L15/22 , H04L65/1069 , G10L15/18 , G10L15/30 , G10L21/003 , G10L21/0316 , G10L13/027 , G06F40/205 , G10L15/08
Abstract: Modulating packetized audio signals in a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signal to identify trigger keyword and request, and generate a first action data structure. The system can identify a content item object based on the trigger keyword, and generate an output signal comprising a first portion corresponding to the first action data structure and a second portion corresponding to the content item object. The system can apply a modulation to the first or second portion of the output signal, and transmit the modulated output signal to the device.
-
公开(公告)号:US11120195B2
公开(公告)日:2021-09-14
申请号:US16816077
申请日:2020-03-11
Applicant: Google LLC
Inventor: Graeme John Rimmer , Lewis Jay Hemens , Gaurav Bhaya , Robert Stets
IPC: G06F40/103 , G06F40/131 , G06F40/186 , G06F40/189 , H04L29/08 , H04L29/06 , G10L15/22
Abstract: Systems and methods for automatically determining a content item size may be based on a size of a viewport and a width of a parent element. A script may be configured to determine a size of a viewport, determine a width of a parent element of a resource, and determine a content item size based, at least in part, on the size of the view port and the width of the parent element. A dimension of the determined content item size may be used by a content item selection system to determine a set of content items. A content item selection system may select a content item from the determined set of content items and serve data to effect display of the selected content item in the parent element with the resource.
-
公开(公告)号:US11030239B2
公开(公告)日:2021-06-08
申请号:US15584746
申请日:2017-05-02
Applicant: GOOGLE LLC
Inventor: Alexander Collins , Ian James Leader , Yunkai Zhou , Gaurav Bhaya , Robert Stets
IPC: G06F17/30 , G06F16/632 , G10L15/08 , G10L25/54 , G06F16/951 , G06F16/9032
Abstract: Routing packetized actions in a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signal to identify trigger keyword and request, and generate an action data structure. The action data structure can include digital components and entity-action pairs.
-
公开(公告)号:US11017428B2
公开(公告)日:2021-05-25
申请号:US15604319
申请日:2017-05-24
Applicant: Google LLC
Inventor: Mark J. Foladare , Richard L. Bennett , Gaurav Bhaya , Robert Stets
IPC: G06Q30/00 , G06Q30/02 , G06Q50/18 , G06F16/955 , H04L29/08
Abstract: Disclosed are systems and methods for adjusting the frequency of data transmissions in a voice activated data packet based environment. A pooling component can generate first and second action data structures into a pooled data structure, and can transmit the pooled data structure to a service provider computing device to cause it device to perform an operation defined by the pooled data structure. Based on characteristics of the client devices, the system can select transmission rates for the transmission of operations associated with the pooled data structure to each of the client devices.
-
37.
公开(公告)号:US10957002B2
公开(公告)日:2021-03-23
申请号:US15596943
申请日:2017-05-16
Applicant: GOOGLE LLC
Inventor: Surojit Chatterjee , Terry Van Belle , Anshul Kothari , Jian Zhou , Paul Feng , Ravi Jain , Nandita Narasimha Prabhu , Yun Huang , Gaurav Bhaya , Robert Stets
Abstract: Various methods, systems, and computer program products are disclosed for communicating location-based digital components to a mobile and other devices. A natural language processor component can parse an input audio signal to identify a request and a keyword. A content selector can select digital components based on keyword and request. An audio signal generator component can generate an output signal that includes a selected digital components. An interface can transmit the output signal to cause a client computing device to drive a speaker to generate an acoustic wave corresponding to the output signal prior to occurrence of at least one of the first action and the second action.
-
公开(公告)号:US10854198B2
公开(公告)日:2020-12-01
申请号:US16018854
申请日:2018-06-26
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert Stets
IPC: G10L15/22 , G10L15/26 , G06F3/16 , G06F16/332 , G06F16/33 , G06F40/40 , G06F40/284 , G10L15/00 , G10L15/14 , G10L15/18 , G10L15/08
Abstract: Optimization of sequence dependent operations in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. A prediction component can determine a thread based on the trigger keyword and the request that includes a first action, a second action subsequent to the first action, and a third action subsequent to the second action. A content selector component can select, based on the third action and the trigger keyword, a content item. An audio signal generator component can generate an output signal comprising the content item. An interface can transmit the output signal to cause a client computing device to drive a speaker to generate an acoustic wave corresponding to the output signal prior to occurrence of at least one of the first action and the second action.
-
公开(公告)号:US10854188B2
公开(公告)日:2020-12-01
申请号:US16417024
申请日:2019-05-20
Applicant: Google LLC
Inventor: Valerie Nygaard , Bogdan Caprita , Robert Stets , Saisuresh Krishnakumaran , Jason Brant Douglas
Abstract: An example method includes receiving, by a computational assistant executing at one or more processors, a representation of an utterance spoken at a computing device; selecting, based on the utterance, an agent from a plurality of agents, wherein the plurality of agents includes one or more first party agents and a plurality of third-party agents; responsive to determining that the selected agent comprises a first party agent, selecting a reserved voice from a plurality of voices; and outputting synthesized audio data using the selected voice to satisfy the utterance.
-
公开(公告)号:US20200098369A1
公开(公告)日:2020-03-26
申请号:US16696622
申请日:2019-11-26
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert Stets
Abstract: A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify first and second candidate interfaces, and respective resource utilization values. The interface management component can select, based on the resource utilization values, the first candidate interface to present the content item. The interface management component can provide the first action data structure to the client computing device for rendering as audio output, and can transmit the content item converted for a first modality to deliver the content item for rendering from the selected interface.
-
-
-
-
-
-
-
-
-