-
公开(公告)号:US20190377732A1
公开(公告)日:2019-12-12
申请号:US16546623
申请日:2019-08-21
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert Stets
IPC: G06F16/2455 , G10L15/18 , G10L15/30 , G10L15/22 , G06F16/242
Abstract: Systems and methods of voice activated thread management in a voice activated data packet based environment are provided. A natural language processor (“NLP”) component can receive and parse data packets comprising a first input audio signal to identify a first request and a first trigger keyword. A direct action application programming interface (“API”) can generate a first action data structure with a parameter defining a first action. The NLP component can receive and parse a second input audio signal to identify a second request and a second trigger keyword, and can generate a second action data structure with a parameter defining a second action. A pooling component can generate the first and second action data structures into a pooled data structure, and can transmit the pooled data structure to a service provider computing device to cause it device to perform an operation defined by the pooled data structure.
-
公开(公告)号:US20190304462A1
公开(公告)日:2019-10-03
申请号:US16447718
申请日:2019-06-20
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert Stets
IPC: G10L15/22 , G10L13/027 , G10L21/003 , H04L29/06 , G10L21/0316 , G06F17/27 , G10L15/30 , G10L15/18
Abstract: Modulating packetized audio signals in a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signal to identify trigger keyword and request, and generate a first action data structure. The system can identify a content item object based on the trigger keyword, and generate an output signal comprising a first portion corresponding to the first action data structure and a second portion corresponding to the content item object. The system can apply a modulation to the first or second portion of the output signal, and transmit the modulated output signal to the device.
-
公开(公告)号:US20190180770A1
公开(公告)日:2019-06-13
申请号:US15943506
申请日:2018-04-02
Applicant: Google LLC
Inventor: Anshul Kothari , Gaurav Bhaya , Tarun Jain
Abstract: Coordinating signal processing among computing devices in a voice-driven computing environment is provided. A first and second digital assistant can detect an input audio signal, perform a signal quality check, and provide indications that the first and second digital assistants are operational to process the input audio signal. A system can select the first digital assistant for further processing. The system can receive, from the first digital assistant, data packets including a command. The system can generate, for a network connected device selected from a plurality of network connected devices, an action data structure based on the data packets, and transmit the action data structure to the selected network connected device.
-
54.
公开(公告)号:US20190179608A1
公开(公告)日:2019-06-13
申请号:US15836746
申请日:2017-12-08
Applicant: Google LLC
Inventor: Anshul Kothari , Gaurav Bhaya , Tarun Jain
CPC classification number: G06F3/167 , G06F9/451 , G06F9/453 , G06F16/2365 , G06F16/3329 , G06F16/9535 , G06N20/00 , G10L15/1815 , G10L15/1822 , G10L15/22 , G10L15/265 , H04L51/02 , H04L67/20
Abstract: Managing rendering of a graphical user interface is provided. A system receives data packets comprising an input audio signal. The system determines an application identifier and query. The system provides the query to the application to cause the application to generate a second query for transmission to a third-party server, and identify responses to the query. The system intercepts the responses, and generates a keyword based on the responses. The system selects a digital component using the keyword, executes a deduplication process, and determines to add the digital component to the responses. The system constructs a display output using a graphical user interface template that integrates the plurality of responses generated by the application with the digital component, and provides the display output to the computing device for rendering.
-
公开(公告)号:US20180322879A1
公开(公告)日:2018-11-08
申请号:US16039204
申请日:2018-07-18
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert Stets
CPC classification number: G10L15/22 , G06F3/167 , G06F17/2765 , G10L15/1822 , G10L2015/088 , G10L2015/223
Abstract: A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify first and second candidate interfaces, and respective resource utilization values. The interface management component can select, based on the resource utilization values, the first candidate interface to present the content item. The interface management component can provide the first action data structure to the client computing device for rendering as audio output, and can transmit the content item converted for a first modality to deliver the content item for rendering from the selected interface.
-
公开(公告)号:US20180308484A1
公开(公告)日:2018-10-25
申请号:US16018854
申请日:2018-06-26
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert Stets
IPC: G10L15/22 , G10L15/00 , G06F17/28 , G06F17/30 , G10L15/18 , G10L15/14 , G06F3/16 , G10L15/26 , G10L15/08
CPC classification number: G10L15/26 , G06F3/16 , G06F3/167 , G06F17/277 , G06F17/28 , G06F17/30654 , G06F17/30684 , G10L15/00 , G10L15/14 , G10L15/1822 , G10L15/22 , G10L2015/088 , G10L2015/223
Abstract: Optimization of sequence dependent operations in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. A prediction component can determine a thread based on the trigger keyword and the request that includes a first action, a second action subsequent to the first action, and a third action subsequent to the second action. A content selector component can select, based on the third action and the trigger keyword, a content item. An audio signal generator component can generate an output signal comprising the content item. An interface can transmit the output signal to cause a client computing device to drive a speaker to generate an acoustic wave corresponding to the output signal prior to occurrence of at least one of the first action and the second action.
-
公开(公告)号:US20180247654A1
公开(公告)日:2018-08-30
申请号:US15966587
申请日:2018-04-30
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert Stets , Justin Lewis , Ruxandra Davies
CPC classification number: G10L15/265 , G06F3/167 , G06F17/2705 , G06F17/2765 , G10L15/22 , G10L2015/223
Abstract: Identifier dependent operation processing of packet based data communication is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. A content selector component can select, based on the request or trigger keyword, a content item. A link generation component can determine whether the client computing device has an account or a record in a database associated with the service provider device. In the absence of the record or account, the link generation device generates and sends a virtual identifier to the service provider device with instructions to generate an account in the database using the virtual identifier. Once the account is created, the service provider device can communicate with the client computing device.
-
公开(公告)号:US20180137267A1
公开(公告)日:2018-05-17
申请号:US15862963
申请日:2018-01-05
Applicant: Google LLC
Inventor: Ken Krieger , Andrew Joseph Alexander Gildfind , Nicholas Salvatore Arini , Simon Michael Rowe , Raimundo Mirisola , Gaurav Bhaya , Robert Stets
CPC classification number: G06F21/32 , G06F21/316 , G06F21/34 , G06F21/35 , G06K9/00288 , G10L17/005 , G10L17/24 , H04L63/0861 , H04L63/107
Abstract: The present disclosure is generally directed a data processing system for authenticating packetized audio signals in a voice activated computer network environment. The data processing system can improve the efficiency and effectiveness of auditory data packet transmission over one or more computer networks by, for example, disabling malicious transmissions prior to their transmission across the network. The present solution can also improve computational efficiency by disabling remote computer processes possibly affected by or caused by the malicious audio signal transmissions. By disabling the transmission of malicious audio signals, the system can reduce bandwidth utilization by not transmitting the data packets carrying the malicious audio signal across the networks.
-
公开(公告)号:US20180075493A1
公开(公告)日:2018-03-15
申请号:US15815398
申请日:2017-11-16
Applicant: Google LLC
Inventor: Amit Agarwal , Surojit Chatterjee , Gaurav Bhaya , Anshul Kothari , Vibhor Nanavati
CPC classification number: G06Q30/0275 , G06Q30/0252 , G06Q30/0261 , H04L67/26
Abstract: The present disclosure is directed to systems and methods of providing content. A server can generate a request for a push content item for an account identifier linked with a computing device. The server can establish a push auction for the account identifier with multiple candidate push content items. The server can determine an auction score for each candidate push content item and select a push content item therefrom based on the auction score. The server can determine a parameter for the account identifier and control delivery of the selected push content item based on a delivery control policy. The server can compare a value of the parameter with a threshold value to authorize the push content item. The server can provide the selected and authorized push content item for presentation in a push content slot via the computing device linked to the account identifier.
-
公开(公告)号:US12243521B2
公开(公告)日:2025-03-04
申请号:US17387608
申请日:2021-07-28
Applicant: Google LLC
Inventor: Gaurav Bhaya , Robert Stets
IPC: G10L15/00 , G06F3/16 , G10L15/14 , G10L15/18 , G10L15/22 , G10L15/26 , G10L15/30 , H04L47/25 , G10L15/08
Abstract: A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify first and second candidate interfaces, and respective resource utilization values. The interface management component can select, based on the resource utilization values, the first candidate interface to present the content item. The interface management component can provide the first action data structure to the client computing device for rendering as audio output, and can transmit the content item converted for a first modality to deliver the content item for rendering from the selected interface.
-
-
-
-
-
-
-
-
-