-
公开(公告)号:US20190377732A1
公开(公告)日:2019-12-12
申请号:US16546623
申请日:2019-08-21
申请人: Google LLC
发明人: Gaurav Bhaya , Robert Stets
IPC分类号: G06F16/2455 , G10L15/18 , G10L15/30 , G10L15/22 , G06F16/242
摘要: Systems and methods of voice activated thread management in a voice activated data packet based environment are provided. A natural language processor (“NLP”) component can receive and parse data packets comprising a first input audio signal to identify a first request and a first trigger keyword. A direct action application programming interface (“API”) can generate a first action data structure with a parameter defining a first action. The NLP component can receive and parse a second input audio signal to identify a second request and a second trigger keyword, and can generate a second action data structure with a parameter defining a second action. A pooling component can generate the first and second action data structures into a pooled data structure, and can transmit the pooled data structure to a service provider computing device to cause it device to perform an operation defined by the pooled data structure.
-
公开(公告)号:US20190304462A1
公开(公告)日:2019-10-03
申请号:US16447718
申请日:2019-06-20
申请人: Google LLC
发明人: Gaurav Bhaya , Robert Stets
IPC分类号: G10L15/22 , G10L13/027 , G10L21/003 , H04L29/06 , G10L21/0316 , G06F17/27 , G10L15/30 , G10L15/18
摘要: Modulating packetized audio signals in a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signal to identify trigger keyword and request, and generate a first action data structure. The system can identify a content item object based on the trigger keyword, and generate an output signal comprising a first portion corresponding to the first action data structure and a second portion corresponding to the content item object. The system can apply a modulation to the first or second portion of the output signal, and transmit the modulated output signal to the device.
-
公开(公告)号:US10311856B2
公开(公告)日:2019-06-04
申请号:US15815375
申请日:2017-11-16
申请人: Google LLC
发明人: Valerie Nygaard , Bogdan Caprita , Robert Stets , Saisuresh Krishnakumaran , Jason Brant Douglas
摘要: An example method includes receiving, by a computational assistant executing at one or more processors, a representation of an utterance spoken at a computing device; selecting, based on the utterance, an agent from a plurality of agents, wherein the plurality of agents includes one or more first party agents and a plurality of third-party agents; responsive to determining that the selected agent comprises a first party agent, selecting a reserved voice from a plurality of voices; and outputting synthesized audio data using the selected voice to satisfy the utterance.
-
公开(公告)号:US20180322879A1
公开(公告)日:2018-11-08
申请号:US16039204
申请日:2018-07-18
申请人: Google LLC
发明人: Gaurav Bhaya , Robert Stets
CPC分类号: G10L15/22 , G06F3/167 , G06F17/2765 , G10L15/1822 , G10L2015/088 , G10L2015/223
摘要: A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify first and second candidate interfaces, and respective resource utilization values. The interface management component can select, based on the resource utilization values, the first candidate interface to present the content item. The interface management component can provide the first action data structure to the client computing device for rendering as audio output, and can transmit the content item converted for a first modality to deliver the content item for rendering from the selected interface.
-
公开(公告)号:US20180308484A1
公开(公告)日:2018-10-25
申请号:US16018854
申请日:2018-06-26
申请人: Google LLC
发明人: Gaurav Bhaya , Robert Stets
IPC分类号: G10L15/22 , G10L15/00 , G06F17/28 , G06F17/30 , G10L15/18 , G10L15/14 , G06F3/16 , G10L15/26 , G10L15/08
CPC分类号: G10L15/26 , G06F3/16 , G06F3/167 , G06F17/277 , G06F17/28 , G06F17/30654 , G06F17/30684 , G10L15/00 , G10L15/14 , G10L15/1822 , G10L15/22 , G10L2015/088 , G10L2015/223
摘要: Optimization of sequence dependent operations in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. A prediction component can determine a thread based on the trigger keyword and the request that includes a first action, a second action subsequent to the first action, and a third action subsequent to the second action. A content selector component can select, based on the third action and the trigger keyword, a content item. An audio signal generator component can generate an output signal comprising the content item. An interface can transmit the output signal to cause a client computing device to drive a speaker to generate an acoustic wave corresponding to the output signal prior to occurrence of at least one of the first action and the second action.
-
公开(公告)号:US20180247654A1
公开(公告)日:2018-08-30
申请号:US15966587
申请日:2018-04-30
申请人: Google LLC
发明人: Gaurav Bhaya , Robert Stets , Justin Lewis , Ruxandra Davies
CPC分类号: G10L15/265 , G06F3/167 , G06F17/2705 , G06F17/2765 , G10L15/22 , G10L2015/223
摘要: Identifier dependent operation processing of packet based data communication is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. A content selector component can select, based on the request or trigger keyword, a content item. A link generation component can determine whether the client computing device has an account or a record in a database associated with the service provider device. In the absence of the record or account, the link generation device generates and sends a virtual identifier to the service provider device with instructions to generate an account in the database using the virtual identifier. Once the account is created, the service provider device can communicate with the client computing device.
-
公开(公告)号:US20180137267A1
公开(公告)日:2018-05-17
申请号:US15862963
申请日:2018-01-05
申请人: Google LLC
发明人: Ken Krieger , Andrew Joseph Alexander Gildfind , Nicholas Salvatore Arini , Simon Michael Rowe , Raimundo Mirisola , Gaurav Bhaya , Robert Stets
CPC分类号: G06F21/32 , G06F21/316 , G06F21/34 , G06F21/35 , G06K9/00288 , G10L17/005 , G10L17/24 , H04L63/0861 , H04L63/107
摘要: The present disclosure is generally directed a data processing system for authenticating packetized audio signals in a voice activated computer network environment. The data processing system can improve the efficiency and effectiveness of auditory data packet transmission over one or more computer networks by, for example, disabling malicious transmissions prior to their transmission across the network. The present solution can also improve computational efficiency by disabling remote computer processes possibly affected by or caused by the malicious audio signal transmissions. By disabling the transmission of malicious audio signals, the system can reduce bandwidth utilization by not transmitting the data packets carrying the malicious audio signal across the networks.
-
公开(公告)号:US11930050B2
公开(公告)日:2024-03-12
申请号:US17856636
申请日:2022-07-01
申请人: Google LLC
发明人: Justin Lewis , Richard Rapp , Gaurav Bhaya , Robert Stets
IPC分类号: H04L65/1066 , G01S5/02 , G01S5/18 , G06F3/16 , G06F9/451 , G06F9/50 , G10L15/08 , G10L15/18 , H04L45/00 , H04L65/75 , H04L65/80
CPC分类号: H04L65/1066 , G01S5/0295 , G06F9/451 , G06F9/505 , G10L15/1822 , H04L45/70 , H04L65/75 , H04L65/80 , G01S5/02 , G01S5/18 , G06F3/167 , G10L2015/088
摘要: A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify first and second candidate interfaces, and respective resource utilization values. The interface management component can select, based on the resource utilization values, the first candidate interface to present the content item. The interface management component can provide the first action data structure to the client computing device for rendering as audio output, and can transmit the content item converted for a first modality to deliver the content item for rendering from the selected interface.
-
公开(公告)号:US11705121B2
公开(公告)日:2023-07-18
申请号:US16936972
申请日:2020-07-23
申请人: Google LLC
发明人: Gaurav Bhaya , Robert Stets , Umesh Patil
CPC分类号: G10L15/22 , G06F3/167 , G06F40/279 , G10L15/1822 , G10L2015/088 , G10L2015/223
摘要: A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify first and second candidate interfaces, and respective resource utilization values. The interface management component can select, based on the resource utilization values, the first candidate interface to present the content item. The interface management component can provide the first action data structure to the client computing device for rendering as audio output, and can transmit the content item converted for a first modality to deliver the content item for rendering from the selected interface.
-
公开(公告)号:US11627065B2
公开(公告)日:2023-04-11
申请号:US17152246
申请日:2021-01-19
申请人: GOOGLE LLC
发明人: Gaurav Bhaya , Robert Stets
IPC分类号: H04L43/103 , G06F16/683 , G06F16/33 , G10L15/26 , G06F3/16 , G06F40/186 , G10L15/18 , G10L15/22 , H04L41/0813 , H04L67/12 , G10L15/08
摘要: A selective sensor polling system for a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signal to identify trigger keyword and request. The system can select a template for an action data structure with a plurality of fields. The system can determine to poll a first sensor for data for the first field. The system can determine to obtain data in memory previously collected by the second sensor. The system can generate and transmit the action data structure with the data from the sensor and memory, and transmit the action data structure to a third party device.
-
-
-
-
-
-
-
-
-