-
Publication No.: WO2018125298A1
Publication Date: 2018-07-05
Application No.: PCT/US2017/049709
Filing Date: 2017-08-31
Applicant: GOOGLE LLC
Inventor: BHAYA, Gaurav; STETS, Robert
Abstract: Optimization of sequence dependent operations in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. A prediction component can determine, based on the trigger keyword and the request, a thread that includes a first action, a second action subsequent to the first action, and a third action subsequent to the second action. A content selector component can select, based on the third action and the trigger keyword, a content item. An audio signal generator component can generate an output signal comprising the content item. An interface can transmit the output signal to cause a client computing device to drive a speaker to generate an acoustic wave corresponding to the output signal prior to occurrence of at least one of the first action and the second action.
-
Publication No.: WO2018125303A1
Publication Date: 2018-07-05
Application No.: PCT/US2017/049766
Filing Date: 2017-08-31
Applicant: GOOGLE LLC
Inventor: BHAYA, Gaurav; STETS, Robert
Abstract: A feedback control system for data transmissions in a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signals to identify a trigger keyword and a request. The system can select a content item using the trigger keyword or the request. The content item can be configured to establish a communication session between the device and a third party device. The system can monitor the communication session to measure a characteristic of the communication session. The system can generate a quality signal based on the measured characteristic.
-
Publication No.: WO2018125306A1
Publication Date: 2018-07-05
Application No.: PCT/US2017/049780
Filing Date: 2017-08-31
Applicant: GOOGLE LLC
Inventor: BHAYA, Gaurav; STETS, Robert
Abstract: Systems and methods to combine multiple voice activated audio input data packets that indicate sequence dependent operations are provided. A natural language processor component can receive first and second input audio signals from a client computing device, and can identify respective requests and corresponding trigger keywords. A direct action application programming interface ("API") can generate respective action data structures, and can construct respective data transmissions including the respective action data structures. A thread optimization component can obtain data packets of the first data transmission, and can obtain data packets of the second data transmission. The thread optimization component can determine, based on a heuristic technique applied to the data packets of the respective data transmissions, a sequence dependency parameter. The thread optimization component can merge, based on a comparison of the sequence dependency parameter with a threshold, the first and second data transmissions into a single thread.
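The threshold-based merge described in this abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the `Transmission` fields, the time-and-location heuristic, the one-hour window, the 0.6 threshold, and the choice to merge when the score meets or exceeds the threshold. The abstract does not specify the heuristic or the direction of the comparison.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Transmission:
    # Hypothetical stand-in for a data transmission carrying an action data structure.
    action: str
    timestamp: float  # seconds; when the action is expected to occur (assumed field)
    location: str     # assumed field


def sequence_dependency(a: Transmission, b: Transmission) -> float:
    """Toy heuristic: a score in [0, 1] that rises when two actions are
    close in time and share a location."""
    gap = abs(a.timestamp - b.timestamp)
    time_score = max(0.0, 1.0 - gap / 3600.0)  # assumed one-hour window
    place_score = 1.0 if a.location == b.location else 0.0
    return 0.5 * time_score + 0.5 * place_score


def maybe_merge(a: Transmission, b: Transmission,
                threshold: float = 0.6) -> List[List[Transmission]]:
    """Return one time-ordered thread if the score meets the threshold,
    otherwise two separate single-action threads."""
    if sequence_dependency(a, b) >= threshold:
        return [sorted([a, b], key=lambda t: t.timestamp)]
    return [[a], [b]]
```

With these assumed values, two actions 30 minutes apart at the same location score 0.75 and are merged into one thread, while two actions two hours apart at different locations score 0.0 and stay separate.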
-
Publication No.: WO2018125299A1
Publication Date: 2018-07-05
Application No.: PCT/US2017/049713
Filing Date: 2017-08-31
Applicant: GOOGLE LLC
Inventor: BHAYA, Gaurav; STETS, Robert
Abstract: Routing packetized actions in a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signals to identify a trigger keyword and a request, and generate an action data structure. The system can transmit the action data structure to a third party provider device. The system can receive an indication from the third party provider device that a communication session was established with the device.
-
Publication No.: WO2018125307A1
Publication Date: 2018-07-05
Application No.: PCT/US2017/049782
Filing Date: 2017-08-31
Applicant: GOOGLE LLC
Inventor: LEWIS, Justin; RAPP, Richard; BHAYA, Gaurav; STETS, Robert
Abstract: A system of multi-modal transmission of packetized data in a voice activated data packet based computer network environment is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. Based on the input audio signal, a direct action application programming interface can generate a first action data structure, and a content selector component can select a content item. An interface management component can identify first and second candidate interfaces, and respective resource utilization values. The interface management component can select, based on the resource utilization values, the first candidate interface to present the content item. The interface management component can provide the first action data structure to the client computing device for rendering as audio output, and can transmit the content item converted for a first modality to deliver the content item for rendering from the selected interface.
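The interface selection step in this abstract can be sketched in a few lines of Python. The sketch assumes that a lower resource utilization value is preferable and that the values are directly comparable numbers. The abstract states only that the selection is based on the resource utilization values, so the function name, the interface names, and the selection rule are all hypothetical.

```python
from typing import Dict


def select_interface(utilization: Dict[str, float]) -> str:
    """Pick the candidate interface with the lowest resource utilization
    value (assumed selection rule; the abstract does not state one)."""
    if not utilization:
        raise ValueError("at least one candidate interface is required")
    return min(utilization, key=lambda name: utilization[name])


# Hypothetical candidates: the same content item could be rendered on a
# phone display or on a smart speaker, each with an assumed cost.
candidates = {"phone_display": 0.35, "speaker_audio": 0.80}
chosen = select_interface(candidates)
```

Here `chosen` is `"phone_display"`, so under these assumptions the content item would be converted for a visual modality before delivery.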
-
Publication No.: WO2018125304A1
Publication Date: 2018-07-05
Application No.: PCT/US2017/049774
Filing Date: 2017-08-31
Applicant: GOOGLE LLC
Inventor: BHAYA, Gaurav; STETS, Robert
Abstract: Systems and methods of voice activated thread management in a voice activated data packet based environment are provided. A natural language processor ("NLP") component can receive and parse data packets comprising a first input audio signal to identify a first request and a first trigger keyword. A direct action application programming interface ("API") can generate a first action data structure with a parameter defining a first action. The NLP component can receive and parse a second input audio signal to identify a second request and a second trigger keyword, and can generate a second action data structure with a parameter defining a second action. A pooling component can combine the first and second action data structures into a pooled data structure, and can transmit the pooled data structure to a service provider computing device to cause the service provider computing device to perform an operation defined by the pooled data structure.
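As a rough illustration of the pooling step, the Python sketch below combines two action data structures into one pooled structure. The dictionary layout, the field names, and the idea of recording the parameters the two actions share are assumptions for illustration only. The abstract does not describe the format of the pooled data structure.

```python
from typing import Any, Dict


def pool_actions(first: Dict[str, Any], second: Dict[str, Any]) -> Dict[str, Any]:
    """Combine two action data structures into one pooled structure.

    The pooled structure keeps both actions in order and records the
    parameters whose values match in both (hypothetical layout).
    """
    shared = {
        key: value
        for key, value in first.items()
        if key in second and second[key] == value
    }
    return {"actions": [first, second], "shared": shared}


# Hypothetical action data structures from two separate voice requests.
ride_a = {"action": "ride", "destination": "airport", "pickup": "Main St"}
ride_b = {"action": "ride", "destination": "airport", "pickup": "Oak Ave"}
pooled = pool_actions(ride_a, ride_b)
```

In this example the shared parameters are the action type and the destination, which is the kind of overlap a service provider could use to serve both requests with a single operation.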
-
Publication No.: WO2018125302A1
Publication Date: 2018-07-05
Application No.: PCT/US2017/049758
Filing Date: 2017-08-31
Applicant: GOOGLE LLC
Inventor: BHAYA, Gaurav; STETS, Robert
IPC: G10L13/027 , G10L15/22
Abstract: Modulating packetized audio signals in a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signals to identify a trigger keyword and a request, and generate a first action data structure. The system can identify a content item object based on the trigger keyword, and generate an output signal comprising a first portion corresponding to the first action data structure and a second portion corresponding to the content item object. The system can apply a modulation to the first or second portion of the output signal, and transmit the modulated output signal to the device.
-
Publication No.: WO2018125301A1
Publication Date: 2018-07-05
Application No.: PCT/US2017/049738
Filing Date: 2017-08-31
Applicant: GOOGLE LLC
Inventor: BHAYA, Gaurav; STETS, Robert
Abstract: Identifier dependent operation processing of packet based data communication is provided. A natural language processor component can parse an input audio signal to identify a request and a trigger keyword. A content selector component can select, based on the request or the trigger keyword, a content item. A link generation component can determine whether the client computing device has an account or a record in a database associated with the service provider device. In the absence of the record or account, the link generation component generates and sends a virtual identifier to the service provider device with instructions to generate an account in the database using the virtual identifier. Once the account is created, the service provider device can communicate with the client computing device.
-
Publication No.: WO2018125305A1
Publication Date: 2018-07-05
Application No.: PCT/US2017/049779
Filing Date: 2017-08-31
Applicant: GOOGLE LLC
Inventor: BHAYA, Gaurav; STETS, Robert
Abstract: A selective sensor polling system for a voice activated data packet based computer network environment is provided. A system can receive audio signals detected by a microphone of a device. The system can parse the audio signals to identify a trigger keyword and a request. The system can select a template for an action data structure with a plurality of fields. The system can determine to poll a first sensor for data for a first field. The system can determine to obtain data in memory previously collected by a second sensor. The system can generate the action data structure with the data from the first sensor and the memory, and transmit the action data structure to a third party device.
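The selective polling idea can be sketched in Python as follows. The freshness rule (reuse a stored reading if it is at most `max_age` seconds old, otherwise poll the sensor) is an assumption chosen for illustration, as are the function name, the field names, and the cache layout. The abstract says only that one field is filled by polling a sensor and another from data already in memory.

```python
from typing import Any, Callable, Dict, List, Tuple


def fill_action_template(
    fields: List[str],
    sensors: Dict[str, Callable[[], Any]],
    cache: Dict[str, Tuple[float, Any]],
    now: float,
    max_age: float = 60.0,
) -> Dict[str, Any]:
    """Fill each template field, polling a sensor only when memory holds
    no sufficiently recent reading for that field (assumed policy)."""
    action: Dict[str, Any] = {}
    for field in fields:
        cached = cache.get(field)
        if cached is not None and now - cached[0] <= max_age:
            action[field] = cached[1]      # reuse data already in memory
        else:
            value = sensors[field]()       # poll the sensor
            cache[field] = (now, value)    # remember the new reading
            action[field] = value
    return action


# Hypothetical sensors; `polled` records which ones were actually read.
polled: List[str] = []
sensors = {
    "location": lambda: polled.append("location") or "40.7,-74.0",
    "temperature": lambda: polled.append("temperature") or 21,
}
cache = {"temperature": (95.0, 20)}  # reading taken 5 seconds before `now`
action = fill_action_template(["location", "temperature"], sensors, cache, now=100.0)
```

Only the location sensor is polled in this run. The temperature field is filled from the stored reading, which avoids one sensor activation.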
-
Publication No.: WO2018125300A1
Publication Date: 2018-07-05
Application No.: PCT/US2017/049721
Filing Date: 2017-08-31
Applicant: GOOGLE LLC
Inventor: BHAYA, Gaurav; STETS, Robert
Abstract: The present disclosure is generally directed to a data processing system for authenticating packetized audio signals in a voice activated computer network environment. The data processing system can improve the efficiency and effectiveness of auditory data packet transmission over one or more computer networks by, for example, disabling malicious transmissions prior to their transmission across the network. The present solution can also improve computational efficiency by disabling remote computer processes possibly affected by or caused by the malicious audio signal transmissions. By disabling the transmission of malicious audio signals, the system can reduce bandwidth utilization by not transmitting the data packets carrying the malicious audio signal across the networks.