-
Publication number: US20180018961A1
Publication date: 2018-01-18
Application number: US15209064
Filing date: 2016-07-13
Applicant: Google Inc.
Inventor: Abraham Jung-Gyu Lee , Sang Soo Sung , Yeliang Zhang
CPC classification number: G10L15/08 , G06F3/04842 , G06F3/167 , G06F17/2775 , G10L15/005 , G10L15/04 , G10L15/22 , G10L25/87 , G10L2015/088 , H04M2203/4536
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for combining audio data and a transcription of the audio data into a data structure are disclosed. In one aspect, a method includes the actions of receiving audio data that corresponds to an utterance. The actions include generating a transcription of the utterance. The actions include classifying a first portion of the transcription as a trigger term and a second portion as an object of the trigger term. The actions include determining that the trigger term matches a trigger term for which a result of processing is to include both a transcription of an object and audio data of the object in a generated data structure. The actions include isolating the audio data of the object. The actions include generating a data structure that includes the transcription of the object and the audio data of the object.
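The steps in this abstract can be illustrated with a minimal sketch. It is not the patented implementation. It assumes a recognizer has already produced word-level timestamps, and every name in it (`Word`, `ObjectRecord`, `TRIGGERS_WITH_AUDIO`, `build_record`) is hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass
class Word:
    text: str
    start: float  # start time in seconds (assumed to come from the recognizer)
    end: float    # end time in seconds


@dataclass
class ObjectRecord:
    transcription: str  # text of the object of the trigger term
    audio: List[int]    # audio samples covering only the object


# Hypothetical trigger terms whose result should carry both text and audio.
TRIGGERS_WITH_AUDIO = {("remind", "me", "to")}


def build_record(words: Sequence[Word], samples: Sequence[int],
                 sample_rate: int) -> Optional[ObjectRecord]:
    """Split the transcription into trigger term and object, then pair the
    object's text with the slice of audio that corresponds to it."""
    texts = tuple(w.text.lower() for w in words)
    for trigger in TRIGGERS_WITH_AUDIO:
        n = len(trigger)
        if texts[:n] == trigger and len(words) > n:
            obj = words[n:]
            # Isolate the object's audio using the word timestamps.
            lo = int(obj[0].start * sample_rate)
            hi = int(obj[-1].end * sample_rate)
            return ObjectRecord(" ".join(w.text for w in obj),
                                list(samples[lo:hi]))
    return None  # no matching trigger term: no combined structure is built


words = [Word("remind", 0.0, 0.5), Word("me", 0.5, 1.0), Word("to", 1.0, 1.5),
         Word("buy", 1.5, 2.0), Word("milk", 2.0, 2.5)]
record = build_record(words, list(range(30)), sample_rate=10)
```

Here `record` holds the text "buy milk" together with only the samples spoken during those two words; an utterance without a listed trigger term yields `None`.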
-
Publication number: US20180039477A1
Publication date: 2018-02-08
Application number: US15226046
Filing date: 2016-08-02
Applicant: Google Inc.
Inventor: Sang Soo Sung , Lantian Zheng , Haywai Hayward Chan , Chen Liu , Liuyi Sun , David P. Whipp
IPC: G06F3/16 , G06F3/0484 , G06F3/0481 , G10L15/22 , G10L15/18
CPC classification number: G06F3/167 , G06F3/04817 , G06F3/04842 , G10L15/1815 , G10L15/1822 , G10L15/22 , G10L2015/223 , G10L2015/225
Abstract: The disclosed embodiments include computerized methods, systems, and devices, including computer programs encoded on a computer storage medium, for integrating voice-based interaction and control into a native graphical user interface (GUI) of an executed application. For example, a communications device may obtain component data identifying a plurality of components of a voice-user interface from a computing system maintained by a voice-service provider, and may execute an application linked to a corresponding one of the components of the voice-user interface. The communications device may generate the native GUI based on an output of the executed application, and may generate an interface element representative of the corresponding one of the components of the voice-user interface. The communications device may present the generated interface element within the native GUI, which may embed the corresponding component of the voice-user interface into the native GUI.
-
Publication number: US20180039478A1
Publication date: 2018-02-08
Application number: US15226054
Filing date: 2016-08-02
Applicant: Google Inc.
Inventor: Sang Soo Sung , Lantian Zheng , David P. Whipp , Liuyi Sun , Haywai Hayward Chan
IPC: G06F3/16 , G06F3/0484 , G06F3/0481 , G10L15/22 , G10L15/18
CPC classification number: G06F3/167 , G06F3/0481 , G06F3/04842 , G10L15/1815 , G10L15/1822 , G10L15/22 , G10L15/30 , G10L2015/223 , G10L2015/225 , G10L2015/228
Abstract: The disclosed embodiments include computerized methods, systems, and devices, including computer programs encoded on a computer storage medium, for integrating voice-based interaction and control into a native graphical user interface (GUI) of an executed application. For example, a communications device may receive audio data corresponding to an utterance spoken by a user, and may obtain structured data representative of the received audio data. The communications device may provide the structured data to the executed application through a programmatic interface, and the executed application may perform one or more operations in accordance with the structured data. The communications device may generate data indicative of an output of the one or more operations performed by the executed application, and may present at least a portion of the generated output data to the user through a corresponding interface.
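The flow described here (utterance, structured data, programmatic interface, presented output) might be sketched as follows. This is an illustrative toy only: the parser stands in for real speech recognition and semantic parsing, and `TimerApp`, `to_structured`, and `present` are hypothetical names that do not come from the publication.

```python
from typing import Any, Dict


class TimerApp:
    """Hypothetical application exposing a programmatic interface."""

    def handle(self, structured: Dict[str, Any]) -> Dict[str, Any]:
        # Perform an operation in accordance with the structured data.
        if structured.get("action") == "set_timer":
            minutes = structured["minutes"]
            return {"status": "ok",
                    "message": f"Timer set for {minutes} minutes"}
        return {"status": "unsupported", "message": "Unsupported action"}


def to_structured(transcription: str) -> Dict[str, Any]:
    """Toy stand-in for obtaining structured data from an utterance."""
    tokens = transcription.lower().split()
    if tokens[:3] == ["set", "timer", "for"] and len(tokens) >= 4 \
            and tokens[3].isdigit():
        return {"action": "set_timer", "minutes": int(tokens[3])}
    return {"action": "unknown"}


def present(output: Dict[str, Any]) -> str:
    """Select the portion of the output data shown to the user."""
    return output["message"]


app = TimerApp()
shown = present(app.handle(to_structured("Set timer for 5 minutes")))
```

The application never sees raw audio or free text in this sketch; it only receives a dictionary with an action name and parameters, which is one plausible reading of "structured data".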
-
Publication number: US20170372703A1
Publication date: 2017-12-28
Application number: US15193929
Filing date: 2016-06-27
Applicant: Google Inc.
Inventor: Sang Soo Sung , David P. Whipp , Jing Qian
CPC classification number: G10L15/30 , G06Q10/0631 , G10L13/00 , G10L15/22 , G10L15/26 , G10L2015/223 , G10L2015/225
Abstract: Methods, systems, and apparatus, including computer programs stored on a computer-readable storage medium, for asynchronous execution of client requests. In some implementations, data indicating a user request to a digital assistant is received. An action corresponding to the user request is determined. It is determined that the action is classified as an action to be performed asynchronously to the user request. A confirmation message is sent, for output, and the action is performed asynchronously to the user request.
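A minimal sketch of the asynchronous pattern this abstract describes is shown below, using a background thread. The classification table `ASYNC_ACTIONS`, the function `handle_request`, and the message wording are assumptions made for illustration; the publication does not specify them.

```python
import threading
from typing import Callable, List, Optional

# Hypothetical classification: actions to perform asynchronously to the request.
ASYNC_ACTIONS = {"order_pizza"}


def handle_request(action: str, perform: Callable[[], None],
                   outbox: List[str]) -> Optional[threading.Thread]:
    """Send a confirmation first and run the action in the background when it
    is classified as asynchronous; otherwise run it inline."""
    if action in ASYNC_ACTIONS:
        # The confirmation is queued for output before the work starts.
        outbox.append(f"OK, working on {action}")
        worker = threading.Thread(target=perform)
        worker.start()
        return worker
    perform()
    outbox.append(f"Done: {action}")
    return None


completed: List[str] = []
outbox: List[str] = []
worker = handle_request("order_pizza",
                        lambda: completed.append("order_pizza"), outbox)
first_message = outbox[0]
if worker is not None:
    worker.join()  # wait here only so the example finishes deterministically
```

In a real assistant the caller would not join the worker; the user would get the confirmation immediately and the action would complete later.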