-
公开(公告)号:US20170032791A1
公开(公告)日:2017-02-02
申请号:US14815794
申请日:2015-07-31
Applicant: Google Inc.
Inventor: David Kliger Elson , David P. Whipp , Shir Judith Yehoshua
IPC: G10L17/22
CPC classification number: G10L17/22 , G06F17/30654
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for managing dialogs. In one aspect, a method includes receiving a request associated with a task from a user device; submitting the request to each of a plurality of distinct data providers; receiving a plurality of suggested dialog responses from two or more of the data providers; scoring the one or more suggested dialog responses based on one or more scoring factors; determining a particular dialog response to provide to the user based on the scoring; and providing the determined dialog response to the user device.
Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于管理对话。 一方面,一种方法包括从用户设备接收与任务相关联的请求; 将请求提交给多个不同数据提供者中的每一个; 从两个或多个数据提供者接收多个建议的对话响应; 基于一个或多个评分因素评分一个或多个建议的对话响应; 确定特定的对话响应以基于评分提供给用户; 以及向所述用户设备提供所确定的对话响应。
-
公开(公告)号:US20180039477A1
公开(公告)日:2018-02-08
申请号:US15226046
申请日:2016-08-02
Applicant: Google Inc.
Inventor: Sang Soo Sung , Lantian Zheng , Haywai Hayward Chan , Chen Liu , Liuyi Sun , David P. Whipp
IPC: G06F3/16 , G06F3/0484 , G06F3/0481 , G10L15/22 , G10L15/18
CPC classification number: G06F3/167 , G06F3/04817 , G06F3/04842 , G10L15/1815 , G10L15/1822 , G10L15/22 , G10L2015/223 , G10L2015/225
Abstract: The disclosed embodiments include computerized methods, systems, and devices, including computer programs encoded on a computer storage medium, for integrating voice-based interaction and control into a native graphical user interface (GUI) of an executed application. For example, a communications device may obtaining component data identifying a plurality of components of a voice-user interface from a computing system maintained by a voice-service provider, and may execute an application linked to a corresponding one of the components of the voice-user interface. The communications device may generate the native GUI based on an output of the executed application, and may generate an interface element representative of the corresponding one of the components of the voice-user interface. The communications device may present the generated interface element within the native GUI, which may embed the corresponding component of the voice-user interface into the native GUI.
-
公开(公告)号:US20160171114A1
公开(公告)日:2016-06-16
申请号:US14881778
申请日:2015-10-13
Applicant: Google Inc.
Inventor: David P. Whipp , David Kliger Elson , Shir Judith Yehoshua
CPC classification number: G06F16/9535 , G06F3/167 , G06F16/243 , G06F16/3329 , G10L15/1822 , G10L15/22 , G10L2015/223 , H04M3/4936
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using frames for performing tasks. One of the methods includes receiving a first request to perform a task, the first request comprising user speech identifying the task; generating a frame associated with the task, wherein the frame comprises one or more types of values necessary to perform the task, and wherein each type of value can be satisfied by a respective value; receiving a second request to provide information related to a question, the second request comprising user speech identifying the question; providing information identifying the question to a search engine, and receiving a response identifying one or more terms; determining that at least one term can satisfy a type of value necessary to perform the task; and storing the at least one term in the frame.
Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于使用帧执行任务。 所述方法之一包括接收执行任务的第一请求,所述第一请求包括识别所述任务的用户语音; 生成与所述任务相关联的帧,其中所述帧包括执行所述任务所需的一种或多种类型的值,并且其中每种类型的值可以由相应的值来满足; 接收第二请求以提供与问题相关的信息,所述第二请求包括识别所述问题的用户语音; 向搜索引擎提供识别问题的信息,以及接收标识一个或多个术语的响应; 确定至少一个术语可以满足执行任务所需的一种类型的值; 以及将所述至少一个项存储在所述帧中。
-
公开(公告)号:US10885129B2
公开(公告)日:2021-01-05
申请号:US14881778
申请日:2015-10-13
Applicant: Google Inc.
Inventor: David P. Whipp , David Kliger Elson , Shir Judith Yehoshua
IPC: G06F16/9535 , G06F16/953 , G06F16/9536 , G06F16/242 , G06F16/332 , G06F3/16 , H04M3/493 , G10L15/18 , G10L15/22
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for using frames for performing tasks. One of the methods includes receiving a first request to perform a task, the first request comprising user speech identifying the task; generating a frame associated with the task, wherein the frame comprises one or more types of values necessary to perform the task, and wherein each type of value can be satisfied by a respective value; receiving a second request to provide information related to a question, the second request comprising user speech identifying the question; providing information identifying the question to a search engine, and receiving a response identifying one or more terms; determining that at least one term can satisfy a type of value necessary to perform the task; and storing the at least one term in the frame.
-
公开(公告)号:US20180039478A1
公开(公告)日:2018-02-08
申请号:US15226054
申请日:2016-08-02
Applicant: Google Inc.
Inventor: Sang Soo Sung , Lantian Zheng , David P. Whipp , Liuyi Sun , Haywai Hayward Chan
IPC: G06F3/16 , G06F3/0484 , G06F3/0481 , G10L15/22 , G10L15/18
CPC classification number: G06F3/167 , G06F3/0481 , G06F3/04842 , G10L15/1815 , G10L15/1822 , G10L15/22 , G10L15/30 , G10L2015/223 , G10L2015/225 , G10L2015/228
Abstract: The disclosed embodiments include computerized methods, systems, and devices, including computer programs encoded on a computer storage medium, for integrating voice-based interaction and control into a native graphical user interface (GUI) of an executed application. For example, a communications device may receive audio data corresponding to an utterance spoken by a user, and may obtain structured data representative of the received audio data. The communications device may provide structured data to the executed application through a programmatic interface, and the executed application may perform the one or more operations in accordance with the structured data. The communications device may generate data indicative of an output of the one or more operations performed by the executed application, and may present at least a portion of the generated output data to a user through a corresponding interface.
-
公开(公告)号:US20170372703A1
公开(公告)日:2017-12-28
申请号:US15193929
申请日:2016-06-27
Applicant: Google Inc.
Inventor: Sang Soo Sung , David P. Whipp , Jing Qian
CPC classification number: G10L15/30 , G06Q10/0631 , G10L13/00 , G10L15/22 , G10L15/26 , G10L2015/223 , G10L2015/225
Abstract: Methods, systems, and apparatus, including computer programs stored on a computer-readable storage medium, for asynchronous execution of client requests. In some implementations, data indicating a user request to a digital assistant is received. An action corresponding to the user request is determined. It is determined that the action is classified as an action to be performed asynchronously to the user request. A confirmation message is sent, for output, and the action is performed asynchronously to the user request.
-
公开(公告)号:US20160196110A1
公开(公告)日:2016-07-07
申请号:US14988494
申请日:2016-01-05
Applicant: Google Inc.
Inventor: Shir Judith Yehoshua , David Kliger Elson , David P. Whipp
IPC: G06F3/16
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for managing dialogs. In one aspect, a method includes receiving a request to perform a task from a user device; obtaining a dialog corresponding to the task; providing multiple protocol buffers to the user device; receiving a voice input and one or more annotated protocol buffers from the user device, the one or more annotated protocol buffers identifying corresponding non-verbal responses to content in the protocol buffers; and using the received protocol buffers to update a state of the dialog and to interpret the voice input.
Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于管理对话。 一方面,一种方法包括从用户设备接收执行任务的请求; 获取对应于该任务的对话; 向用户设备提供多个协议缓冲器; 从所述用户设备接收语音输入和一个或多个注释协议缓冲器,所述一个或多个注释协议缓冲器标识对所述协议缓冲器中的内容的对应非语言响应; 并使用接收到的协议缓冲器来更新对话的状态并解释语音输入。
-
-
-
-
-
-