Abstract:
The present teaching relates to methods, systems, media, and implementations for an automated dialogue companion. Multimodal input data associated with a user engaged in a dialogue on a certain topic in a dialogue scene are first received and used to extract features representing a state of the user and relevant information associated with the dialogue scene. A current state of the dialogue, characterizing the context of the dialogue, is generated based on the state of the user and the relevant information associated with the dialogue scene. A response communication for the user is determined based on a dialogue tree corresponding to the dialogue on the certain topic, the current state of the dialogue, and utilities learned from historic dialogue data and the current state of the dialogue.
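A minimal sketch of the response-selection step this abstract describes, in Python. The names (DialogueState, select_response) and the keying of the utility table are illustrative assumptions, not the patent's actual implementation:

from dataclasses import dataclass

@dataclass
class DialogueState:
    user_features: dict    # e.g. {"emotion": "confused"} from multimodal input
    scene_features: dict   # relevant information extracted from the scene
    topic: str = "math_tutoring"

def select_response(state, tree_node, utilities):
    """Pick the child of the current dialogue-tree node whose response has
    the highest utility learned from historic dialogue data, given the
    current state of the dialogue."""
    def score(child):
        key = (state.topic, state.user_features.get("emotion"), child["response"])
        return utilities.get(key, 0.0)   # unseen combinations score neutral
    return max(tree_node["children"], key=score)

tree_node = {"children": [{"response": "Let's try an easier example."},
                          {"response": "Great, let's move on."}]}
utilities = {("math_tutoring", "confused", "Let's try an easier example."): 0.9}
state = DialogueState(user_features={"emotion": "confused"}, scene_features={})
print(select_response(state, tree_node, utilities)["response"])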
Abstract:
A computer-implemented method to populate an electronic record may include generating first transcript data of first audio of a first speaker during a conversation between the first speaker and a second speaker. The method may also include generating second transcript data of second audio of the second speaker during the conversation, and identifying one or more words from the first transcript data as being a value for a record field based on the one or more words corresponding to the record field and on the one or more words being from the first transcript data and not from the second transcript data. The method may further include providing the identified words to an electronic record database as a value for the record field of a user record of the first speaker.
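A rough sketch of the field-population logic, using a date-of-birth field as the example. The regex, field name, and record dictionary are assumptions for illustration only:

import re

DOB_PATTERN = re.compile(r"\b\d{1,2}/\d{1,2}/\d{4}\b")

def extract_field_value(first_transcript, second_transcript):
    """Return a date found in the first speaker's transcript, provided the
    same words do not also appear in the second speaker's transcript."""
    for match in DOB_PATTERN.findall(first_transcript):
        if match not in second_transcript:
            return match
    return None

record = {}
first = "My date of birth is 03/14/1985 and I live in Springfield."
second = "Thanks. Can you confirm your date of birth for me?"
value = extract_field_value(first, second)
if value is not None:
    record["date_of_birth"] = value   # stored on the first speaker's record
print(record)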
Abstract:
Provided is a system in which users performing a coordinated process are localized in a complex environment based upon audio input. Audio commands are detected and executed based on vocalizations of system users. Available commands are limited by user status, location, process type, and process progress. Command execution is limited by the presence and locations of system users, non-users, or extraneous equipment.
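An illustrative gate for the command restrictions described above; the status values, zones, and process steps are hypothetical stand-ins:

ALLOWED = {
    # (user_status, zone, process_type, process_step) -> permitted commands
    ("scrubbed", "sterile_field", "surgery", "incision"): {"lights", "table_tilt"},
    ("circulating", "periphery", "surgery", "incision"): {"music", "page_staff"},
}

def may_execute(command, user, scene):
    key = (user["status"], user["zone"], scene["process"], scene["step"])
    if command not in ALLOWED.get(key, set()):
        return False                      # command unavailable to this user here
    if scene.get("non_users_present"):    # non-users or extraneous equipment
        return False                      # block execution entirely
    return True

user = {"status": "scrubbed", "zone": "sterile_field"}
scene = {"process": "surgery", "step": "incision", "non_users_present": False}
print(may_execute("lights", user, scene))   # True
print(may_execute("music", user, scene))    # False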
Abstract:
A method for configuring an automated, speech driven self-help system based on prior interactions between a plurality of customers and a plurality of agents includes: recognizing, by a processor, speech in the prior interactions between customers and agents to generate recognized text; detecting, by the processor, a plurality of phrases in the recognized text; clustering, by the processor, the plurality of phrases into a plurality of clusters; generating, by the processor, a plurality of grammars describing corresponding ones of the clusters; outputting, by the processor, the plurality of grammars; and invoking configuration of the automated self-help system based on the plurality of grammars.
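A condensed sketch of the configuration pipeline this abstract enumerates, starting from already-recognized phrases (the ASR and phrase-detection steps are omitted). scikit-learn's TF-IDF plus KMeans stands in for whatever clustering the patent actually uses, and the grammar output format is an assumption:

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.cluster import KMeans

phrases = ["reset my password", "forgot my password", "change password",
           "check my balance", "account balance please", "what is my balance"]

vectors = TfidfVectorizer().fit_transform(phrases)
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(vectors)

# One grammar per cluster: a simple alternation over member phrases,
# which a speech-driven self-help system could load as a rule.
grammars = {}
for phrase, label in zip(phrases, labels):
    grammars.setdefault(label, []).append(phrase)
for label, members in grammars.items():
    print(f"$cluster_{label} = ( " + " | ".join(members) + " );")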
Abstract:
According to some aspects, a method of processing user input received from a user is provided. The method comprises generating a plurality of segmentation hypotheses from content of the user input based, at least in part, on a set of parameters, querying a domain-specific database using each of the plurality of segmentation hypotheses to obtain at least one result, and modifying at least one of the set of parameters based, at least in part, on the at least one result.
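A toy illustration of the hypothesize-query-adjust loop. The "database" is a dict of known entities and the single parameter is a maximum segment length; both are assumptions for the sketch:

KNOWN = {"new york", "pizza", "new", "york pizza"}  # stand-in domain database

def segmentations(tokens, max_len):
    """Enumerate ways to split tokens into segments no longer than max_len."""
    if not tokens:
        return [[]]
    results = []
    for i in range(1, min(max_len, len(tokens)) + 1):
        head = " ".join(tokens[:i])
        results += [[head] + rest for rest in segmentations(tokens[i:], max_len)]
    return results

params = {"max_len": 1}
tokens = "new york pizza".split()
hits = [h for h in segmentations(tokens, params["max_len"])
        if all(seg in KNOWN for seg in h)]
if not hits:                 # poor results -> widen the segmentation window
    params["max_len"] += 1
    hits = [h for h in segmentations(tokens, params["max_len"])
            if all(seg in KNOWN for seg in h)]
print(hits)                  # [['new', 'york pizza'], ['new york', 'pizza']]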
Abstract:
A system and method for providing a voice assistant including: receiving, at a first device, a first audio input from a user requesting a first action; performing automatic speech recognition on the first audio input; obtaining a context of the user; performing natural language understanding based on the speech recognition of the first audio input; and taking the first action based on the context of the user and the natural language understanding.
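A skeletal version of that pipeline; every component here is a stub with assumed names, not the actual system:

def recognize(audio):                    # automatic speech recognition (stub)
    return "turn on the lights"

def get_context(user_id):                # e.g. location, time, device state
    return {"room": "kitchen"}

def understand(text):                    # natural language understanding (stub)
    return {"intent": "lights_on"} if "lights" in text else {"intent": "unknown"}

def handle_request(audio, user_id):
    text = recognize(audio)
    context = get_context(user_id)
    intent = understand(text)
    if intent["intent"] == "lights_on":
        # the context disambiguates *which* lights the user means
        return f"turning on the {context['room']} lights"
    return "sorry, I didn't catch that"

print(handle_request(b"...", user_id="u1"))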
Abstract:
Systems and methods for providing a user adaptive natural language interface are disclosed. The disclosed embodiments may receive and analyze user input to derive current user behavior data, including data indicative of characteristics of the user input. The user input is classified based on prior user behavior data previously logged during one or more previous user-system interactions and the current user behavior data to generate a classification of the user input. Machine learning algorithms can be employed to classify the user input. User adaptive utterances are selected based on the user input and the classification of the user input. The user-system interaction is logged for use as prior user behavior data in future user-system interactions. A response to the user input is generated, including synthesizing output speech from the user adaptive utterances selected. Example applications of the disclosed systems and methods provide user adaptive navigation directions in navigation systems.
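A compact sketch of the adaptation loop described above, with made-up behavior features (word count, hesitation count) and a trivial rule-based classifier standing in for the machine-learning step:

log = []                                   # prior user behavior data

def classify(current, history):
    """Label the user from current plus logged behavior: terse, fluent
    inputs suggest an expert; long, hesitant inputs suggest a novice."""
    samples = history + [current]
    avg_words = sum(s["word_count"] for s in samples) / len(samples)
    avg_hes = sum(s["hesitations"] for s in samples) / len(samples)
    return "expert" if avg_words < 6 and avg_hes < 1 else "novice"

UTTERANCES = {                             # user-adaptive navigation prompts
    "expert": "Left on 5th, then I-90 E.",
    "novice": "In 200 feet, turn left onto 5th Avenue, then follow the "
              "signs for Interstate 90 East.",
}

current = {"word_count": 4, "hesitations": 0, "text": "route to airport"}
label = classify(current, log)
log.append(current)                        # logged for future interactions
print(UTTERANCES[label])                   # would be synthesized to speech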
Abstract:
According to one embodiment, a speech interaction apparatus for performing an interaction with a user based on a scenario includes a speech recognition unit, a determination unit, a selection unit and an execution unit. The speech recognition unit recognizes a speech of the user and generates a recognition result text. The determination unit determines whether or not the speech includes an interrogative intention based on the recognition result text. The selection unit selects, when the speech includes the interrogative intention, a term of inquiry from a response sentence in the interaction in accordance with timing of the speech, the term of inquiry being a subject of the interrogative intention. The execution unit executes an explanation scenario including an explanation of the term of inquiry.
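One possible reading of the selection step, in sketch form: the word the apparatus was saying when the user interrupted with a question becomes the term of inquiry. The interrogative word list, per-word timings, and explanation table are all assumptions:

INTERROGATIVES = {"what", "what's", "huh", "sorry", "pardon"}
EXPLANATIONS = {"itinerary": "An itinerary is the planned route of a trip."}

def is_question(recognized_text):
    return recognized_text.split()[0].lower() in INTERROGATIVES

def term_at(response_words, word_timings, speech_time):
    """Return the response word being spoken at the moment the user spoke,
    given per-word (start, end) timings in seconds."""
    for word, (start, end) in zip(response_words, word_timings):
        if start <= speech_time <= end:
            return word
    return response_words[-1]

response = "your itinerary is confirmed".split()
timings = [(0.0, 0.4), (0.4, 1.1), (1.1, 1.3), (1.3, 2.0)]
user_speech, t = "what's that?", 0.9       # user asks mid-response

if is_question(user_speech):
    term = term_at(response, timings, t)   # -> "itinerary"
    print(EXPLANATIONS.get(term, f"Let me explain '{term}'."))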
Abstract:
The present invention enables voice interaction to continue at suitable timing without requiring high processing capacity and regardless of deviations in the flow of conversation. The data structure used for this purpose comprises, as one set, at least: speech content (Speak) spoken to a user; response content (Return) with which a spoken dialogue is established in reply to the speech content; and attribute information (Entity) indicating attributes of the speech content.
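A direct transcription of the described Speak/Return/Entity set into Python; the field types and example values are assumptions:

from dataclasses import dataclass

@dataclass
class DialogueSet:
    speak: str          # Speak: content spoken to the user
    return_: dict       # Return: responses that establish the spoken dialogue
    entity: dict        # Entity: attributes of the speech content

unit = DialogueSet(
    speak="Would you like coffee or tea?",
    return_={"coffee": "One coffee, coming up.",
             "tea": "One tea, coming up."},
    entity={"topic": "drink_order", "expects": ["coffee", "tea"]},
)

# Matching a user reply against Return keeps the exchange moving even when
# the reply deviates from the expected flow: unmatched input simply
# re-uses the set's Speak content as a re-prompt.
reply = "tea please"
for key, follow_up in unit.return_.items():
    if key in reply:
        print(follow_up)
        break
else:
    print(unit.speak)    # re-prompt at a suitable timing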
Abstract:
According to one embodiment, an interaction apparatus includes a storage, a first extractor, a retriever, a generator, a second extractor, and a register. The storage stores a problem and at least one solution for solving the problem. The first extractor extracts a target problem, which is an expression regarded as the problem, from a first speech of a user. The retriever retrieves a target solution for the target problem from the storage. The generator generates a first speech-prompting sentence prompting the user to make a speech including the target solution if the storage stores no target solution or if the user rejects the target solution. The second extractor extracts the target solution from a second speech, which is a response of the user relating to the first speech-prompting sentence. The register registers, on the storage, the target problem and the target solution.
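A skeleton of that problem/solution loop; the extraction functions are trivial pattern matches standing in for the real extractors, and the example phrases are invented:

storage = {"printer won't print": ["check the ink cartridge"]}

def extract_problem(speech):               # first extractor (stub)
    return speech.removeprefix("my ").strip(".") if "won't" in speech else None

def extract_solution(speech):              # second extractor (stub)
    return speech.removeprefix("i fixed it by ").strip(".")

first_speech = "my scanner won't scan."
problem = extract_problem(first_speech)
solutions = storage.get(problem, [])       # retriever: look up stored solutions

if not solutions:                          # no stored solution -> prompt user
    print(f"How did you solve '{problem}'?")
    second_speech = "i fixed it by updating the driver."
    solution = extract_solution(second_speech)
    storage.setdefault(problem, []).append(solution)   # register the pair

print(storage)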