System and method for generating customized text-to-speech voices
    1.
    发明授权
    System and method for generating customized text-to-speech voices 有权
    用于生成定制的文本到语音语音的系统和方法

    公开(公告)号:US09240177B2

    公开(公告)日:2016-01-19

    申请号:US14196578

    申请日:2014-03-04

    Abstract: A system and method are disclosed for generating customized text-to-speech voices for a particular application. The method comprises generating a custom text-to-speech voice by selecting a voice for generating a custom text-to-speech voice associated with a domain, collecting text data associated with the domain from a pre-existing text data source and using the collected text data, generating an in-domain inventory of synthesis speech units by selecting speech units appropriate to the domain via a search of a pre-existing inventory of synthesis speech units, or by recording the minimal inventory for a selected level of synthesis quality. The text-to-speech custom voice for the domain is generated utilizing the in-domain inventory of synthesis speech units. Active learning techniques may also be employed to identify problem phrases wherein only a few minutes of recorded data is necessary to deliver a high quality TTS custom voice.

    Abstract translation: 公开了用于为特定应用产生定制的文本到语音语音的系统和方法。 该方法包括通过选择用于生成与域相关联的自定义文本到语音语音的语音来生成自定义文本到语音语音,从预先存在的文本数据源收集与域相关联的文本数据,并使用收集的 文本数据,通过搜索合成语音单元的预先存在的库存来选择适合于该域的语音单元,或者通过记录所选合成质量水平的最小库存来生成合成语音单元的域内库存。 使用合成语音单元的域内库存来生成域的文本到语音定制语音。 还可以使用主动学习技术来识别问题短语,其中只需要几分钟的记录数据来传送高质量的TTS定制语音。

    Systems and methods for extracting meaning from multimodal inputs using finite-state devices
    2.
    发明授权
    Systems and methods for extracting meaning from multimodal inputs using finite-state devices 有权
    使用有限状态设备从多模态输入中提取意义的系统和方法

    公开(公告)号:US08626507B2

    公开(公告)日:2014-01-07

    申请号:US13690037

    申请日:2012-11-30

    CPC classification number: G10L15/00 G06F3/167 G06K9/00355 G10L15/24

    Abstract: Multimodal utterances contain a number of different modes. These modes can include speech, gestures, and pen, haptic, and gaze inputs, and the like. This invention use recognition results from one or more of these modes to provide compensation to the recognition process of one or more other ones of these modes. In various exemplary embodiments, a multimodal recognition system inputs one or more recognition lattices from one or more of these modes, and generates one or more models to be used by one or more mode recognizers to recognize the one or more other modes. In one exemplary embodiment, a gesture recognizer inputs a gesture input and outputs a gesture recognition lattice to a multimodal parser. The multimodal parser generates a language model and outputs it to an automatic speech recognition system, which uses the received language model to recognize the speech input that corresponds to the recognized gesture input.

    Abstract translation: 多模式话语包含多种不同的模式。 这些模式可以包括语音,手势和笔,触觉和注视输入等。 本发明使用这些模式中的一个或多个的识别结果为这些模式中的一个或多个其他模式的识别过程提供补偿。 在各种示例性实施例中,多模式识别系统从这些模式中的一个或多个输入一个或多个识别网格,并且生成要由一个或多个模式识别器使用以识别一个或多个其他模式的一个或多个模型。 在一个示例性实施例中,手势识别器输入手势输入并向多模式解析器输出手势识别格点。 多模式解析器生成语言模型并将其输出到自动语音识别系统,其使用所接收的语言模型来识别对应于识别的手势输入的语音输入。

    System and method of spoken language understanding in human computer dialogs

    公开(公告)号:US08612232B2

    公开(公告)日:2013-12-17

    申请号:US13775546

    申请日:2013-02-25

    Abstract: A system and method are disclosed that improve automatic speech recognition in a spoken dialog system. The method comprises partitioning speech recognizer output into self-contained clauses, identifying a dialog act in each of the self-contained clauses, qualifying dialog acts by identifying a current domain object and/or a current domain action, and determining whether further qualification is possible for the current domain object and/or current domain action. If further qualification is possible, then the method comprises identifying another domain action and/or another domain object associated with the current domain object and/or current domain action, reassigning the another domain action and/or another domain object as the current domain action and/or current domain object and then recursively qualifying the new current domain action and/or current object. This process continues until nothing is left to qualify.

    System and Method of Providing a Spoken Dialog Interface to a Website
    4.
    发明申请
    System and Method of Providing a Spoken Dialog Interface to a Website 有权
    向网站提供口语对话界面的系统和方法

    公开(公告)号:US20130246069A1

    公开(公告)日:2013-09-19

    申请号:US13891447

    申请日:2013-05-10

    Abstract: Disclosed is a method for training a spoken dialog service component from website data. Spoken dialog service components typically include an automatic speech recognition module, a language understanding module, a dialog management module, a language generation module and a text-to-speech module. The method includes selecting anchor texts within a website based on a term density, weighting those anchor texts based on a percent of salient words to total words, and incorporating the weighted anchor texts into a live spoken dialog interface, the weights determining a level of incorporation into the live spoken dialog interface.

    Abstract translation: 公开了一种从网站数据训练口语对话服务组件的方法。 口语对话服务组件通常包括自动语音识别模块,语言理解模块,对话管理模块,语言生成模块和文本到语音模块。 该方法包括基于术语密度选择网站内的锚文本,基于显着词的百分比将总计文本加权到总词,并将加权的锚文本加入到现场语音对话界面中,权重确定并入级别 进入现场对话界面。

    System and method of spoken language understanding in human computer dialogs

    公开(公告)号:US09263031B2

    公开(公告)日:2016-02-16

    申请号:US14081166

    申请日:2013-11-15

    Abstract: A system and method are disclosed that improve automatic speech recognition in a spoken dialog system. The method comprises partitioning speech recognizer output into self-contained clauses, identifying a dialog act in each of the self-contained clauses, qualifying dialog acts by identifying a current domain object and/or a current domain action, and determining whether further qualification is possible for the current domain object and/or current domain action. If further qualification is possible, then the method comprises identifying another domain action and/or another domain object associated with the current domain object and/or current domain action, reassigning the another domain action and/or another domain object as the current domain action and/or current domain object and then recursively qualifying the new current domain action and/or current object. This process continues until nothing is left to qualify.

    Method and apparatus for automatically building conversational systems
    6.
    发明授权
    Method and apparatus for automatically building conversational systems 有权
    自动构建对话系统的方法和装置

    公开(公告)号:US08718242B2

    公开(公告)日:2014-05-06

    申请号:US13914974

    申请日:2013-06-11

    Abstract: A system and method provides a natural language interface to world-wide web content. Either in advance or dynamically, webpage content is parsed using a parsing algorithm. A person using a telephone interface can provide speech information, which is converted to text and used to automatically fill in input fields on a webpage form. The form is then submitted to a database search and a response is generated. Information contained on the responsive webpage is extracted and converted to speech via a text-to-speech engine and communicated to the person.

    Abstract translation: 系统和方法为世界各地的Web内容提供了一种自然语言界面。 提前或动态地,使用解析算法解析网页内容。 使用电话接口的人可以提供语音信息,其被转换成文本并用于自动填写网页表单上的输入字段。 然后将表单提交到数据库搜索,并生成响应。 包含在响应网页上的信息被提取并经由文本到语音引擎转换成语音,并传达给该人。

    Methods and Systems for Natural Language Understanding Using Human Knowledge and Collected Data
    7.
    发明申请
    Methods and Systems for Natural Language Understanding Using Human Knowledge and Collected Data 有权
    使用人类知识和收集数据的自然语言理解的方法和系统

    公开(公告)号:US20130311170A1

    公开(公告)日:2013-11-21

    申请号:US13873548

    申请日:2013-04-30

    CPC classification number: G10L15/183 G06F17/2818 G10L15/14 G10L15/19

    Abstract: Disclosed herein are systems and methods to incorporate human knowledge when developing and using statistical models for natural language understanding. The disclosed systems and methods embrace a data-driven approach to natural language understanding which progresses seamlessly along the continuum of availability of annotated collected data, from when there is no available annotated collected data to when there is any amount of annotated collected data.

    Abstract translation: 这里公开的是在开发和使用自然语言理解的统计模型时将人类知识结合在一起的系统和方法。 所公开的系统和方法包含一种数据驱动的自然语言理解方法,其从注释收集的数据的可用性的连续性无缝地进行,从没有可用的注释收集的数据到当有任何数量的注释收集的数据时。

    Method and Apparatus for Automatically Building Conversational Systems
    8.
    发明申请
    Method and Apparatus for Automatically Building Conversational Systems 有权
    自动构建对话系统的方法和装置

    公开(公告)号:US20130275132A1

    公开(公告)日:2013-10-17

    申请号:US13914974

    申请日:2013-06-11

    Abstract: A system and method provides a natural language interface to world-wide web content. Either in advance or dynamically, webpage content is parsed using a parsing algorithm. A person using a telephone interface can provide speech information, which is converted to text and used to automatically fill in input fields on a webpage form. The form is then submitted to a database search and a response is generated. Information contained on the responsive webpage is extracted and converted to speech via a text-to-speech engine and communicated to the person.

    Abstract translation: 系统和方法为世界各地的Web内容提供了一种自然语言界面。 提前或动态地,使用解析算法解析网页内容。 使用电话接口的人可以提供语音信息,其被转换成文本并用于自动填写网页表单上的输入字段。 然后将表单提交到数据库搜索,并生成响应。 包含在响应网页上的信息被提取并经由文本到语音引擎转换成语音,并传达给该人。

    System and Method of Spoken Language Understanding in Human Computer Dialogs
    9.
    发明申请
    System and Method of Spoken Language Understanding in Human Computer Dialogs 有权
    人类对话中口语理解的系统与方法

    公开(公告)号:US20130166284A1

    公开(公告)日:2013-06-27

    申请号:US13775546

    申请日:2013-02-25

    Abstract: A system and method are disclosed that improve automatic speech recognition in a spoken dialog system. The method comprises partitioning speech recognizer output into self-contained clauses, identifying a dialog act in each of the self-contained clauses, qualifying dialog acts by identifying a current domain object and/or a current domain action, and determining whether further qualification is possible for the current domain object and/or current domain action. If further qualification is possible, then the method comprises identifying another domain action and/or another domain object associated with the current domain object and/or current domain action, reassigning the another domain action and/or another domain object as the current domain action and/or current domain object and then recursively qualifying the new current domain action and/or current object. This process continues until nothing is left to qualify.

    Abstract translation: 公开了一种提高口语对话系统中的自动语音识别的系统和方法。 该方法包括将语音识别器输出划分为独立子句,识别每个自包含子句中的对话行为,通过识别当前域对象和/或当前域动作进行限定对话行为,以及确定是否可进一步限定 对于当前域对象和/或当前域操作。 如果可以进一步鉴定,则该方法包括识别与当前域对象和/或当前域操作相关联的另一域操作和/或另一域对象,将另一域操作和/或另一域对象重新分配为当前域操作,以及 /或当前域对象,然后递归地限定新的当前域操作和/或当前对象。 这个过程一直持续到没有什么是剩下的资格。

Patent Agency Ranking