SYSTEM AND METHOD OF PERFORMING AUTOMATIC SPEECH RECOGNITION USING LOCAL PRIVATE DATA
    2.
    Invention application
    Status: Granted

    Publication number: US20150120288A1

    Publication date: 2015-04-30

    Application number: US14066079

    Filing date: 2013-10-29

    CPC classification number: G10L15/22 G10L15/30 G10L2015/228

    Abstract: A method of providing hybrid speech recognition between a local embedded speech recognition system and a remote speech recognition system relates to receiving speech from a user at a device communicating with a remote speech recognition system. The system recognizes a first part of the speech by performing a first recognition of the first part of the speech with the embedded speech recognition system that accesses private user data, wherein the private user data is not available to the remote speech recognition system. The system recognizes the second part of the speech by performing a second recognition of the second part of the speech with the remote speech recognition system. The final recognition result is a combination of these two recognition processes. The private data can be such local information as a user location, a playlist, frequently dialed numbers or texted people, user contact list information, and so forth.
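
    The split described in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: both recognizers are stand-ins that operate on rough text rather than audio, and the function names, the segment format, and the fuzzy name matching via difflib are all assumptions made for illustration.

```python
from difflib import get_close_matches


def recognize_local(rough_text, private_contacts):
    """Stand-in for the embedded recognizer (hypothetical).

    It may consult private data, here a contact list, which never
    leaves the device.
    """
    by_lower = {name.lower(): name for name in private_contacts}
    # cutoff=0.0 means the closest contact is always returned if any exist.
    best = get_close_matches(rough_text.lower(), list(by_lower), n=1, cutoff=0.0)
    return by_lower[best[0]] if best else rough_text


def recognize_remote(rough_text):
    """Stand-in for the remote recognizer (hypothetical); sees no private data."""
    return rough_text.strip().lower()


def hybrid_recognize(segments, private_contacts):
    """Combine local and remote results into one recognition result.

    `segments` is an assumed format: (rough_text, needs_private_data) pairs
    in utterance order.
    """
    parts = []
    for rough_text, needs_private_data in segments:
        if needs_private_data:
            parts.append(recognize_local(rough_text, private_contacts))
        else:
            parts.append(recognize_remote(rough_text))
    return " ".join(parts)


result = hybrid_recognize(
    [("Call", False), ("jon smyth", True)],
    ["John Smith", "Jane Doe"],
)
```

    In this sketch only the segment flagged as needing private data is matched against the contact list; how a real system decides which part of the speech goes to which recognizer is not specified in the abstract.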


    SYSTEM AND METHOD FOR CREATING AND SHARING PLANS THROUGH MULTIMODAL DIALOG
    3.
    Invention application
    Status: Granted

    Publication number: US20160179908A1

    Publication date: 2016-06-23

    Application number: US14577311

    Filing date: 2014-12-19

    CPC classification number: G06F3/04847 G06F17/3087

    Abstract: Methods, systems, devices, and media for creating a plan through multimodal search inputs are provided. A multimodal virtual assistant receives a first search request which comprises a geographic area. First search results are displayed in response to the first search request being received. The first search results are based on the first search request and correspond to the geographic area. Each of the first search results is associated with a geographic location. The multimodal virtual assistant receives a selection of one of the first search results, and adds the selected one of the first search results to a plan. A second search request is received after the selection, and second search results are displayed in response to the second search request being received. The second search results are based on the second search request and correspond to the geographic location of the selected one of the first search results.


    SYSTEM AND METHOD FOR LOCALIZED ERROR DETECTION OF RECOGNITION RESULTS
    4.
    Invention application
    Status: Granted

    Publication number: US20160155445A1

    Publication date: 2016-06-02

    Application number: US14557030

    Filing date: 2014-12-01

    CPC classification number: G10L15/22 G10L15/01 G10L15/1822 H04M2250/74

    Abstract: A system, method and computer-readable storage devices are disclosed for using targeted clarification (TC) questions in dialog systems in a multimodal virtual agent system (MVA) providing access to information about movies, restaurants, and musical events. In contrast with open-domain spoken systems, the MVA application covers a domain with a fixed set of concepts and uses a natural language understanding (NLU) component to mark concepts in automatically recognized speech. Instead of identifying an error segment, localized error detection (LED) identifies which of the concepts are likely to be present and correct using domain knowledge, automatic speech recognition (ASR), and NLU tags and scores. If at least one concept is identified to be present but not correct, the TC component uses this information to generate a targeted clarification question. This approach computes probability distributions of concept presence and correctness for each user utterance, which can apply to automatic learning for clarification policies.
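
    The decision rule in this abstract (ask a targeted question when a concept is judged present but not correct) can be sketched in Python as follows. The per-concept probabilities are taken as given inputs; the 0.5 threshold, the function name, the question wording, and the choice to ask about the least-correct concept are illustrative assumptions, not details from the application.

```python
def targeted_clarification(concept_scores, threshold=0.5):
    """Return a clarification question, or None if nothing needs clarifying.

    `concept_scores` maps a concept name to a pair
    (p_present, p_correct), assumed to come from an upstream
    localized-error-detection model. The threshold is an assumption.
    """
    flagged = [
        (p_correct, concept)
        for concept, (p_present, p_correct) in concept_scores.items()
        if p_present >= threshold and p_correct < threshold
    ]
    if not flagged:
        return None
    # Ask about the concept that is least likely to be correct.
    _, concept = min(flagged)
    return f"Which {concept} did you mean?"


question = targeted_clarification(
    {"cuisine": (0.9, 0.8), "location": (0.95, 0.2), "time": (0.1, 0.1)}
)
```

    Here "location" is likely present but unlikely to be correct, so it is the only concept flagged; "time" is ignored because it is probably absent from the utterance.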


    SYSTEM AND METHOD FOR INITIATING MULTI-MODAL SPEECH RECOGNITION USING A LONG-TOUCH GESTURE
    5.
    Invention application
    Status: Under examination (published)

    Publication number: US20160124706A1

    Publication date: 2016-05-05

    Application number: US14529766

    Filing date: 2014-10-31

    Abstract: A system, method and computer-readable storage devices are disclosed for multi-modal interactions with a system via a long-touch gesture on a touch-sensitive display. A system operating per this disclosure can receive a multi-modal input comprising speech and a touch on a display, wherein the speech comprises a pronoun. When the touch on the display has a duration longer than a threshold duration, the system can identify an object within a threshold distance of the touch, associate the object with the pronoun in the speech to yield an association, and perform an action based on the speech and the association.
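
    The steps in this abstract (duration check, nearby-object lookup, pronoun association) can be sketched in Python. This is a minimal illustration under stated assumptions: speech is already transcribed to text, objects are named points in screen coordinates, and the pronoun list, the threshold values, and the function name are all hypothetical.

```python
import math

# Assumed pronoun list; the abstract does not enumerate pronouns.
PRONOUNS = {"this", "that", "it", "these", "those"}


def resolve_long_touch(speech, touch_xy, touch_duration, objects,
                       min_duration=0.5, max_distance=40.0):
    """Associate a spoken pronoun with the object under a long touch.

    `objects` maps object names to (x, y) positions. Thresholds are
    illustrative (seconds and pixels). Returns None when the touch is
    too short, no pronoun is spoken, or no object is close enough.
    """
    if touch_duration <= min_duration:
        return None
    words = speech.split()
    pronoun = next((w for w in words if w.lower() in PRONOUNS), None)
    if pronoun is None:
        return None
    nearby = [
        (math.dist(touch_xy, xy), name)
        for name, xy in objects.items()
        if math.dist(touch_xy, xy) <= max_distance
    ]
    if not nearby:
        return None
    _, target = min(nearby)  # the closest object within the threshold
    command = " ".join(target if w == pronoun else w for w in words)
    return {"pronoun": pronoun, "object": target, "command": command}


places = {"Cafe Luna": (100, 100), "City Museum": (300, 120)}
action = resolve_long_touch("call this", (105, 98), 0.8, places)
```

    Choosing the nearest object when several fall within the threshold distance is a design choice of this sketch; the abstract only requires that the object be within the threshold distance of the touch.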


    SYSTEM AND METHOD FOR ROBUST PERSONALIZATION OF SPEECH RECOGNITION
    8.
    Invention application
    Status: Under examination (published)

    Publication number: US20140136210A1

    Publication date: 2014-05-15

    Application number: US13676531

    Filing date: 2012-11-14

    CPC classification number: G10L15/07 G10L15/183

    Abstract: Personalization of speech recognition while maintaining privacy of user data is achieved by transmitting data associated with received speech to a speech recognition service and receiving a result from the speech recognition service. The speech recognition service result is generated from a general purpose speech language model. The system generates an input finite state machine from the speech recognition result and composes the input finite state machine with a phone edit finite state machine, to yield a resulting finite state machine. The system composes the resulting finite state machine with a user data finite state machine to yield a second resulting finite state machine, and uses a best path through the second resulting finite state machine to yield a user specific speech recognition result.

