专利检索 ap:("AT & T Intellectual Property I, L.P.") AND inv:"Michael Johnston" 第 1 页

1.

发明申请
System and Method for Improving Speech Recognition Accuracy Using Textual Context 有权
标题翻译：使用文本语境提高语音识别精度的系统和方法

公开(公告)号：US20150279361A1

公开(公告)日：2015-10-01

申请号：US14737708

申请日：2015-06-12

申请人： AT&T Intellectual Property I, L.P.

发明人： Dan Melamed , Srinivas Bangalore , Michael Johnston

IPC分类号： G10L15/18

CPC分类号： G10L25/51 , G06F3/162 , G10L15/05 , G10L15/07 , G10L15/18 , G10L15/183 , G10L15/19 , G10L15/30 , G10L17/04 , G10L2015/228

摘要： Disclosed herein are systems, methods, and computer-readable storage media for improving speech recognition accuracy using textual context. The method includes retrieving a recorded utterance, capturing text from a device display associated with the spoken dialog and viewed by one party to the recorded utterance, and identifying words in the captured text that are relevant to the recorded utterance. The method further includes adding the identified words to a dynamic language model, and recognizing the recorded utterance using the dynamic language model. The recorded utterance can be a spoken dialog. A time stamp can be assigned to each identified word. The method can include adding identified words to and/or removing identified words from the dynamic language model based on their respective time stamps. A screen scraper can capture text from the device display associated with the recorded utterance. The device display can contain customer service data.

摘要翻译： 本文公开了用于使用文本上下文改善语音识别精度的系统，方法和计算机可读存储介质。所述方法包括检索记录的话语，从与所述口语对话相关联的设备显示中捕获文本并由一方观看所述记录的话语，以及识别与记录的话语相关的所捕获的文本中的单词。该方法还包括将所识别的词添加到动态语言模型中，并使用动态语言模型来识别记录的话语。记录的话语可以是一个口语对话。时间戳可以分配给每个识别的单词。该方法可以包括基于它们各自的时间戳将识别的词添加到动态语言模型中和/或从动态语言模型中移除所识别的单词。屏幕刮刀可以从与记录的话语相关联的设备显示中捕获文本。设备显示可以包含客户服务数据。

2.

发明申请
System and Method for Enhancing Voice-Enabled Search Based on Automated Demographic Identification 有权
标题翻译：基于自动人口识别的增强语音搜索的系统和方法

公开(公告)号：US20130218561A1

公开(公告)日：2013-08-22

申请号：US13847173

申请日：2013-03-19

申请人： AT & T Intellectual Property I, L.P.

发明人： Michael Johnston , Srinivas Bangalore , Junlan Feng , Taniya Mishra

IPC分类号： G06F17/30

CPC分类号： G06F17/30026 , G06F17/30976 , G06F17/30979 , G10L15/22 , G10L2015/227

摘要： Disclosed herein are systems, methods, and non-transitory computer-readable storage media for approximating responses to a user speech query in voice-enabled search based on metadata that include demographic features of the speaker. A system practicing the method recognizes received speech from a speaker to generate recognized speech, identifies metadata about the speaker from the received speech, and feeds the recognized speech and the metadata to a question-answering engine. Identifying the metadata about the speaker is based on voice characteristics of the received speech. The demographic features can include age, gender, socio-economic group, nationality, and/or region. The metadata identified about the speaker from the received speech can be combined with or override self-reported speaker demographic information.

摘要翻译： 本文公开的是基于包括说话者的人口统计特征的元数据的用于在基于语音的搜索中近似对用户语音查询的响应的系统，方法和非暂时计算机可读存储介质。实施该方法的系统识别来自扬声器的接收到的语音以产生识别的语音，从接收到的语音识别关于说话者的元数据，并将识别的语音和元数据馈送到问答引擎。识别关于扬声器的元数据是基于所接收语音的语音特征。人口特征可以包括年龄，性别，社会经济群体，国籍和/或地区。从接收到的语音中识别的关于说话者的元数据可以与自报告的说话者人口统计信息进行组合或覆盖。

3.

发明授权
Applied automatic demographic analysis 有权
标题翻译：应用自动人口学分析

公开(公告)号：US09479802B2

公开(公告)日：2016-10-25

申请号：US14621013

申请日：2015-02-12

申请人： AT&T INTELLECTUAL PROPERTY I, L.P.

发明人： Harry E. Blanchard , Hisao Chang , Bernard S. Renger , Michael Johnston

IPC分类号： H04N21/234 , H04N21/2343 , H04N21/258 , H04N21/44 , H04N21/45 , H04N21/81 , H04N21/442 , H04N7/16 , G06Q30/02 , H04H60/33 , H04H60/45 , H04N21/454

CPC分类号： H04N21/812 , G06Q30/0269 , H04H60/33 , H04H60/45 , H04N7/165 , H04N21/23424 , H04N21/23439 , H04N21/25883 , H04N21/44016 , H04N21/44218 , H04N21/4532 , H04N21/4542

摘要： A system for managing a data stream that is transmitted to an environment is provided. The system includes a receiver that receives the data stream. The data stream includes a first program, with the first program configured to be displayed in the environment. An input receives information of an individual in the environment. A processor analyzes the information, determines a demographic descriptor of the individual based on the information, and correlates the demographic descriptor of the individual with a content of the first program to determine whether a predetermined condition is satisfied. The processor further determines a second program based on the demographic descriptor of the individual and modifies the first program based on the second program when the predetermined condition is satisfied.

摘要翻译： 提供了一种用于管理传输到环境的数据流的系统。该系统包括接收数据流的接收器。数据流包括第一程序，其中第一程序被配置为在环境中显示。输入接收环境中个人的信息。处理器分析信息，基于该信息确定个体的人口统计描述符，并且将个体的人口统计描述符与第一节目的内容相关联，以确定是否满足预定条件。处理器还基于个人的人口统计描述符进一步确定第二节目，并且当满足预定条件时，基于第二节目修改第一节目。

4.

发明授权
System and method for an iterative disambiguation interface 有权
标题翻译：迭代消歧界面的系统和方法

公开(公告)号：US09286386B2

公开(公告)日：2016-03-15

申请号：US14558780

申请日：2014-12-03

申请人： AT&T Intellectual Property I, L.P.

发明人： Michael Johnston

IPC分类号： G06F17/30

CPC分类号： G06F17/30843 , G06F17/30976

摘要： Disclosed herein are systems, methods, and computer-readable storage media for an iterative disambiguation interface. A system practicing the method receives a search query formatted according to a standard XML markup language for containing and annotating interpretations of user input, the search query being based on a natural language spoken query from a user and retrieves search results based on the search query. The system transmits the search results to a user device and iteratively receives multimodal input from the user to change search attributes and transmits updated search results to the user device based on the changed search attributes. The search results can include a link to additional information, such as a video presentation, related to the search results. The standard XML markup language can be Extensible MultiModal Annotation (EMMA) markup language from W3C. The system can generate an iteration transaction history for each multimodal input and updated search result.

摘要翻译： 这里公开了用于迭代消歧界面的系统，方法和计算机可读存储介质。实施该方法的系统接收根据用于包含和注释用户输入的标准XML标记语言格式化的搜索查询，所述搜索查询基于来自用户的自然语言查询，并且基于搜索查询来检索搜索结果。该系统将搜索结果发送到用户设备，并且迭代地接收来自用户的多模式输入以改变搜索属性，并且基于改变的搜索属性将更新的搜索结果发送到用户设备。搜索结果可以包括与搜索结果相关的附加信息的链接，例如视频呈现。标准XML标记语言可以是来自W3C的可扩展多模态注释（EMMA）标记语言。系统可以为每个多模态输入和更新的搜索结果生成迭代事务历史记录。

5.

发明授权
System and method for improving speech recognition accuracy using textual context 有权

公开(公告)号：US09355638B2

公开(公告)日：2016-05-31

申请号：US14737708

申请日：2015-06-12

申请人： AT&T Intellectual Property I, L.P.

发明人： Dan Melamed , Srinivas Bangalore , Michael Johnston

IPC分类号： G10L15/06 , G10L15/18 , G10L15/19 , G10L17/04 , G10L15/183

CPC分类号： G10L25/51 , G06F3/162 , G10L15/05 , G10L15/07 , G10L15/18 , G10L15/183 , G10L15/19 , G10L15/30 , G10L17/04 , G10L2015/228

摘要： Disclosed herein are systems, methods, and computer-readable storage media for improving speech recognition accuracy using textual context. The method includes retrieving a recorded utterance, capturing text from a device display associated with the spoken dialog and viewed by one party to the recorded utterance, and identifying words in the captured text that are relevant to the recorded utterance. The method further includes adding the identified words to a dynamic language model, and recognizing the recorded utterance using the dynamic language model. The recorded utterance can be a spoken dialog. A time stamp can be assigned to each identified word. The method can include adding identified words to and/or removing identified words from the dynamic language model based on their respective time stamps. A screen scraper can capture text from the device display associated with the recorded utterance. The device display can contain customer service data.

6.

发明授权
System and method for improving speech recognition accuracy using textual context 有权
标题翻译：使用文本语境提高语音识别精度的系统和方法

公开(公告)号：US09058808B2

公开(公告)日：2015-06-16

申请号：US14061855

申请日：2013-10-24

申请人： AT&T Intellectual Property I, L.P.

发明人： Dan Melamed , Srinivas Bangalore , Michael Johnston

IPC分类号： G10L15/04 , G10L17/04 , G10L15/19 , G10L15/183

CPC分类号： G10L25/51 , G06F3/162 , G10L15/05 , G10L15/07 , G10L15/18 , G10L15/183 , G10L15/19 , G10L15/30 , G10L17/04 , G10L2015/228

摘要： Disclosed herein are systems, methods, and computer-readable storage media for improving speech recognition accuracy using textual context. The method includes retrieving a recorded utterance, capturing text from a device display associated with the spoken dialog and viewed by one party to the recorded utterance, and identifying words in the captured text that are relevant to the recorded utterance. The method further includes adding the identified words to a dynamic language model, and recognizing the recorded utterance using the dynamic language model. The recorded utterance can be a spoken dialog. A time stamp can be assigned to each identified word. The method can include adding identified words to and/or removing identified words from the dynamic language model based on their respective time stamps. A screen scraper can capture text from the device display associated with the recorded utterance. The device display can contain customer service data.

摘要翻译： 本文公开了用于使用文本上下文改善语音识别精度的系统，方法和计算机可读存储介质。所述方法包括检索记录的话语，从与所述口语对话相关联的设备显示中捕获文本并由一方观看所述记录的话语，以及识别与记录的话语相关的所捕获的文本中的单词。该方法还包括将所识别的词添加到动态语言模型中，并使用动态语言模型来识别记录的话语。记录的话语可以是一个口语对话。时间戳可以分配给每个识别的单词。该方法可以包括基于它们各自的时间戳将识别的词添加到动态语言模型中和/或从动态语言模型中移除所识别的单词。屏幕刮刀可以从与记录的话语相关联的设备显示中捕获文本。设备显示可以包含客户服务数据。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类