专利检索 ap:"Trausti Kristjansson" 第 4 页

31.

发明申请
Auto-translation for multi user audio and video 有权

公开(公告)号：US20150154183A1

公开(公告)日：2015-06-04

申请号：US13316689

申请日：2011-12-12

申请人： Trausti Kristjansson , John Huang , Yu-Kuan Lin , Hung-ying Tyan , Jakob David Uszkoreit , Joshua James Estelle , Chung-yi Wang , Kirill Buryak , Yusuke Konishi

发明人： Trausti Kristjansson , John Huang , Yu-Kuan Lin , Hung-ying Tyan , Jakob David Uszkoreit , Joshua James Estelle , Chung-yi Wang , Kirill Buryak , Yusuke Konishi

IPC分类号： G06F17/28 , G10L15/26 , G10L13/00

CPC分类号： G06F17/289 , G10L13/00 , G10L15/26 , H04M3/56 , H04M3/568 , H04M2203/2061 , H04M2242/12 , H04N7/15 , H04N7/152

摘要： The disclosed subject matter provides a system, computer readable storage medium, and a method providing an audio and textual transcript of a communication. A conferencing services may receive audio or audio visual signals from a plurality of different devices that receive voice communications from participants in a communication, such as a chat or teleconference. The audio signals representing voice (speech) communications input into respective different devices by the participants. A translation services server may receive over a separate communication channel the audio signals for translation into a second language. As managed by the translation services server, the audio signals may be converted into textual data. The textual data may be translated into text of different languages based the language preferences of the end user devices in the teleconference. The translated text may be further translated into audio signals.

32.

发明授权
Multisensory speech detection 有权
标题翻译：多感觉语音检测

公开(公告)号：US08862474B2

公开(公告)日：2014-10-14

申请号：US13618720

申请日：2012-09-14

申请人： Dave Burke , Michael J. LeBeau , Konrad Gianno , Trausti Kristjansson , John Nicholas Jitkoff , Andrew W. Senior

发明人： Dave Burke , Michael J. LeBeau , Konrad Gianno , Trausti Kristjansson , John Nicholas Jitkoff , Andrew W. Senior

IPC分类号： G01L21/00 , G06F3/0346 , G10L25/78

CPC分类号： G10L25/78 , G06F3/0346 , G06F3/167 , G10L15/10 , G10L15/22 , G10L15/265 , G10L17/00 , G10L25/21 , H04M1/72569 , H04M2250/12 , H04M2250/74 , H04R1/08 , H04W4/026

摘要： A computer-implemented method of multisensory speech detection is disclosed. The method comprises determining an orientation of a mobile device and determining an operating mode of the mobile device based on the orientation of the mobile device. The method further includes identifying speech detection parameters that specify when speech detection begins or ends based on the determined operating mode and detecting speech from a user of the mobile device based on the speech detection parameters.

摘要翻译： 公开了一种计算机实现的多感觉语音检测方法。该方法包括基于移动设备的方向来确定移动设备的方位并确定移动设备的操作模式。该方法还包括识别基于所确定的操作模式来指定语音检测何时开始或结束的语音检测参数，以及基于语音检测参数来检测来自移动设备的用户的语音。

33.

发明授权
Word-level correction of speech input 有权

公开(公告)号：US08494852B2

公开(公告)日：2013-07-23

申请号：US12913407

申请日：2010-10-27

申请人： Michael J. LeBeau , William J. Byrne , John Nicholas Jitkoff , Brandon M. Ballinger , Trausti Kristjansson

发明人： Michael J. LeBeau , William J. Byrne , John Nicholas Jitkoff , Brandon M. Ballinger , Trausti Kristjansson

IPC分类号： G10L15/26 , G10L17/00 , G10L21/06 , G10L11/00 , G10L21/00 , G10L15/04 , G10L15/00 , G06F17/27 , G06F17/21

CPC分类号： G10L15/22 , G06F3/0482 , G06F3/04842 , G06F3/04886 , G06F17/2241 , G06F17/24 , G06F17/273 , G06F17/277 , G10L15/01 , G10L15/26 , G10L15/265 , G10L15/30

摘要： The subject matter of this specification can be implemented in, among other things, a computer-implemented method for correcting words in transcribed text including receiving speech audio data from a microphone. The method further includes sending the speech audio data to a transcription system. The method further includes receiving a word lattice transcribed from the speech audio data by the transcription system. The method further includes presenting one or more transcribed words from the word lattice. The method further includes receiving a user selection of at least one of the presented transcribed words. The method further includes presenting one or more alternate words from the word lattice for the selected transcribed word. The method further includes receiving a user selection of at least one of the alternate words. The method further includes replacing the selected transcribed word in the presented transcribed words with the selected alternate word.

34.

发明授权
Acoustic model adaptation using geographic information 有权
标题翻译：使用地理信息的声学模型适应

公开(公告)号：US08468012B2

公开(公告)日：2013-06-18

申请号：US12787568

申请日：2010-05-26

申请人： Matthew I. Lloyd , Trausti Kristjansson

发明人： Matthew I. Lloyd , Trausti Kristjansson

IPC分类号： G06F17/20

CPC分类号： G10L15/22 , G10L15/065 , G10L15/30

摘要： Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enhancing speech recognition accuracy. In one aspect, a method includes receiving an audio signal that corresponds to an utterance recorded by a mobile device, determining a geographic location associated with the mobile device, adapting one or more acoustic models for the geographic location, and performing speech recognition on the audio signal using the one or more acoustic models model that are adapted for the geographic location.

摘要翻译： 方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于增强语音识别精度。在一个方面，一种方法包括接收对应于由移动设备记录的话语的音频信号，确定与移动设备相关联的地理位置，调整用于地理位置的一个或多个声学模型，以及对该音频执行语音识别使用适合于地理位置的一个或多个声学模型模型的信号。

35.

发明授权
Predictive pre-recording of audio for voice input 有权
标题翻译：用于语音输入的音频预测录像

公开(公告)号：US08428759B2

公开(公告)日：2013-04-23

申请号：US12732827

申请日：2010-03-26

申请人： Trausti Kristjansson , Matthew I. Lloyd

发明人： Trausti Kristjansson , Matthew I. Lloyd

IPC分类号： G06F17/00 , G10L21/00 , H04M3/42 , G01P15/00

CPC分类号： G10L15/22 , G06F1/1626 , G06F1/1694 , G06F3/167

摘要： Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for providing predictive pre-recording of audio for voice input. In one aspect, a method includes establishing, as input data, state data that references a state of a mobile device and sensor data that is sensed by one or more sensors of the mobile device, applying a rule or a probabilistic model to the input data, inferring, based on applying the rule or the probabilistic model to the input data, that a user of the mobile device is likely to initiate voice input, and invoking one or more functionalities of the mobile device in response to inferring that the user is likely to initiate voice input.

摘要翻译： 方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于提供用于语音输入的音频的预测预记录。在一个方面，一种方法包括建立参考移动设备的状态的状态数据和由移动设备的一个或多个传感器感测到的传感器数据的状态数据，将规则或概率模型应用于输入数据，基于将规则或概率模型应用于输入数据，推断出移动设备的用户可能发起语音输入，并且响应于推断用户可能会调用移动设备的一个或多个功能启动语音输入。

36.

发明申请
Multisensory Speech Detection 有权

公开(公告)号：US20130013315A1

公开(公告)日：2013-01-10

申请号：US13618720

申请日：2012-09-14

申请人： Dave Burke , Michael J. LeBeau , Konrad Gianno , Trausti Kristjansson , John Nicholas Jitkoff , Andrew W. Senior

发明人： Dave Burke , Michael J. LeBeau , Konrad Gianno , Trausti Kristjansson , John Nicholas Jitkoff , Andrew W. Senior

IPC分类号： G10L21/00

CPC分类号： G10L25/78 , G06F3/0346 , G06F3/167 , G10L15/10 , G10L15/22 , G10L15/265 , G10L17/00 , G10L25/21 , H04M1/72569 , H04M2250/12 , H04M2250/74 , H04R1/08 , H04W4/026

摘要： A computer-implemented method of multisensory speech detection is disclosed. The method comprises determining an orientation of a mobile device and determining an operating mode of the mobile device based on the orientation of the mobile device. The method further includes identifying speech detection parameters that specify when speech detection begins or ends based on the determined operating mode and detecting speech from a user of the mobile device based on the speech detection parameters.

37.

发明申请
PREDICTIVE PRE-RECORDING OF AUDIO FOR VOICE INPUT 有权
标题翻译：用于语音输入的音频预测预录

公开(公告)号：US20120296655A1

公开(公告)日：2012-11-22

申请号：US13563504

申请日：2012-07-31

申请人： Trausti Kristjansson , Matthew I. Lloyd

发明人： Trausti Kristjansson , Matthew I. Lloyd

IPC分类号： G10L21/00

CPC分类号： G10L15/22 , G06F1/1626 , G06F1/1694 , G06F3/167

摘要： Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for providing predictive pre-recording of audio for voice input. In one aspect, a method includes obtaining sensor data from one or more sensors of a mobile device while the mobile device is operating in an inactive state, determining that a user of the mobile device is interacting with the mobile device based on the sensor data, invoking voice input functionality of the mobile device in response to determining that the user of the mobile device is interacting with the mobile device, detecting a voice input, and activating the mobile device in response to detecting the voice input.

摘要翻译： 方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于提供用于语音输入的音频的预测预记录。一方面，一种方法包括：当移动设备在非活动状态下工作时，从移动设备的一个或多个传感器获取传感器数据，基于传感器数据确定移动设备的用户正在与移动设备交互，响应于确定移动设备的用户正在与移动设备进行交互，检测语音输入以及响应于检测到语音输入而激活移动设备，来调用移动设备的语音输入功能。

38.

发明授权
Geotagged environmental audio for enhanced speech recognition accuracy 有权
标题翻译：地理标记环境音频，用于增强语音识别精度

公开(公告)号：US08265928B2

公开(公告)日：2012-09-11

申请号：US12760147

申请日：2010-04-14

申请人： Trausti Kristjansson , Matthew I. Lloyd

发明人： Trausti Kristjansson , Matthew I. Lloyd

IPC分类号： G10L21/02 , G10L15/00

CPC分类号： G10L21/0208 , G10L15/20

摘要： Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enhancing speech recognition accuracy. In one aspect, a method includes receiving geotagged audio signals that correspond to environmental audio recorded by multiple mobile devices in multiple geographic locations, receiving an audio signal that corresponds to an utterance recorded by a particular mobile device, determining a particular geographic location associated with the particular mobile device, generating a noise model for the particular geographic location using a subset of the geotagged audio signals, where noise compensation is performed on the audio signal that corresponds to the utterance using the noise model that has been generated for the particular geographic location.

摘要翻译： 方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于增强语音识别精度。一方面，一种方法包括接收对应于多个地理位置中的多个移动设备记录的环境音频的地理标记音频信号，接收对应于由特定移动设备记录的话语的音频信号，确定与该特定移动设备相关联的特定地理位置特定的移动设备，使用所述地理标记的音频信号的子集来生成针对所述特定地理位置的噪声模型，其中使用对于所述特定地理位置生成的所述噪声模型对与所述话语相对应的所述音频信号执行噪声补偿。

39.

发明申请
Speech and Noise Models for Speech Recognition 有权
标题翻译：语音识别语音和噪声模型

公开(公告)号：US20110307253A1

公开(公告)日：2011-12-15

申请号：US12814665

申请日：2010-06-14

申请人： Matthew I. Lloyd , Trausti Kristjansson

发明人： Matthew I. Lloyd , Trausti Kristjansson

IPC分类号： G10L15/20 , G10L21/02

CPC分类号： G10L15/20 , G10L21/0208

摘要： An audio signal generated by a device based on audio input from a user may be received. The audio signal may include at least a user audio portion that corresponds to one or more user utterances recorded by the device. A user speech model associated with the user may be accessed and a determination may be made background audio in the audio signal is below a defined threshold. In response to determining that the background audio in the audio signal is below the defined threshold, the accessed user speech model may be adapted based on the audio signal to generate an adapted user speech model that models speech characteristics of the user. Noise compensation may be performed on the received audio signal using the adapted user speech model to generate a filtered audio signal with reduced background audio compared to the received audio signal.

摘要翻译： 可以接收由基于来自用户的音频输入的设备生成的音频信号。音频信号可以包括至少一个对应于由该设备记录的一个或多个用户话语的用户音频部分。可以访问与用户相关联的用户语音模型，并且可以确定音频信号中的背景音频低于定义的阈值。响应于确定音频信号中的背景音频低于定义的阈值，可以基于音频信号来调整所访问的用户语音模型，以生成对用户的语音特征进行建模的适配的用户语音模型。可以使用适配的用户语音模型对所接收的音频信号执行噪声补偿，以生成与接收的音频信号相比具有降低的背景音频的滤波音频信号。

40.

发明申请
GEOTAGGED ENVIRONMENTAL AUDIO FOR ENHANCED SPEECH RECOGNITION ACCURACY 有权
标题翻译： GEOTAGGED环境音频用于增强语音识别精度

公开(公告)号：US20110257974A1

公开(公告)日：2011-10-20

申请号：US12760147

申请日：2010-04-14

申请人： Trausti Kristjansson , Matthew I. Lloyd

发明人： Trausti Kristjansson , Matthew I. Lloyd

IPC分类号： G10L15/00 , H04W64/00

CPC分类号： G10L21/0208 , G10L15/20

摘要： Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for enhancing speech recognition accuracy. In one aspect, a method includes receiving geotagged audio signals that correspond to environmental audio recorded by multiple mobile devices in multiple geographic locations, receiving an audio signal that corresponds to an utterance recorded by a particular mobile device, determining a particular geographic location associated with the particular mobile device, generating a noise model for the particular geographic location using a subset of the geotagged audio signals, where noise compensation is performed on the audio signal that corresponds to the utterance using the noise model that has been generated for the particular geographic location.

摘要翻译： 方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于增强语音识别精度。一方面，一种方法包括接收对应于多个地理位置中的多个移动设备记录的环境音频的地理标记音频信号，接收对应于由特定移动设备记录的话语的音频信号，确定与该特定移动设备相关联的特定地理位置特定的移动设备，使用所述地理标记的音频信号的子集来生成针对所述特定地理位置的噪声模型，其中使用对于所述特定地理位置生成的所述噪声模型对与所述话语相对应的所述音频信号执行噪声补偿。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类