Patent search ap:"Andrej LJOLJE" Page 4

31.

Granted patent
System and method for pronunciation modeling (status: in force)

Publication number: US08862470B2

Publication date: 2014-10-14

Application number: US13302380

Application date: 2011-11-22

Applicant: Andrej Ljolje, Alistair D. Conkie, Ann K. Syrdal

Inventor: Andrej Ljolje, Alistair D. Conkie, Ann K. Syrdal

IPC: G10L15/187, G10L15/183

CPC classification number: G10L15/187, G10L15/183, G10L2015/025

Abstract: Systems, computer-implemented methods, and tangible computer-readable media for generating a pronunciation model. The method includes identifying a generic model of speech composed of phonemes, identifying a family of interchangeable phonemic alternatives for a phoneme in the generic model of speech, labeling the family of interchangeable phonemic alternatives as referring to the same phoneme, and generating a pronunciation model which substitutes each family for each respective phoneme. In one aspect, the generic model of speech is a vocal tract length normalized acoustic model. Interchangeable phonemic alternatives can represent a same phoneme for different dialectal classes. An interchangeable phonemic alternative can include a string of phonemes.
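
The substitution step described in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `FAMILIES` table, the phoneme symbols, and the function name `expand_pronunciation` are all hypothetical, chosen only to show how replacing each phoneme with its family of interchangeable alternatives (some of which are strings of phonemes) yields a set of pronunciation variants.

```python
from itertools import product

# Hypothetical families: each canonical phoneme maps to the alternatives that
# are labeled as referring to the same phoneme. Symbols are illustrative only.
FAMILIES = {
    "ay": [("ay",), ("aa",)],        # e.g. a dialectal variant
    "t": [("t",), ("dx",)],
    "er": [("er",), ("ah", "r")],    # an alternative that is a string of phonemes
}

def expand_pronunciation(phonemes, families=FAMILIES):
    """Return every variant obtained by substituting each phoneme with its family."""
    # Phonemes without a family keep a single option: themselves.
    options = [families.get(p, [(p,)]) for p in phonemes]
    # Take one alternative per position and flatten it into one phoneme sequence.
    return [tuple(ph for alt in combo for ph in alt) for combo in product(*options)]

variants = expand_pronunciation(["w", "ay", "t", "er"])
```

With three two-member families in the word, the sketch produces eight variants, including the canonical sequence itself.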

32.

Granted patent
Correlated call analysis for identified patterns in call transcriptions (status: lapsed)

Publication number: US08756065B2

Publication date: 2014-06-17

Application number: US12343981

Application date: 2008-12-24

Applicant: I. Dan Melamed, Yeon-Jun Kim, Bernard S. Renger, Andrej Ljolje, David J. Smith

Inventor: I. Dan Melamed, Yeon-Jun Kim, Bernard S. Renger, Andrej Ljolje, David J. Smith

IPC: G10L21/00, G10L25/00, G10L15/00

CPC classification number: G06F17/2715, G10L15/26, G10L17/00

Abstract: A method of correlating received communication data with operational communication characteristics is provided. The method includes receiving audible input from a source in a communication over a communications network, recording the received audible input, and transcribing the recorded audible input into a transcript. The method further includes outputting the transcript, specifying features of the transcript to be analyzed, specifying and recording operational communication characteristics particular to the communication, analyzing the transcript for the specified features to identify patterns associated with the audible input, computing statistical correlations of the identified patterns with the operational communication characteristics, and outputting results of the computed statistical correlations on a user interface.

33.

Granted patent
System and method for handling missing speech data (status: in force)

Publication number: US08751229B2

Publication date: 2014-06-10

Application number: US12275920

Application date: 2008-11-21

Applicant: Andrej Ljolje, Alistair D. Conkie

Inventor: Andrej Ljolje, Alistair D. Conkie

IPC: G10L15/00

CPC classification number: G10L15/1815, G06F17/27, G10L13/027, G10L15/02, G10L15/04, G10L15/20, G10L25/87, G10L2015/025

Abstract: Disclosed herein are systems, computer-implemented methods, and tangible computer-readable media for handling missing speech data. The computer-implemented method includes receiving speech with a missing segment, generating a plurality of hypotheses for the missing segment, identifying a best hypothesis for the missing segment, and recognizing the received speech by inserting the identified best hypothesis for the missing segment. In another method embodiment, the final step is replaced with synthesizing the received speech by inserting the identified best hypothesis for the missing segment. In one aspect, the method further includes identifying a duration for the missing segment and generating the plurality of hypotheses of the identified duration for the missing segment. The step of identifying the best hypothesis for the missing segment can be based on speech context, a pronouncing lexicon, and/or a language model. Each hypothesis can have an identical acoustic score.
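
As a rough illustration of the hypothesis-selection step in this abstract, the Python sketch below fills a one-word gap by scoring candidate words with a toy bigram language model. Every candidate receives the same acoustic score, so the language-model context decides. The bigram table, the back-off value, and the function names are assumptions made for the example and are not taken from the patent.

```python
import math

# Toy bigram log-probabilities (hypothetical values, for illustration only).
BIGRAM_LOGP = {
    ("call", "me"): math.log(0.5),
    ("call", "the"): math.log(0.3),
    ("me", "tomorrow"): math.log(0.4),
    ("the", "tomorrow"): math.log(0.01),
}
UNSEEN_LOGP = math.log(1e-4)  # assumed back-off score for unseen bigrams

def lm_score(words):
    """Sum bigram log-probabilities over adjacent word pairs."""
    return sum(BIGRAM_LOGP.get(pair, UNSEEN_LOGP) for pair in zip(words, words[1:]))

def fill_missing(left, candidates, right, acoustic_score=0.0):
    """Insert the candidate that scores best in the surrounding context."""
    def total(candidate):
        # The acoustic score is identical for every hypothesis, so only the
        # language-model score over the neighbouring words differs.
        return acoustic_score + lm_score(left[-1:] + [candidate] + right[:1])
    best = max(candidates, key=total)
    return left + [best] + right

result = fill_missing(["please", "call"], ["me", "the", "tea"], ["tomorrow"])
```

A fuller system would generate hypotheses matching the identified duration of the missing segment and could also consult a pronouncing lexicon; the sketch covers only the language-model criterion.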

34.

Granted patent
Automatic disclosure detection (status: in force)

Publication number: US08412527B2

Publication date: 2013-04-02

Application number: US12490631

Application date: 2009-06-24

Applicant: I. Dan Melamed, Yeon-Jun Kim, Andrej Ljolje, Bernard S. Renger, David J. Smith

Inventor: I. Dan Melamed, Yeon-Jun Kim, Andrej Ljolje, Bernard S. Renger, David J. Smith

IPC: G10L15/18

CPC classification number: G10L25/63, G06F17/2881, G06Q10/06395, G10L15/04, G10L15/18, G10L15/1822, G10L15/26, G10L15/265

Abstract: A method of detecting pre-determined phrases to determine compliance quality is provided. The method includes determining whether at least one of an event or a precursor event has occurred based on a comparison between pre-determined phrases and a communication between a sender and a recipient in a communications network, and rating the recipient based on the presence of the pre-determined phrases associated with the event or the presence of the pre-determined phrases associated with the precursor event in the communication.
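
One possible reading of the phrase-comparison step in this abstract is sketched below in Python. The phrase lists, the rating labels, and the rule that a precursor phrase without a matching disclosure phrase counts against the recipient are all assumptions made for illustration; the abstract does not specify them.

```python
import re

# Hypothetical phrase lists; a real deployment would define its own.
EVENT_PHRASES = ["this call may be recorded"]
PRECURSOR_PHRASES = ["i would like to open an account"]

def normalize(text):
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    cleaned = re.sub(r"[^a-z0-9 ]+", " ", text.lower())
    return " ".join(cleaned.split())

def contains_any(transcript, phrases):
    """Return True if any pre-determined phrase occurs in the transcript."""
    text = normalize(transcript)
    return any(phrase in text for phrase in phrases)

def rate_recipient(transcript):
    """Rate a communication from the presence of event and precursor phrases."""
    event = contains_any(transcript, EVENT_PHRASES)
    precursor = contains_any(transcript, PRECURSOR_PHRASES)
    if event:
        return "compliant"        # the disclosure phrase was found
    if precursor:
        return "non-compliant"    # assumed rule: precursor seen, disclosure missing
    return "not applicable"       # neither kind of phrase occurred
```

For example, `rate_recipient("Hi, I would like to open an account.")` returns `"non-compliant"` under these assumed rules, because the precursor phrase appears without the disclosure phrase.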

35.

Patent application
SYSTEM AND METHOD FOR OPTIMIZING SPEECH RECOGNITION AND NATURAL LANGUAGE PARAMETERS WITH USER FEEDBACK (status: in force)

Publication number: US20120290298A1

Publication date: 2012-11-15

Application number: US13103665

Application date: 2011-05-09

Applicant: Andrej LJOLJE, Diamantino Antonio Caseiro, Mazin Gilbert, Vincent Goffin, Taniya Mishra

Inventor: Andrej LJOLJE, Diamantino Antonio Caseiro, Mazin Gilbert, Vincent Goffin, Taniya Mishra

IPC: G10L15/26

CPC classification number: G10L15/197, G10L15/063, G10L15/22

Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for assigning saliency weights to words of an ASR model. The saliency values assigned to words within an ASR model are based on human perception judgments of previous transcripts. These saliency values are applied as weights to modify an ASR model such that the results of the weighted ASR model in converting a spoken document to a transcript provide a more accurate and useful transcription to the user.

36.

Patent application
Systems and Methods of Providing Modified Media Content (status: in force)

Publication number: US20120227078A1

Publication date: 2012-09-06

Application number: US13471851

Application date: 2012-05-15

Applicant: Andrej Ljolje

Inventor: Andrej Ljolje

IPC: H04N21/643, H04N5/775

CPC classification number: H04N5/783, G06F17/30787, G06F17/30796, G06F17/30843, G10L19/09, G10L21/04, G11B27/005, G11B27/034, H04N5/44513, H04N5/781, H04N5/85, H04N21/4305, H04N21/8549

Abstract: A method includes receiving a command to provide media content configured to be sent to a display device for display at a particular scan rate. The media content includes audio data and video data. The method includes identifying high priority segments of the media content based on the audio data. The high priority segments are to be displayed by the display device at a presentation rate such that the high priority segments displayed at the presentation rate correspond to the media content displayed at the particular scan rate. The method also includes sending the high priority segments to the display device to provide video content and audio content of the requested media content for display.

37.

Granted patent
Systems and methods of providing modified media content (status: lapsed)

Publication number: US08204359B2

Publication date: 2012-06-19

Application number: US11725979

Application date: 2007-03-20

Applicant: Andrej Ljolje

Inventor: Andrej Ljolje

IPC: H04N9/80

CPC classification number: H04N5/783, G06F17/30787, G06F17/30796, G06F17/30843, G10L19/09, G10L21/04, G11B27/005, G11B27/034, H04N5/44513, H04N5/781, H04N5/85, H04N21/4305, H04N21/8549

Abstract: In an embodiment, a method of providing modified media content is disclosed and includes receiving media content that includes audio data and video data having a first number of video frames. The method also includes generating abstracted media content that includes portions of the video data and audio elements of the audio data, where the abstracted media content includes less than all of the video data and includes fewer video frames than the first number of video frames.

38.

Patent application
PREDICTING COMMUNICATION OUTCOME BASED ON A REGRESSION MODEL (status: under examination, published)

Publication number: US20100332286A1

Publication date: 2010-12-30

Application number: US12490662

Application date: 2009-06-24

Applicant: I. Dan MELAMED, Yeon-Jun KIM, Andrej LJOLJE, Bernard S. RENGER, David J. SMITH

Inventor: I. Dan MELAMED, Yeon-Jun KIM, Andrej LJOLJE, Bernard S. RENGER, David J. SMITH

IPC: G06Q10/00, G06N7/02

CPC classification number: G06Q30/0203, G06Q10/10, G06Q30/0245

Abstract: Predicting a score related to a communication sent by a sender over a communications network to a first agent servicing the communication includes obtaining a regression result for an objective function by encoding features extracted from the communication. The encoded features are applied to a regression model for the objective function. The regression result is output to a network component in the communications network. The regression model is determined prior to or concurrently with receiving the communication from the sender.

39.

Patent application
SYSTEM AND METHOD FOR SPEECH PERSONALIZATION BY NEED (status: in force)

Publication number: US20100312556A1

Publication date: 2010-12-09

Application number: US12480864

Application date: 2009-06-09

Applicant: Andrej LJOLJE, Alistair D. CONKIE, Ann K. SYRDAL

Inventor: Andrej LJOLJE, Alistair D. CONKIE, Ann K. SYRDAL

IPC: G10L15/06

CPC classification number: G10L15/07, G10L15/10, G10L15/265

Abstract: Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for speaker recognition personalization. The method recognizes speech received from a speaker interacting with a speech interface using a set of allocated resources, the set of allocated resources including bandwidth, processor time, memory, and storage. The method records metrics associated with the recognized speech, and after recording the metrics, modifies at least one of the allocated resources in the set of allocated resources commensurate with the recorded metrics. The method recognizes additional speech from the speaker using the modified set of allocated resources. Metrics can include a speech recognition confidence score, processing speed, dialog behavior, requests for repeats, negative responses to confirmations, and task completions. The method can further store a speaker personalization profile having information for the modified set of allocated resources and recognize speech associated with the speaker based on the speaker personalization profile.

40.

Patent application
SYSTEM AND METHOD FOR PRONUNCIATION MODELING (status: in force)

Publication number: US20100145707A1

Publication date: 2010-06-10

Application number: US12328407

Application date: 2008-12-04

Applicant: Andrej LJOLJE, Alistair D. Conkie, Ann K. Syrdal

Inventor: Andrej LJOLJE, Alistair D. Conkie, Ann K. Syrdal

IPC: G10L13/06

CPC classification number: G10L15/187, G10L15/183, G10L2015/025

Abstract: Disclosed herein are systems, computer-implemented methods, and tangible computer-readable media for generating a pronunciation model. The method includes identifying a generic model of speech composed of phonemes, identifying a family of interchangeable phonemic alternatives for a phoneme in the generic model of speech, labeling the family of interchangeable phonemic alternatives as referring to the same phoneme, and generating a pronunciation model which substitutes each family for each respective phoneme. In one aspect, the generic model of speech is a vocal tract length normalized acoustic model. Interchangeable phonemic alternatives can represent a same phoneme for different dialectal classes. An interchangeable phonemic alternative can include a string of phonemes.
