Patent search: ap:("AT&T INTELLECTUAL PROPERTY I, L.P.") AND inv:"Yeon-Jun Kim"

1.

Granted invention patent
Automated detection and filtering of audio advertisements (in force)

Publication number: US09703865B2

Publication date: 2017-07-11

Application number: US14865979

Application date: 2015-09-25

Applicant: AT&T INTELLECTUAL PROPERTY I, L.P.

Inventor: Yeon-Jun Kim , I. Dan Melamed , Bernard S. Renger , Steven Neil Tischer

IPC: G10L25/48 , G10L25/78 , G10L25/87 , G06F17/00 , G06F17/30 , G10L15/08 , G10L25/90

CPC classification number: G06F17/30761 , G06F17/00 , G10L15/083 , G10L25/48 , G10L25/78 , G10L25/87 , G10L25/90

Abstract: Apparatuses, systems, methods, and media for filtering a data stream are provided. The data stream is analyzed based on an acoustic parameter to determine extraneous portions in which a first predetermined condition is satisfied. When a first extraneous portion is separated from a second extraneous portion by a non-extraneous portion in which the first predetermined condition is not satisfied, it is determined whether the first extraneous portion being separated from the second extraneous portion by the non-extraneous portion satisfies a second predetermined condition. At least one of the first extraneous portion and the second extraneous portion is deleted from the data stream to produce a filtered data stream in response to determining the second predetermined condition is satisfied.

2.

Granted invention patent
Automatic disclosure detection (in force)

Publication number: US09607279B2

Publication date: 2017-03-28

Application number: US14686929

Application date: 2015-04-15

Applicant: AT&T INTELLECTUAL PROPERTY I, L.P.

Inventor: I. Dan Melamed , Andrej Ljolje , Bernard Renger , Yeon-Jun Kim , David J. Smith

IPC: G10L15/18 , G06Q10/06 , G10L15/04 , G06F17/28 , G10L15/26

CPC classification number: G10L25/63 , G06F17/2881 , G06Q10/06395 , G10L15/04 , G10L15/18 , G10L15/1822 , G10L15/26 , G10L15/265

Abstract: A method of detecting pre-determined phrases to determine compliance quality includes determining whether a precursor event has occurred based on a comparison between stored pre-determined phrases and a received communication, and determining a compliance rating based on a presence of a pre-determined phrase associated with the precursor event in the communication.
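The phrase-matching idea in this abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the function names, the rule table that maps a precursor phrase to its associated disclosure phrases, plain substring matching on a text transcript, and a rating defined as the fraction of associated phrases that were found.

```python
def normalize(text):
    """Lowercase and collapse whitespace so phrase matching is case-insensitive."""
    return " ".join(text.lower().split())


def compliance_rating(transcript, rules):
    """Rate a communication against stored phrases.

    rules (hypothetical structure): maps a precursor phrase to the list of
    pre-determined phrases that should appear once the precursor has occurred.
    Returns None when no precursor event is detected, otherwise the fraction
    of associated phrases found in the transcript (0.0 to 1.0).
    """
    text = normalize(transcript)
    required = []
    for precursor, disclosures in rules.items():
        # A precursor event is detected by comparing stored phrases with the text.
        if normalize(precursor) in text:
            required.extend(disclosures)
    if not required:
        return None
    found = sum(1 for phrase in required if normalize(phrase) in text)
    return found / len(required)


if __name__ == "__main__":
    rules = {
        "open a new account": [
            "this call may be recorded",
            "terms and conditions apply",
        ],
    }
    call = "I would like to open a new account. Sure, this call may be recorded."
    print(compliance_rating(call, rules))
```

A production system would presumably work on speech recognition output and tolerate recognition errors; exact substring matching is used here only to keep the sketch short.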

3.

Granted invention patent
Automatic disclosure detection (in force)

Publication number: US09037465B2

Publication date: 2015-05-19

Application number: US13772509

Application date: 2013-02-21

Applicant: AT&T Intellectual Property I, L.P.

Inventor: I. Dan Melamed , Andrej Ljolje , Bernard Renger , Yeon-Jun Kim , David J. Smith

IPC: G10L15/18 , G06F17/28 , G10L15/04 , G10L15/26

CPC classification number: G10L25/63 , G06F17/2881 , G06Q10/06395 , G10L15/04 , G10L15/18 , G10L15/1822 , G10L15/26 , G10L15/265

Abstract: A method of detecting pre-determined phrases to determine compliance quality is provided. The method includes determining whether at least one of an event or a precursor event has occurred based on a comparison between pre-determined phrases and a communication between a sender and a recipient in a communications network, and rating the recipient based on the presence of the pre-determined phrases associated with the event or the presence of the pre-determined phrases associated with the precursor event in the communication.

4.

Granted invention patent
Automated detection and filtering of audio advertisements (in force)

Publication number: US10146868B2

Publication date: 2018-12-04

Application number: US15617256

Application date: 2017-06-08

Applicant: AT&T INTELLECTUAL PROPERTY I, L.P.

Inventor: Yeon-Jun Kim , I. Dan Melamed , Bernard S. Renger , Steven Neil Tischer

IPC: G10L25/48 , G10L25/78 , G10L25/87 , G06F17/00 , G06F17/30 , G10L15/08 , G10L25/90

Abstract: Apparatuses, systems, methods, and media for filtering a data stream are provided. The data stream is partitioned into a plurality of data stream segments. An acoustic parameter is measured in each of the data stream segments. It is determined whether the acoustic parameter satisfies a first predetermined condition. The first predetermined condition includes a number of variances, in which the acoustic parameter exceeds a predetermined variance threshold, exceeding a predetermined number threshold. An extraneous portion of the data stream is identified in which the first predetermined condition is satisfied. It is determined whether the extraneous portion satisfies a second predetermined condition in the data stream. The extraneous portion is deleted from the data stream to produce a filtered data stream in response to the second predetermined condition being satisfied.
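The two-condition filtering described in this abstract can be sketched roughly as follows. The abstract does not say which acoustic parameter is measured, how variances are computed within a segment, or what the second condition is, so this sketch assumes all three: the input is a list of per-sample parameter values (for example pitch or energy), variances are taken over fixed-size frames inside each segment, and the second condition is that an extraneous run spans at least `min_run` consecutive segments. All names and thresholds are hypothetical.

```python
from statistics import pvariance


def is_extraneous(segment, frame_len, variance_threshold, count_threshold):
    """First condition: the number of frames whose variance exceeds
    variance_threshold must itself exceed count_threshold."""
    frames = [segment[i:i + frame_len] for i in range(0, len(segment), frame_len)]
    high = sum(1 for f in frames if len(f) > 1 and pvariance(f) > variance_threshold)
    return high > count_threshold


def filter_stream(values, segment_len, frame_len,
                  variance_threshold, count_threshold, min_run):
    """Partition the stream into segments, flag extraneous ones, and delete
    runs of flagged segments that satisfy the (assumed) second condition."""
    segments = [values[i:i + segment_len] for i in range(0, len(values), segment_len)]
    flags = [is_extraneous(s, frame_len, variance_threshold, count_threshold)
             for s in segments]
    kept = []
    i = 0
    while i < len(segments):
        # Find the end of the current run of equally flagged segments.
        j = i
        while j < len(segments) and flags[j] == flags[i]:
            j += 1
        # Assumed second condition: the extraneous run is at least min_run long.
        delete = flags[i] and (j - i) >= min_run
        if not delete:
            for s in segments[i:j]:
                kept.extend(s)
        i = j
    return kept


if __name__ == "__main__":
    quiet = [1.0] * 8
    loud = [0, 10, 0, 10, 0, 10, 0, 10] * 2
    stream = quiet + loud + quiet
    print(len(filter_stream(stream, 8, 4, 5, 1, 2)))
```

In this toy input the two middle segments fluctuate strongly, satisfy both conditions, and are removed, while the steady segments on either side are kept.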

5.

Granted invention patent
Automatic disclosure detection (in force)

Publication number: US09934792B2

Publication date: 2018-04-03

Application number: US15420477

Application date: 2017-01-31

Applicant: AT&T INTELLECTUAL PROPERTY I, L.P.

Inventor: I. Dan Melamed , Andrej Ljolje , Bernard S. Renger , David J. Smith , Yeon-Jun Kim

IPC: G10L15/18 , G10L25/63 , G10L15/26

CPC classification number: G10L25/63 , G06F17/2881 , G06Q10/06395 , G10L15/04 , G10L15/18 , G10L15/1822 , G10L15/26 , G10L15/265

Abstract: A method of detecting pre-determined phrases to determine compliance quality of an agent includes determining a presence of a predetermined input based on a comparison between stored pre-determined phrases and a received communication, and determining a compliance rating of the agent based on a presence of a pre-determined phrase associated with the predetermined input in the communication.

6.

Granted invention patent
System and method for automatic detection of abnormal stress patterns in unit selection synthesis (in force)

Publication number: US09269348B2

Publication date: 2016-02-23

Application number: US14628790

Application date: 2015-02-23

Applicant: AT&T Intellectual Property I, L.P.

Inventor: Yeon-Jun Kim , Mark Charles Beutnagel , Alistair D. Conkie , Ann K. Syrdal

IPC: G10L13/00 , G10L13/08 , G10L13/10 , G10L15/18 , G10L25/00

CPC classification number: G10L13/033 , G10L13/027 , G10L13/043 , G10L13/10 , G10L15/1807 , G10L25/00

Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for detecting and correcting abnormal stress patterns in unit-selection speech synthesis. A system practicing the method detects incorrect stress patterns in selected acoustic units representing speech to be synthesized, and corrects the incorrect stress patterns in the selected acoustic units to yield corrected stress patterns. The system can further synthesize speech based on the corrected stress patterns. In one aspect, the system also classifies the incorrect stress patterns using a machine learning algorithm such as a classification and regression tree, adaptive boosting, support vector machine, and maximum entropy. In this way a text-to-speech unit selection speech synthesizer can produce more natural sounding speech with suitable stress patterns regardless of the stress of units in a unit selection database.

7.

Granted invention patent
Systems, computer-implemented methods, and tangible computer-readable storage media for transcription alignment (in force)

Publication number: US10002612B2

Publication date: 2018-06-19

Application number: US15350339

Application date: 2016-11-14

Applicant: AT&T Intellectual Property I, L.P.

Inventor: Yeon-Jun Kim , David C. Gibbon , Horst J. Schroeter

IPC: G10L15/00 , G10L15/26 , G11B27/10 , G10L21/06 , G10L13/08 , G10L21/055 , H04N21/44 , H04N21/488 , G10L25/51

CPC classification number: G10L15/265 , G10L13/08 , G10L15/26 , G10L21/055 , G10L21/06 , G10L25/51 , G11B27/10 , H04M3/42391 , H04M2201/14 , H04M2201/22 , H04M2201/40 , H04M2203/305 , H04N21/44004 , H04N21/4884

Abstract: Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for captioning a media presentation. The method includes receiving automatic speech recognition (ASR) output from a media presentation and a transcription of the media presentation. The method includes selecting via a processor a pair of anchor words in the media presentation based on the ASR output and transcription and generating captions by aligning the transcription with the ASR output between the selected pair of anchor words. The transcription can be human-generated. Selecting pairs of anchor words can be based on a similarity threshold between the ASR output and the transcription. In one variation, commonly used words on a stop list are ineligible as anchor words. The method includes outputting the media presentation with the generated captions. The presentation can be a recording of a live event.
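The anchor-word alignment described in this abstract might be sketched as below. This is an illustration under stated assumptions, not the patented method: ASR output is assumed to be a list of (word, start time) pairs, anchors are found with exact word matches via Python's `difflib.SequenceMatcher` rather than a similarity threshold, the stop list is a small made-up set, and transcript words between two anchors receive linearly interpolated times.

```python
from difflib import SequenceMatcher

# Hypothetical stop list: common words that may not serve as anchors.
STOP_WORDS = {"the", "a", "and", "of", "to"}


def find_anchors(asr_words, transcript_words, stop_words=STOP_WORDS):
    """Return (asr_index, transcript_index) pairs for words that match in
    both sequences and are not on the stop list."""
    a = [word.lower() for word, _ in asr_words]
    b = [word.lower() for word in transcript_words]
    matcher = SequenceMatcher(a=a, b=b, autojunk=False)
    anchors = []
    for block in matcher.get_matching_blocks():
        for k in range(block.size):
            if a[block.a + k] not in stop_words:
                anchors.append((block.a + k, block.b + k))
    return anchors


def caption_times(asr_words, transcript_words):
    """Give each transcript word a time by interpolating between consecutive
    anchor pairs. Words outside the first and last anchor keep None."""
    anchors = find_anchors(asr_words, transcript_words)
    times = [None] * len(transcript_words)
    for (a0, b0), (a1, b1) in zip(anchors, anchors[1:]):
        t0, t1 = asr_words[a0][1], asr_words[a1][1]
        for j in range(b0, b1 + 1):
            times[j] = t0 + (t1 - t0) * (j - b0) / (b1 - b0)
    return list(zip(transcript_words, times))


if __name__ == "__main__":
    asr = [("welcome", 0.0), ("to", 0.5), ("de", 1.0), ("show", 2.0)]
    transcript = ["Welcome", "to", "the", "show"]
    for word, t in caption_times(asr, transcript):
        print(word, t)
```

Here the misrecognized word "de" is replaced by the transcript's "the", which inherits a time between the anchors "welcome" and "show".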

8.

Granted invention patent
System and method for generalized preselection for unit selection synthesis (in force)

Publication number: US09564121B2

Publication date: 2017-02-07

Application number: US14454123

Application date: 2014-08-07

Applicant: AT&T Intellectual Property I, L.P.

Inventor: Alistair D. Conkie , Mark Beutnagel , Yeon-Jun Kim , Ann K. Syrdal

IPC: G10L13/06 , G10L13/047 , G10L13/00

CPC classification number: G10L13/06 , G10L13/00 , G10L13/047

Abstract: Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for unit selection synthesis. The method causes a computing device to add a supplemental phoneset to a speech synthesizer front end having an existing phoneset, modify a unit preselection process based on the supplemental phoneset, preselect units from the supplemental phoneset and the existing phoneset based on the modified unit preselection process, and generate speech based on the preselected units. The supplemental phoneset can be a variation of the existing phoneset, can include a word boundary feature, can include a cluster feature where initial consonant clusters and some word boundaries are marked with diacritics, can include a function word feature which marks units as originating from a function word or a content word, and/or can include a pre-vocalic or post-vocalic feature. The speech synthesizer front end can incorporate the supplemental phoneset as an extra feature.

9.

Granted invention patent
Systems, computer-implemented methods, and tangible computer-readable storage media for transcription alignment (in force)

Publication number: US09495964B2

Publication date: 2016-11-15

Application number: US15071644

Application date: 2016-03-16

Applicant: AT&T Intellectual Property I, L.P.

Inventor: Yeon-Jun Kim , David C. Gibbon , Horst J. Schroeter

IPC: G10L15/00 , G10L15/26 , H04N21/488 , G10L21/055 , G10L13/08 , H04N21/44 , G11B27/10

CPC classification number: G10L15/265 , G10L13/08 , G10L15/26 , G10L21/055 , G10L21/06 , G10L25/51 , G11B27/10 , H04N21/44004 , H04N21/4884

Abstract: Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for captioning a media presentation. The method includes receiving automatic speech recognition (ASR) output from a media presentation and a transcription of the media presentation. The method includes selecting via a processor a pair of anchor words in the media presentation based on the ASR output and transcription and generating captions by aligning the transcription with the ASR output between the selected pair of anchor words. The transcription can be human-generated. Selecting pairs of anchor words can be based on a similarity threshold between the ASR output and the transcription. In one variation, commonly used words on a stop list are ineligible as anchor words. The method includes outputting the media presentation with the generated captions. The presentation can be a recording of a live event.

10.

Granted invention patent
Systems, computer-implemented methods, and tangible computer-readable storage media for transcription alignment (in force)

Publication number: US09305552B2

Publication date: 2016-04-05

Application number: US14492616

Application date: 2014-09-22

Applicant: AT&T Intellectual Property I, L.P.

Inventor: Yeon-Jun Kim , David C. Gibbon , Horst J. Schroeter

IPC: G10L15/00 , G10L15/26 , G11B27/10 , G10L21/06

CPC classification number: G10L15/265 , G10L13/08 , G10L15/26 , G10L21/055 , G10L21/06 , G10L25/51 , G11B27/10 , H04N21/44004 , H04N21/4884

Abstract: Disclosed herein are systems, computer-implemented methods, and tangible computer-readable storage media for captioning a media presentation. The method includes receiving automatic speech recognition (ASR) output from a media presentation and a transcription of the media presentation. The method includes selecting via a processor a pair of anchor words in the media presentation based on the ASR output and transcription and generating captions by aligning the transcription with the ASR output between the selected pair of anchor words. The transcription can be human-generated. Selecting pairs of anchor words can be based on a similarity threshold between the ASR output and the transcription. In one variation, commonly used words on a stop list are ineligible as anchor words. The method includes outputting the media presentation with the generated captions. The presentation can be a recording of a live event.
