Abstract:
Disclosed herein are systems, methods, and computer-readable storage media for detecting voice activity in a media signal using an augmented, multi-tier classifier architecture. A system configured to practice the method can receive, from a first classifier, a first voice activity indicator detected in a first modality for a human subject. Then, the system can receive, from a second classifier, a second voice activity indicator detected in a second modality for the human subject, wherein the first voice activity indicator and the second voice activity indicator are based on the human subject at a same time, and wherein the first modality and the second modality are different. The system can concatenate, via a third classifier, the first voice activity indicator and the second voice activity indicator with original features of the human subject, to yield a classifier output, and determine voice activity based on the classifier output.
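A minimal sketch of this multi-tier arrangement is shown below, assuming scikit-learn-style classifiers and synthetic stand-in features for two modalities (the feature names and classifier choices are illustrative assumptions, not part of the disclosure):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Hypothetical per-frame features for one human subject, captured at the
# same time in two different modalities (e.g., audio and video).
rng = np.random.default_rng(0)
audio_feats = rng.normal(size=(200, 13))   # stand-in for audio features
visual_feats = rng.normal(size=(200, 8))   # stand-in for visual features
labels = rng.integers(0, 2, size=200)      # 1 = voice activity, 0 = silence

# First-tier classifiers: one per modality, each yielding a voice activity
# indicator (here, a posterior probability of speech).
clf_audio = LogisticRegression(max_iter=1000).fit(audio_feats, labels)
clf_visual = LogisticRegression(max_iter=1000).fit(visual_feats, labels)
p_audio = clf_audio.predict_proba(audio_feats)[:, 1:]
p_visual = clf_visual.predict_proba(visual_feats)[:, 1:]

# Third classifier: concatenate both indicators with the original features
# of the subject (the "augmented" input) and decide voice activity.
augmented = np.hstack([p_audio, p_visual, audio_feats, visual_feats])
clf_fusion = LogisticRegression(max_iter=1000).fit(augmented, labels)
voice_activity = clf_fusion.predict(augmented)  # final classifier output
```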
Abstract:
Speaker content generated in an audio conference is selectively visually represented. A profile for each audience member who participates in the audio conference is obtained. Speaker content spoken during the audio conference is monitored. Words of the speaker content are classified to have different weights according to a parameter of the profile for each of the audience members. A relation between the speaker content and the profile for each of the audience members is determined. Different visual representations of the speaker content are presented to different ones of the audience members based on the determined relation.
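One way the weighting and relation steps might look, as a rough sketch with a made-up profile parameter (an `interests` keyword set) and a simple overlap score; none of these names or weights come from the abstract:

```python
from collections import Counter

# Hypothetical audience profiles; 'interests' is an assumed profile parameter.
profiles = {
    "alice": {"interests": {"budget", "revenue", "forecast"}},
    "bob":   {"interests": {"hiring", "roadmap", "design"}},
}

def weight_words(speaker_content: str, profile: dict) -> Counter:
    """Classify words of the speaker content with different weights
    according to a parameter of the audience member's profile."""
    weights = Counter()
    for word in speaker_content.lower().split():
        # Words matching the member's interests get a higher weight.
        weights[word] += 3 if word in profile["interests"] else 1
    return weights

def relation_score(weights: Counter, profile: dict) -> float:
    """Determine a relation between the speaker content and the profile as
    the fraction of total weight carried by profile-relevant words."""
    total = sum(weights.values())
    relevant = sum(w for word, w in weights.items()
                   if word in profile["interests"])
    return relevant / total if total else 0.0

content = "the revenue forecast depends on the hiring roadmap"
for member, profile in profiles.items():
    score = relation_score(weight_words(content, profile), profile)
    # Present a different visual representation per member, e.g. highlight
    # only the words that relate to that member's profile.
    highlighted = [w.upper() if w in profile["interests"] else w
                   for w in content.split()]
    print(member, round(score, 2), " ".join(highlighted))
```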
Abstract:
Speaker content generated in an audio conference is selectively visually represented. A profile for each audience member who listens to the audio conference is obtained. Speaker content from audio conference participants who speak in the audio conference is monitored. The speaker content from each of the audio conference participants is analyzed. Based on the analyzing and on the profiles for each of the audience members, visual representations of the speaker content to present to the audience members are identified. Visual representations of the speaker content are generated based on the analyzing. Different visual representations of the speaker content are presented to different audience members based on the analyzing and identifying.
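A sketch of the identification and generation steps under assumed profile fields (`detail_preference`) and representation types (keyword cloud, running transcript); these are illustrative assumptions only:

```python
# Hypothetical profiles used to select different visual representations
# of the same speaker content for different audience members.
profiles = {
    "carol": {"detail_preference": "summary"},
    "dave":  {"detail_preference": "full"},
}

def analyze(speaker_content: str) -> dict:
    """Analyze speaker content from a conference participant
    (word counts stand in for richer analysis here)."""
    words = speaker_content.split()
    return {"word_count": len(words), "words": words}

def identify_representation(analysis: dict, profile: dict) -> str:
    """Identify which visual representation to present to this member,
    based on the analysis and the member's profile."""
    if profile["detail_preference"] == "summary" or analysis["word_count"] > 50:
        return "keyword_cloud"
    return "running_transcript"

def generate_representation(kind: str, analysis: dict) -> str:
    """Generate the identified visual representation of the speaker content."""
    if kind == "keyword_cloud":
        return "CLOUD: " + " ".join(sorted(set(analysis["words"]))[:5])
    return "TRANSCRIPT: " + " ".join(analysis["words"])

content = "quarterly results exceeded the plan across all regions"
analysis = analyze(content)
for member, profile in profiles.items():
    kind = identify_representation(analysis, profile)
    print(member, "->", generate_representation(kind, analysis))
```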