Patent search ap:("AT&T Intellectual Property II Page L.P.") AND inv:"Mazin GILBERT"

1.

发明申请
METHOD AND APPARATUS FOR IDENTIFYING ACOUSTIC BACKGROUND ENIRONMENTS BASED ON TIME AND SPEECH TO ENHANCE AUTOMATIC SPEECH RECOGNITION 有权
Title translation: 基于时间和语音识别声音背景的方法和装置，以增强自动语音识别

公开(公告)号：US20160275948A1

公开(公告)日：2016-09-22

申请号：US15171177

申请日：2016-06-02

Applicant: AT&T Intellectual Property II, L.P.

Inventor： Mazin GILBERT

IPC: G10L15/20 , G10L15/07 , G10L15/26 , G10L21/0216

CPC classification number: G10L15/20 , G10L15/065 , G10L15/07 , G10L15/08 , G10L15/26 , G10L15/30 , G10L21/0216 , G11B27/034

Abstract: Disclosed are systems, methods, and computer readable media for identifying an acoustic environment of a caller. The method embodiment comprises analyzing acoustic features of a received audio signal from a caller, receiving meta-data information, classifying a background environment of the caller based on the analyzed acoustic features and the meta-data, selecting an acoustic model matched to the classified background environment from a plurality of acoustic models, and performing speech recognition as the received audio signal using the selected acoustic model.

Abstract translation: 公开了用于识别呼叫者的声学环境的系统，方法和计算机可读介质。方法实施例包括分析来自呼叫者的接收音频信号的声学特征，接收元数据信息，基于分析的声学特征和元数据对呼叫者的背景环境进行分类，选择与分类背景相匹配的声学模型环境，并且使用所选择的声学模型来执行语音识别作为所接收的音频信号。

2.

发明申请
System and Method for Controlling Presentations Using a Multimodal Interface 审中-公开
Title translation: 使用多模态界面控制演示的系统和方法

公开(公告)号：US20150178044A1

公开(公告)日：2015-06-25

申请号：US14641762

申请日：2015-03-09

Applicant: AT&T Intellectual Property II, L.P.

Inventor： Patrick EHLEN , David Crawford GIBBON , Mazin GILBERT , Michael JOHNSTON , Zhu LIU , Behzad SHAHRARAY

IPC: G06F3/16 , G06F3/0482 , G06F3/0484

Abstract: The invention provides for a system, method, and computer readable medium storing instructions related to controlling a presentation in a multimodal system. The method embodiment of the invention is a method for the retrieval of information on the basis of its content for real-time incorporation into an electronic presentation. The method comprises receiving from a presenter a content-based request for at least one segment of a first plurality of segments within a media presentation and while displaying the media presentation to an audience, displaying to the presenter a second plurality of segments in response to the content-based request. The computing device practicing the method receives a selection from the presenter of a segment from the second plurality of segments and displays to the audience the selected segment.

Abstract translation: 本发明提供了一种系统，方法和计算机可读介质，其存储与在多模式系统中控制演示有关的指令。本发明的方法实施例是一种用于基于其内容来检索信息以便实时并入到电子呈现中的方法。所述方法包括从演示者接收基于内容的对媒体呈现内的第一多个片段的至少一个片段的请求，并且在向观众显示媒体呈现的同时，向演示者显示响应于第基于内容的请求。实施该方法的计算设备从第二多个段接收来自段的演示者的选择，并向观众显示所选择的段。

3.

发明申请
SYSTEM AND METHOD FOR USING SPEECH FOR DATA SEARCHING DURING PRESENTATIONS 审中-公开
Title translation: 在演讲中使用语音进行数据搜索的系统和方法

公开(公告)号：US20170075656A1

公开(公告)日：2017-03-16

申请号：US15341052

申请日：2016-11-02

Applicant: AT&T Intellectual Property II, L.P.

Inventor： Patrick EHLEN , David Crawford GIBBON , Mazin GILBERT , Michael JOHNSTON , Zhu LIU , Behzad SHAHRARAY

IPC: G06F3/16 , G06F3/0482 , G06F17/28 , G06F17/30 , G06F3/0484

CPC classification number: G06F3/167 , G06F3/038 , G06F3/0481 , G06F3/0482 , G06F3/04842 , G06F3/04883 , G06F16/248 , G06F16/435 , G06F16/951 , G06F16/9535 , G06F17/28 , G06F2203/0381 , G10L15/18

Abstract: There is provided for a system, method, and computer readable medium storing instructions related to controlling a presentation in a multimodal system. A method for the retrieval of information on the basis of its content for real-time incorporation into an electronic presentation is discussed. One method includes controlling a media presentation using a multimodal interface. The method involves receiving from a presenter a content-based request associated with a plurality of segments within a media presentation preprocessed for context-based searching; displaying the media presentation and displaying to the presenter results in response to the content-based request; receiving a selection from the presenter of at least one result; and displaying the selected result to an audience.

Abstract translation: 提供了一种用于存储与在多模式系统中控制呈现相关的指令的系统，方法和计算机可读介质。讨论了一种基于其内容来检索信息以便实时并入到电子演示中的方法。一种方法包括使用多模式界面来控制媒体呈现。该方法包括从演示者接收与预处理用于基于上下文的搜索的媒体表现中的与多个片段相关联的基于内容的请求; 响应于基于内容的请求，向演示者显示媒体呈现和显示结果; 从演示者接收至少一个结果的选择; 并将所选结果显示给观众。

4.

发明申请
SYSTEM AND METHOD OF EXTRACTING CLAUSES FOR SPOKEN LANGUAGE UNDERSTANDING 有权

公开(公告)号：US20160026618A1

公开(公告)日：2016-01-28

申请号：US14877272

申请日：2015-10-07

Applicant: AT&T Intellectual Property II, L.P.

Inventor： Srinivas BANGALORE , Narendra K. GUPTA , Mazin GILBERT

IPC: G06F17/27 , G10L15/26 , G06F17/21 , G10L15/05

CPC classification number: G06F17/2705 , G06F17/218 , G06F17/27 , G06F17/2775 , G10L15/05 , G10L15/26 , G10L2015/025 , G10L2015/081 , G10L2015/088

Abstract: A clausifier and method of extracting clauses for spoken language understanding are disclosed. The method relates to generating a set of clauses from speech utterance text and comprises inserting at least one boundary tag in speech utterance text related to sentence boundaries, inserting at least one edit tag indicating a portion of the speech utterance text to remove, and inserting at least one conjunction tag within the speech utterance text. The result is a set of clauses that may be identified within the speech utterance text according to the inserted at least one boundary tag, at least one edit tag and at least one conjunction tag. The disclosed clausifier comprises a sentence boundary classifier, an edit detector classifier, and a conjunction detector classifier. The clausifier may comprise a single classifier or a plurality of classifiers to perform the steps of identifying sentence boundaries, editing text, and identifying conjunctions within the text.

5.

发明申请
System and Method for Using Speech for Data Searching During Presentations 有权
Title translation: 演示期间使用语音进行数据搜索的系统和方法

公开(公告)号：US20150379094A1

公开(公告)日：2015-12-31

申请号：US14833645

申请日：2015-08-24

Applicant: AT&T Intellectual Property II, L.P.

Inventor： Patrick EHLEN , David Crawford GIBBON , Mazin GILBERT , Michael JOHNSTON , Zhu LIU , Behzad SHAHRARAY

IPC: G06F17/30 , G10L15/18 , G06F17/28 , G06F3/16 , G06F3/0484 , G06F3/0482

CPC classification number: G06F3/167 , G06F3/038 , G06F3/0481 , G06F3/0482 , G06F3/04842 , G06F3/04883 , G06F17/28 , G06F17/30029 , G06F17/30554 , G06F17/30864 , G06F17/30867 , G06F2203/0381 , G10L15/18

Abstract: There is provided for a system, method, and computer readable medium storing instructions related to controlling a presentation in a multimodal system. A method for the retrieval of information on the basis of its content for real-time incorporation into an electronic presentation is discussed. One method includes controlling a media presentation using a multimodal interface. The method involves receiving from a presenter a content-based request associated with a plurality of segments within a media presentation preprocessed for context-based searching; displaying the media presentation and displaying to the presenter results in response to the content-based request; receiving a selection from the presenter of at least one result; and displaying the selected result to an audience.

Abstract translation: 提供了一种用于存储与在多模式系统中控制呈现相关的指令的系统，方法和计算机可读介质。讨论了一种基于其内容来检索信息以便实时并入到电子演示中的方法。一种方法包括使用多模式界面来控制媒体呈现。该方法包括从演示者接收与预处理用于基于上下文的搜索的媒体表现中的与多个片段相关联的基于内容的请求; 响应于基于内容的请求，向演示者显示媒体呈现和显示结果; 从演示者接收至少一个结果的选择; 并将所选结果显示给观众。

6.

发明申请
METHOD AND SYSTEM FOR PROVIDING AN AUTOMATED WEB TRANSCRIPTION SERVICE 有权
Title translation: 提供自动WEB转录服务的方法和系统

公开(公告)号：US20140316780A1

公开(公告)日：2014-10-23

申请号：US14321932

申请日：2014-07-02

Applicant: AT&T Intellectual Property II, L.P.

Inventor： Mazin GILBERT , Stephan KANTHAK

IPC: G10L15/26

CPC classification number: G10L15/26 , G06F17/30893 , G10L15/265

Abstract: A system, method and computer readable medium that provides an automated web transcription service is disclosed. The method may include receiving input speech from a user using a communications network, recognizing the received input speech, understanding the recognized speech, transcribing the understood speech to text, storing the transcribed text in a database, receiving a request via a web page to display the transcribed text, retrieving transcribed text from the database, and displaying the transcribed text to the requester using the web page.

Abstract translation: 公开了一种提供自动网页转录服务的系统，方法和计算机可读介质。该方法可以包括使用通信网络从用户接收输入语音，识别接收的输入语音，理解识别的语音，将理解的语音转录为文本，将转录的文本存储在数据库中，经由网页接收请求以显示转录的文本，从数据库检索转录的文本，以及使用网页将转录的文本显示给请求者。

7.

发明申请
LEARNING FROM INTERACTIONS FOR A SPOKEN DIALOG SYSTEM 审中-公开

公开(公告)号：US20170213546A1

公开(公告)日：2017-07-27

申请号：US15483213

申请日：2017-04-10

Applicant: AT&T Intellectual Property II, L.P.

Inventor： Mazin GILBERT , Esther LEVIN , Michael Lederman LITTMAN , Robert E. Schapire

IPC: G10L15/065 , G10L15/18 , G10L15/06

CPC classification number: G10L15/065 , G06F17/2785 , G10L15/063 , G10L15/1815 , G10L15/26

Abstract: In one embodiment, a semantic classifier input and a corresponding label attributed to the semantic classifier input may be obtained. A determination may be made whether the corresponding label is correct based on logged interaction data. An entry of an adaptation corpus may be generated based on a result of the determination. Operation of the semantic classifier may be adapted based on the adaptation corpus.

8.

发明申请
Method and Apparatus for Identifying Acoustic Background Environments Based on Time and Speed to Enhance Automatic Speech Recognition 有权
Title translation: 基于时间和速度识别声学背景环境以增强自动语音识别的方法和装置

公开(公告)号：US20140303972A1

公开(公告)日：2014-10-09

申请号：US14312116

申请日：2014-06-23

Applicant: AT&T Intellectual Property II, L.P.

Inventor： Mazin GILBERT

IPC: G10L15/20

CPC classification number: G10L15/20 , G10L15/065 , G10L15/07 , G10L15/08 , G10L15/26 , G10L15/30 , G10L21/0216 , G11B27/034

Abstract: Disclosed are systems, methods, and computer readable media for identifying an acoustic environment of a caller. The method embodiment comprises analyzing acoustic features of a received audio signal from a caller, receiving meta-data information, classifying a background environment of the caller based on the analyzed acoustic features and the meta-data, selecting an acoustic model matched to the classified background environment from a plurality of acoustic models, and performing speech recognition as the received audio signal using the selected acoustic model.

Abstract translation: 公开了用于识别呼叫者的声学环境的系统，方法和计算机可读介质。方法实施例包括分析来自呼叫者的接收音频信号的声学特征，接收元数据信息，基于分析的声学特征和元数据对呼叫者的背景环境进行分类，选择与分类背景相匹配的声学模型环境，并且使用所选择的声学模型来执行语音识别作为所接收的音频信号。

9.

发明申请
On-Demand Language Translation for Television Programs 审中-公开

公开(公告)号：US20180046617A1

公开(公告)日：2018-02-15

申请号：US15797656

申请日：2017-10-30

Applicant: AT&T Intellectual Property II, L.P.

Inventor： Srinivas BANGALORE , David Crawford GIBBON , Mazin GILBERT , Patrick Guy HAFFNER , Zhu LIU , Behzad SHAHRARAY

IPC: G06F17/28

CPC classification number: G06F17/289

Abstract: In an embodiment, a method of providing an on demand translation service is provided. A subscriber may be charged a reduced fee or no fee for use of the on demand translation service in exchange for displaying commercial messages to the subscriber, the commercial messages being selected based on subscriber information. A multimedia signal including information in a source language may be received. The information may be obtained as text in the source language from the multimedia signal. The text may be translated from the source language to a target language. Translated information, based on the translated text, may be transmitted to a processing device for presentation to the subscriber. The received multimedia signal may be sent to a multimedia device for viewing.

10.

发明申请
SYSTEM AND METHOD FOR GENERATING CUSTOMIZED TEXT-TO-SPEECH VOICES 有权
Title translation: 用于生成定制的文本到语音的系统和方法

公开(公告)号：US20160093287A1

公开(公告)日：2016-03-31

申请号：US14965251

申请日：2015-12-10

Applicant: AT&T Intellectual Property II, L.P.

Inventor： Srinivas BANGALORE , Junlan FENG , Mazin GILBERT , Juergen SCHROETER , Ann K. SYRDAL , David SCHULZ

IPC: G10L13/033 , G10L15/197

CPC classification number: G10L13/033 , G10L13/00 , G10L13/02 , G10L13/06 , G10L13/08 , G10L15/197

Abstract: A system and method are disclosed for generating customized text-to-speech voices for a particular application. The method comprises generating a custom text-to-speech voice by selecting a voice for generating a custom text-to-speech voice associated with a domain, collecting text data associated with the domain from a pre-existing text data source and using the collected text data, generating an in-domain inventory of synthesis speech units by selecting speech units appropriate to the domain via a search of a pre-existing inventory of synthesis speech units, or by recording the minimal inventory for a selected level of synthesis quality. The text-to-speech custom voice for the domain is generated utilizing the in-domain inventory of synthesis speech units. Active learning techniques may also be employed to identify problem phrases wherein only a few minutes of recorded data is necessary to deliver a high quality TTS custom voice.

Abstract translation: 公开了用于为特定应用产生定制的文本到语音语音的系统和方法。该方法包括通过选择用于生成与域相关联的自定义文本到语音语音的语音来生成自定义文本到语音语音，从预先存在的文本数据源收集与域相关联的文本数据，并使用收集的文本数据，通过搜索合成语音单元的预先存在的库存来选择适合于该域的语音单元，或者通过记录所选合成质量水平的最小库存来生成合成语音单元的域内库存。使用合成语音单元的域内库存来生成域的文本到语音定制语音。还可以使用主动学习技术来识别问题短语，其中只需要几分钟的记录数据来传送高质量的TTS定制语音。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification