METHOD AND APPARATUS FOR IDENTIFYING ACOUSTIC BACKGROUND ENVIRONMENTS BASED ON TIME AND SPEECH TO ENHANCE AUTOMATIC SPEECH RECOGNITION
    1.
    Invention application (status: granted)

    Publication number: US20160275948A1

    Publication date: 2016-09-22

    Application number: US15171177

    Filing date: 2016-06-02

    Inventor: Mazin GILBERT

    Abstract: Disclosed are systems, methods, and computer readable media for identifying an acoustic environment of a caller. The method embodiment comprises analyzing acoustic features of a received audio signal from a caller, receiving meta-data information, classifying a background environment of the caller based on the analyzed acoustic features and the meta-data, selecting an acoustic model matched to the classified background environment from a plurality of acoustic models, and performing speech recognition on the received audio signal using the selected acoustic model.
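
    The classify-then-select flow in this abstract can be illustrated with a small sketch. It is not the patented method: the two-dimensional feature space, the centroid values, the penalty rule, and the model names are all hypothetical, and the use of call time and caller speed as the meta-data is an assumption made for illustration.

```python
from dataclasses import dataclass
from math import dist

# Hypothetical environment centroids in a toy 2-D feature space
# (mean energy, spectral flatness). The numbers are illustrative only.
CENTROIDS = {
    "car": (0.70, 0.40),
    "office": (0.20, 0.10),
    "street": (0.90, 0.80),
}

# Hypothetical mapping from environment label to an acoustic model name.
ACOUSTIC_MODELS = {"car": "am_car", "office": "am_office", "street": "am_street"}


@dataclass
class Metadata:
    hour: int          # local hour of the call, 0-23 (assumed meta-data field)
    speed_kmh: float   # caller speed in km/h (assumed meta-data field)


def metadata_penalty(env: str, meta: Metadata) -> float:
    """Return a distance penalty; a lower value means the meta-data favours env."""
    penalty = 0.0
    if env == "car" and meta.speed_kmh < 20:
        penalty += 0.5   # a slow or stationary caller is unlikely to be driving
    if env == "office" and (meta.speed_kmh >= 20 or not 8 <= meta.hour < 18):
        penalty += 0.5   # fast movement or off-hours makes an office less likely
    return penalty


def classify_environment(features: tuple, meta: Metadata) -> str:
    """Pick the environment with the smallest acoustic distance plus penalty."""
    return min(
        CENTROIDS,
        key=lambda env: dist(features, CENTROIDS[env]) + metadata_penalty(env, meta),
    )


def select_acoustic_model(features: tuple, meta: Metadata) -> str:
    """Return the name of the acoustic model matched to the classified environment."""
    return ACOUSTIC_MODELS[classify_environment(features, meta)]
```

    In this sketch the same acoustic features can resolve to different models depending on the meta-data: a feature vector midway between the "car" and "office" centroids selects the car model when the caller is moving quickly and the office model when the caller is stationary during working hours.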

System and Method for Controlling Presentations Using a Multimodal Interface
    2.
    Invention application (status: under examination, published)

    Publication number: US20150178044A1

    Publication date: 2015-06-25

    Application number: US14641762

    Filing date: 2015-03-09

    Abstract: The invention provides for a system, method, and computer readable medium storing instructions related to controlling a presentation in a multimodal system. The method embodiment of the invention is a method for the retrieval of information on the basis of its content for real-time incorporation into an electronic presentation. The method comprises receiving from a presenter a content-based request for at least one segment of a first plurality of segments within a media presentation and while displaying the media presentation to an audience, displaying to the presenter a second plurality of segments in response to the content-based request. The computing device practicing the method receives a selection from the presenter of a segment from the second plurality of segments and displays to the audience the selected segment.
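
    The request, presenter-only result list, and audience display described here can be sketched as follows. This is a minimal illustration under stated assumptions: the `Segment` and `Presentation` classes are hypothetical, and plain word overlap stands in for whatever content-based retrieval the application actually uses.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    index: int
    text: str


def search_segments(segments, request):
    """Return segments ranked by how many request words they contain."""
    words = set(request.lower().split())
    scored = []
    for seg in segments:
        overlap = len(words & set(seg.text.lower().split()))
        if overlap:
            scored.append((overlap, seg))
    # Highest overlap first; ties keep the original presentation order.
    scored.sort(key=lambda pair: (-pair[0], pair[1].index))
    return [seg for _, seg in scored]


class Presentation:
    """Keeps the audience view separate from the presenter's result list."""

    def __init__(self, segments):
        self.segments = segments
        self.audience_view = segments[0]   # what the audience currently sees
        self.presenter_results = []        # shown to the presenter only

    def request(self, text):
        """Handle a content-based request without changing the audience view."""
        self.presenter_results = search_segments(self.segments, text)
        return self.presenter_results

    def select(self, position):
        """Show the presenter's chosen result to the audience."""
        self.audience_view = self.presenter_results[position]
        return self.audience_view
```

    The design point is that `request` changes only what the presenter sees; the audience view changes only when `select` is called.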

SYSTEM AND METHOD FOR USING SPEECH FOR DATA SEARCHING DURING PRESENTATIONS
    3.
    Invention application (status: under examination, published)

    Publication number: US20170075656A1

    Publication date: 2017-03-16

    Application number: US15341052

    Filing date: 2016-11-02

    Abstract: Provided are a system, method, and computer readable medium storing instructions related to controlling a presentation in a multimodal system. A method for the retrieval of information on the basis of its content for real-time incorporation into an electronic presentation is discussed. One method includes controlling a media presentation using a multimodal interface. The method involves receiving from a presenter a content-based request associated with a plurality of segments within a media presentation preprocessed for context-based searching; displaying the media presentation and displaying to the presenter results in response to the content-based request; receiving a selection from the presenter of at least one result; and displaying the selected result to an audience.

METHOD AND SYSTEM FOR PROVIDING AN AUTOMATED WEB TRANSCRIPTION SERVICE
    6.
    Invention application (status: granted)

    Publication number: US20140316780A1

    Publication date: 2014-10-23

    Application number: US14321932

    Filing date: 2014-07-02

    CPC classification number: G10L15/26 G06F17/30893 G10L15/265

    Abstract: A system, method and computer readable medium that provides an automated web transcription service is disclosed. The method may include receiving input speech from a user using a communications network, recognizing the received input speech, understanding the recognized speech, transcribing the understood speech to text, storing the transcribed text in a database, receiving a request via a web page to display the transcribed text, retrieving transcribed text from the database, and displaying the transcribed text to the requester using the web page.
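
    The store-then-retrieve part of this pipeline can be sketched with Python's standard `sqlite3` module. Everything below is an assumption for illustration: `recognize` is a placeholder that simply decodes bytes instead of running speech recognition, and the `TranscriptStore` class, table layout, and HTML output are hypothetical rather than taken from the application.

```python
import html
import sqlite3


def recognize(audio: bytes) -> str:
    """Placeholder recognizer: a real service would run speech recognition here."""
    return audio.decode("utf-8")


class TranscriptStore:
    """Stores transcribed text and renders it for a web request."""

    def __init__(self, path=":memory:"):
        self.db = sqlite3.connect(path)
        self.db.execute(
            "CREATE TABLE IF NOT EXISTS transcripts ("
            "id INTEGER PRIMARY KEY, user TEXT NOT NULL, text TEXT NOT NULL)"
        )

    def transcribe_and_store(self, user: str, audio: bytes) -> int:
        """Transcribe the input and store the text; return the new row id."""
        text = recognize(audio)
        cur = self.db.execute(
            "INSERT INTO transcripts (user, text) VALUES (?, ?)", (user, text)
        )
        self.db.commit()
        return cur.lastrowid

    def render_page(self, transcript_id: int) -> str:
        """Retrieve a transcript and return a small HTML fragment for display."""
        row = self.db.execute(
            "SELECT user, text FROM transcripts WHERE id = ?", (transcript_id,)
        ).fetchone()
        if row is None:
            return "<p>Transcript not found.</p>"
        user, text = row
        # Escape stored text so it cannot inject markup into the page.
        return f"<h1>Transcript for {html.escape(user)}</h1><p>{html.escape(text)}</p>"
```

    A web handler would call `render_page` with the id from the request; parameterized queries and HTML escaping keep user-supplied text from altering the SQL or the page.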

Method and Apparatus for Identifying Acoustic Background Environments Based on Time and Speed to Enhance Automatic Speech Recognition
    8.
    Invention application (status: granted)

    Publication number: US20140303972A1

    Publication date: 2014-10-09

    Application number: US14312116

    Filing date: 2014-06-23

    Inventor: Mazin GILBERT

    Abstract: Disclosed are systems, methods, and computer readable media for identifying an acoustic environment of a caller. The method embodiment comprises analyzing acoustic features of a received audio signal from a caller, receiving meta-data information, classifying a background environment of the caller based on the analyzed acoustic features and the meta-data, selecting an acoustic model matched to the classified background environment from a plurality of acoustic models, and performing speech recognition on the received audio signal using the selected acoustic model.

    On-Demand Language Translation for Television Programs

    Publication number: US20180046617A1

    Publication date: 2018-02-15

    Application number: US15797656

    Filing date: 2017-10-30

    CPC classification number: G06F17/289

    Abstract: In an embodiment, a method of providing an on demand translation service is provided. A subscriber may be charged a reduced fee or no fee for use of the on demand translation service in exchange for displaying commercial messages to the subscriber, the commercial messages being selected based on subscriber information. A multimedia signal including information in a source language may be received. The information may be obtained as text in the source language from the multimedia signal. The text may be translated from the source language to a target language. Translated information, based on the translated text, may be transmitted to a processing device for presentation to the subscriber. The received multimedia signal may be sent to a multimedia device for viewing.

SYSTEM AND METHOD FOR GENERATING CUSTOMIZED TEXT-TO-SPEECH VOICES
    10.
    Invention application (status: granted)

    Publication number: US20160093287A1

    Publication date: 2016-03-31

    Application number: US14965251

    Filing date: 2015-12-10

    Abstract: A system and method are disclosed for generating customized text-to-speech voices for a particular application. The method comprises generating a custom text-to-speech voice by selecting a voice for generating a custom text-to-speech voice associated with a domain, collecting text data associated with the domain from a pre-existing text data source and using the collected text data, generating an in-domain inventory of synthesis speech units by selecting speech units appropriate to the domain via a search of a pre-existing inventory of synthesis speech units, or by recording the minimal inventory for a selected level of synthesis quality. The text-to-speech custom voice for the domain is generated utilizing the in-domain inventory of synthesis speech units. Active learning techniques may also be employed to identify problem phrases wherein only a few minutes of recorded data is necessary to deliver a high quality TTS custom voice.
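
    The inventory-building step can be sketched as a lookup of domain text against an existing unit inventory, with anything missing flagged for recording. This is a simplified illustration, not the disclosed method: whole words stand in for synthesis speech units, and the function name and file names are hypothetical.

```python
def build_in_domain_inventory(domain_text, existing_inventory):
    """Split the units a domain needs into those already available and those to record.

    domain_text: iterable of sentences collected for the domain.
    existing_inventory: dict mapping a unit to its recording (hypothetical file names).
    Returns (in_domain, to_record), where in_domain maps units found in the
    existing inventory to their recordings and to_record lists missing units
    in order of first appearance.
    """
    needed = []
    seen = set()
    for line in domain_text:
        for unit in line.lower().split():
            if unit not in seen:
                seen.add(unit)
                needed.append(unit)

    in_domain = {u: existing_inventory[u] for u in needed if u in existing_inventory}
    to_record = [u for u in needed if u not in existing_inventory]
    return in_domain, to_record
```

    Units that the domain never uses are left out of the in-domain inventory, and `to_record` gives the minimal set of new recordings this toy model would need.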
