Bootstrapping language models for spoken dialog systems using the world wide web
    41.
    发明授权
    Bootstrapping language models for spoken dialog systems using the world wide web 有权
    使用万维网的口语对话系统的自举语言模型

    公开(公告)号:US09299345B1

    公开(公告)日:2016-03-29

    申请号:US11425243

    申请日:2006-06-20

    CPC classification number: G10L15/22 G06F17/279 G10L15/183 G10L15/30

    Abstract: A system, method and computer readable medium that generates a language model from data from a web domain is disclosed. The method may include filtering web data to remove unwanted data from the web domain data, extracting predicate/argument pairs from the filtered web data, generating conversational utterances by merging the extracted predicate/argument pairs into conversational templates, and generating a web data language model using the generated conversational utterances.

    Abstract translation: 公开了一种从Web域的数据生成语言模型的系统,方法和计算机可读介质。 该方法可以包括过滤web数据以从Web域数据中移除不需要的数据,从过滤的web数据中提取谓词/参数对,通过将提取的谓词/参数对合并成对话模板来生成对话话语,以及生成Web数据语言模型 使用生成的会话话语。

    System and method for storing advertising data
    42.
    发明授权
    System and method for storing advertising data 有权
    用于存储广告数据的系统和方法

    公开(公告)号:US09106974B2

    公开(公告)日:2015-08-11

    申请号:US11725995

    申请日:2007-03-20

    Abstract: A computerized method is disclosed for presenting advertising data extracted from a video data stream, the method including storing a plurality of advertising data items extracted from the video data stream at an end user device; and displaying a plurality of sorted advertising indicator data items at the end user device, wherein each of the advertising indicator data items indicates one of the plurality of stored advertising data items. A system is disclosed for performing the method. A data structure is disclosed providing a functional and structural interrelationship between a processor in the system and data in the data structure.

    Abstract translation: 公开了一种用于呈现从视频数据流提取的广告数据的计算机化方法,所述方法包括:在最终用户设备处存储从视频数据流提取的多个广告数据项; 以及在所述终端用户设备处显示多个排序广告指示符数据项,其中所述广告指示符数据项中的每一个指示所述多个存储的广告数据项中的一个。 公开了一种用于执行该方法的系统。 公开了提供系统中的处理器与数据结构中的数据之间的功能和结构相互关系的数据结构。

    Method and apparatus for identifying acoustic background environments based on time and speed to enhance automatic speech recognition
    43.
    发明授权
    Method and apparatus for identifying acoustic background environments based on time and speed to enhance automatic speech recognition 有权
    用于基于时间和速度识别声学背景环境以增强自动语音识别的方法和装置

    公开(公告)号:US08762143B2

    公开(公告)日:2014-06-24

    申请号:US11754814

    申请日:2007-05-29

    Applicant: Mazin Gilbert

    Inventor: Mazin Gilbert

    Abstract: Disclosed are systems, methods, and computer readable media for identifying an acoustic environment of a caller. The method embodiment comprises analyzing acoustic features of a received audio signal from a caller, receiving meta-data information based on a previously recorded time and speed of the caller, classifying a background environment of the caller based on the analyzed acoustic features and the meta-data, selecting an acoustic model matched to the classified background environment from a plurality of acoustic models, and performing speech recognition as the received audio signal using the selected acoustic model.

    Abstract translation: 公开了用于识别呼叫者的声学环境的系统,方法和计算机可读介质。 方法实施例包括分析来自呼叫者的接收到的音频信号的声学特征,基于先前记录的呼叫者的时间和速度接收元数据信息,基于所分析的声学特征和元分析对呼叫者的背景环境进行分类, 数据,从多个声学模型中选择与分类的背景环境匹配的声学模型,以及使用所选择的声学模型来执行语音识别作为所接收的音频信号。

    System and method for optimizing response handling time and customer satisfaction scores

    公开(公告)号:US08359364B2

    公开(公告)日:2013-01-22

    申请号:US13539896

    申请日:2012-07-02

    Abstract: A system and method disclosed for using and updating a database of template responses for a live agent in response to user communications. The method includes computing an average string distance between each response from a live agent and a template, use to generate the response, modifying the computed average string distance based on a customer satisfaction score associated with each response and selecting a response that minimizes the computed average string distance and maximizes customer satisfaction. Upon receiving a further communication on a certain issue, the system presents a prototype response that has been added to the template database to the live agent for use in generating a response to the further communication that reduces handling time and increases customer satisfaction.

    System and method for training a critical e-mail classifier using a plurality of base classifiers and N-grams
    47.
    发明授权
    System and method for training a critical e-mail classifier using a plurality of base classifiers and N-grams 有权
    使用多个基本分类器和N-gram训练关键电子邮件分类器的系统和方法

    公开(公告)号:US08195588B2

    公开(公告)日:2012-06-05

    申请号:US12080443

    申请日:2008-04-03

    CPC classification number: G06Q10/107

    Abstract: Disclosed is a method and system for identifying critical emails. To identify critical emails, a critical email classifier is trained from training data comprising labeled emails. The classifier extracts N-grams from the training data and identifies N-gram features from the extracted N-grams. The classifier also extracts salient features from the training data. The classifier is trained based on the identified N-gram features and the salient features so that the classifier can classify unlabeled emails as critical emails or non-critical emails.

    Abstract translation: 公开了用于识别关键电子邮件的方法和系统。 为了识别关键的电子邮件,关键的电子邮件分类器是从包含标记的电子邮件的数据的培训中进行培训的。 分类器从训练数据中提取N-gram,并从提取的N-gram中识别N-gram特征。 分类器还从训练数据中提取突出特征。 分类器基于识别的N-gram特征和突出特征进行训练,以便分类器可以将未标记的电子邮件分类为关键电子邮件或非关键电子邮件。

    SYSTEM AND METHOD FOR GENERATING MODELS FOR USE IN AUTOMATIC SPEECH RECOGNITION
    48.
    发明申请
    SYSTEM AND METHOD FOR GENERATING MODELS FOR USE IN AUTOMATIC SPEECH RECOGNITION 有权
    用于生成用于自动语音识别的模型的系统和方法

    公开(公告)号:US20120101817A1

    公开(公告)日:2012-04-26

    申请号:US12908222

    申请日:2010-10-20

    CPC classification number: G10L15/063 G10L2015/0638

    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating a model for use with automatic speech recognition. These principles can be implemented as part of a streamlined tool for automatic training and tuning of speech, or other, models with a fast turnaround and with limited human involvement. A system configured to practice the method receives, as part of a request to generate a model, input data and a seed model. The system receives a cost function indicating accuracy and at least one of speed and memory usage, The system processes the input data based on seed model and based on parameters that optimize the cost function to yield an updated model, and outputs the updated model.

    Abstract translation: 这里公开了用于生成用于自动语音识别的模型的系统,方法和非暂时的计算机可读存储介质。 这些原则可以作为精简工具的一部分,用于自动训练和调整语音或其他模型,具有快速的周转时间和人力参与有限。 配置为练习该方法的系统作为生成模型的请求的一部分接收输入数据和种子模型。 系统接收指示精度和速度和存储器使用中的至少一个的成本函数。系统基于种子模型处理输入数据,并且基于优化成本函数以产生更新模型的参数,并输出更新的模型。

    System And Method Of Generating Responses To Text-Based Messages
    49.
    发明申请
    System And Method Of Generating Responses To Text-Based Messages 有权
    生成对基于文本的消息的响应的系统和方法

    公开(公告)号:US20120065963A1

    公开(公告)日:2012-03-15

    申请号:US13300752

    申请日:2011-11-21

    CPC classification number: G06F17/2785

    Abstract: In accordance with one aspect of the present invention, an automated method of and system for generating a response to a text-based natural language message is disclosed. The method includes identifying a first selected input clause in a sentence in the text-based natural language message. Also, assigning a semantic tag to the first selected input clause and matching the semantic tag to a historical input tag. The historical input tag associated with a first previously generated response clause. Further; generating an output response message based on the historical response clause, the output response message derived from the historical input tag and a second previously generated response clause. The system includes means for performing the method steps.

    Abstract translation: 根据本发明的一个方面,公开了一种用于生成对基于文本的自然语言消息的响应的自动化方法和系统。 该方法包括识别基于文本的自然语言消息中的句子中的第一选择的输入子句。 此外,将语义标签分配给第一选择的输入子句并将语义标签与历史输入标签进行匹配。 与先前生成的第一个响应子句相关联的历史输入标签。 进一步; 基于历史响应子句生成输出响应消息,从历史输入标签导出的输出响应消息和第二个先前生成的响应子句。 该系统包括用于执行方法步骤的装置。

    Systems and Methods for Targeted Advertising in Voicemail to Text Systems
    50.
    发明申请
    Systems and Methods for Targeted Advertising in Voicemail to Text Systems 审中-公开
    语音邮件到文本系统中的目标广告系统和方法

    公开(公告)号:US20120022950A1

    公开(公告)日:2012-01-26

    申请号:US12843836

    申请日:2010-07-26

    CPC classification number: H04M3/53333 G06Q30/0241 G06Q30/0269 G06Q30/0273

    Abstract: Systems and methods are provided for a voice message to text system supporting targeted advertisements. Voice messages received from users are converted to raw text messages that are normalized to insert proper punctuation and extract entity information. The normalized text and entity information are processed to extract concepts, such as critical phrases, from the normalized text. Extracted concepts are then matched to advertisements on an advertisement database having user selection criteria. Advertisements having selection criteria matching the extracted concepts are transmitted to the users, and the advertisers that placed the advertisements are charged fees for the advertisements. User profile information and user context information can additionally be used to select advertisements for transmission to users.

    Abstract translation: 系统和方法被提供用于支持目标广告的文本系统的语音消息。 从用户接收的语音消息被转换为原始文本消息,其被标准化以插入适当的标点符号并提取实体信息。 处理归一化的文本和实体信息以从标准化文本中提取概念,例如关键短语。 然后将提取的概念与具有用户选择标准的广告数据库上的广告相匹配。 将具有与所提取的概念相匹配的选择标准的广告传送给用户,并且放置广告的广告商对广告收取费用。 用户简档信息和用户上下文信息可另外用于选择用于传输给用户的广告。

Patent Agency Ranking