METHOD AND APPARATUS FOR DETECTING AND EXTRACTING INFORMATION FROM DYNAMICALLY GENERATED WEB PAGES
    52.
    发明申请
    METHOD AND APPARATUS FOR DETECTING AND EXTRACTING INFORMATION FROM DYNAMICALLY GENERATED WEB PAGES 有权
    用于从动态生成的网页中检测和提取信息的方法和装置

    公开(公告)号:US20110184973A1

    公开(公告)日:2011-07-28

    申请号:US13007931

    申请日:2011-01-17

    CPC classification number: G06F17/3089

    Abstract: A method and apparatus for automatically detecting and extracting information from dynamically generated web pages are disclosed. For example, the present method stores user provided information that is entered into a form interlace of a web page for a first query. Responsive to the first query, a first response web page is received and stored. The present method then automatically generates a second query to acquire a second response web page that is responsive to the second query. Finally, the present method compares the first response web page and the second response web page. In one embodiment, the present invention extracts information that is dissimilar between the first response web page and the second response web page. This extracted information is deemed to be the pertinent information requested by the user.

    Abstract translation: 公开了一种用于从动态生成的网页自动检测和提取信息的方法和装置。 例如,本方法将用户提供的信息存储在用于第一查询的网页的交错格式中。 响应于第一个查询,收到并存储第一个响应网页。 本方法然后自动生成第二查询以获取响应于第二查询的第二响应网页。 最后,本方法比较第一响应网页和第二响应网页。 在一个实施例中,本发明提取在第一响应网页和第二响应网页之间不相似的信息。 该提取的信息被认为是用户请求的相关信息。

    System and method for identifying critical emails
    53.
    发明申请
    System and method for identifying critical emails 有权
    用于识别关键电子邮件的系统和方法

    公开(公告)号:US20090254498A1

    公开(公告)日:2009-10-08

    申请号:US12080443

    申请日:2008-04-03

    CPC classification number: G06Q10/107

    Abstract: Disclosed is a method and system for identifying critical emails. To identify critical emails, a critical email classifier is trained from training data comprising labeled emails. The classifier extracts N-grams from the training data and identifies N-gram features from the extracted N-grams. The classifier also extracts salient features from the training data. The classifier is trained based on the identified N-gram features and the salient features so that the classifier can classify unlabeled emails as critical emails or non-critical emails.

    Abstract translation: 公开了用于识别关键电子邮件的方法和系统。 为了识别关键的电子邮件,关键的电子邮件分类器是从包含标记的电子邮件的数据的培训中进行培训的。 分类器从训练数据中提取N-gram,并从提取的N-gram中识别N-gram特征。 分类器还从训练数据中提取突出特征。 分类器基于识别的N-gram特征和突出特征进行训练,以便分类器可以将未标记的电子邮件分类为关键电子邮件或非关键电子邮件。

    SYSTEM AND METHOD FOR INCREASING ACCURACY OF SEARCHES BASED ON COMMUNITIES OF INTEREST
    54.
    发明申请
    SYSTEM AND METHOD FOR INCREASING ACCURACY OF SEARCHES BASED ON COMMUNITIES OF INTEREST 有权
    基于兴趣社区提高搜索精确度的系统和方法

    公开(公告)号:US20090112600A1

    公开(公告)日:2009-04-30

    申请号:US11931830

    申请日:2007-10-31

    CPC classification number: G10L15/193 G10L15/065 G10L2015/228

    Abstract: Disclosed are systems, methods and computer-readable media for using a local communication network to generate a speech model. The method includes retrieving for an individual a list of numbers in a calling history, identifying a local neighborhood associated with each number in the calling history, truncating the local neighborhood associated with each number based on the at least one parameter, retrieving a local communication network associated with each number in the calling history and each phone number in the local neighborhood, and creating a language model for the individual based on the retrieved local communication network. The generated language model may be used for improved automatic speech recognition for audible searches as well as other modules in a spoken dialog system.

    Abstract translation: 公开了用于使用本地通信网络生成语音模型的系统,方法和计算机可读介质。 该方法包括检索个人在呼叫历史中的号码列表,识别与呼叫历史中的每个号码相关联的本地邻域,基于该至少一个参数截断与每个号码相关联的本地邻域,检索本地通信网络 与呼叫历史中的每个号码和本地邻域中的每个电话号码相关联,以及基于所检索的本地通信网络为个人创建语言模型。 生成的语言模型可以用于改善声音搜索的自动语音识别以及口语对话系统中的其他模块。

    System And Method Of Generating Responses To Text-Based Messages
    55.
    发明申请
    System And Method Of Generating Responses To Text-Based Messages 有权
    生成对基于文本的消息的响应的系统和方法

    公开(公告)号:US20090076795A1

    公开(公告)日:2009-03-19

    申请号:US11857036

    申请日:2007-09-18

    CPC classification number: G06F17/2785

    Abstract: In accordance with one aspect of the present invention, an automated method of and system for generating a response to a text-based natural language message is disclosed. The method includes identifying a sentence in the text-based natural language message. Also, identifying an input clause in the sentence. Further, comparing the input clause to a previously received clause, where the previously received clause is correlated with a previously generated response message. Additionally, generating an output response message based on the previously generated response message. The system includes means for performing the method steps.

    Abstract translation: 根据本发明的一个方面,公开了一种用于生成对基于文本的自然语言消息的响应的自动化方法和系统。 该方法包括识别基于文本的自然语言消息中的句子。 另外,确定句子中的一个input子句。 此外,将输入子句与先前接收的子句进行比较,其中先前接收的子句与先前生成的响应消息相关联。 另外,基于先前生成的响应消息生成输出响应消息。 该系统包括用于执行方法步骤的装置。

    SYSTEM AND METHOD FOR IMPROVING ROBUSTNESS OF SPEECH RECOGNITION USING VOCAL TRACT LENGTH NORMALIZATION CODEBOOKS
    56.
    发明申请
    SYSTEM AND METHOD FOR IMPROVING ROBUSTNESS OF SPEECH RECOGNITION USING VOCAL TRACT LENGTH NORMALIZATION CODEBOOKS 有权
    使用VOCAL TRACT LENGTH NORMALIZATION CODEBOOKS来提高语音识别的鲁棒性的系统和方法

    公开(公告)号:US20080319741A1

    公开(公告)日:2008-12-25

    申请号:US11765527

    申请日:2007-06-20

    Applicant: Mazin Gilbert

    Inventor: Mazin Gilbert

    CPC classification number: G10L15/07

    Abstract: Disclosed are systems, methods, and computer readable media for performing speech recognition. The method embodiment comprises selecting a codebook from a plurality of codebooks with a minimal acoustic distance to a received speech sample, the plurality of codebooks generated by a process of (a) computing a vocal tract length for a each of a plurality of speakers, (b) for each of the plurality of speakers, clustering speech vectors, and (c) creating a codebook for each speaker, the codebook containing entries for the respective speaker's vocal tract length, speech vectors, and an optional vector weight for each speech vector, (2) applying the respective vocal tract length associated with the selected codebook to normalize the received speech sample for use in speech recognition, and (3) recognizing the received speech sample based on the respective vocal tract length associated with the selected codebook.

    Abstract translation: 公开了用于执行语音识别的系统,方法和计算机可读介质。 方法实施例包括从具有对接收到的语音样本的最小声距离的多个码本中选择码本,所述多个码本通过(a)计算多个扬声器中的每一个的声道长度的处理而生成( b)对于所述多个扬声器中的每一个,聚类语音向量,以及(c)为每个说话者创建码本,所述码本包含用于每个语音向量的相应说话者声道长度,语音向量和可选矢量权重的条目, (2)应用与所选码本相关联的相应声道长度,以规范化用于语音识别的接收到的语音样本,以及(3)基于与所选码本相关联的相应声道长度来识别所接收的语音样本。

    Automatically associating relevant advertising with video content
    57.
    发明申请
    Automatically associating relevant advertising with video content 审中-公开
    自动将相关广告与视频内容相关联

    公开(公告)号:US20080120646A1

    公开(公告)日:2008-05-22

    申请号:US11601993

    申请日:2006-11-20

    Abstract: A method and system are provided for automatically selecting advertisements for placement in media content segments such as video segments. The method utilizes a classification engine to analyze values of a feature set extracted from the video segment, and to select one or more categories of advertisements to place in the segment. The classification engine is trainable using training data such as historical video segments in which advertisements were placed manually, and using performance data measuring the effectiveness of past advertisement placement in particular segments.

    Abstract translation: 提供了一种方法和系统,用于自动选择广告以便放置在诸如视频片段的媒体内容片段中。 该方法利用分类引擎来分析从视频片段提取的特征集的值,并且选择一个或多个类别的广告以放置在片段中。 分类引擎可以使用诸如历史视频段之类的训练数据进行训练,其中手动放置广告,并且使用测量过去广告布置在特定段中的有效性的性能数据。

    System and method for tracking fraudulent electronic transactions using voiceprints of uncommon words
    59.
    发明授权
    System and method for tracking fraudulent electronic transactions using voiceprints of uncommon words 有权
    使用不常见词语声纹跟踪欺诈性电子交易的系统和方法

    公开(公告)号:US08831941B2

    公开(公告)日:2014-09-09

    申请号:US11754800

    申请日:2007-05-29

    Abstract: Disclosed are systems, methods, and computer readable media for comparing customer voice prints comprising of uncommonly spoken words with a database of known fraudulent voice signatures and continually updating the database to decrease the risk of identity theft. The method embodiment comprises comparing a received voice signal against a database of known fraudulent voice signatures, denying the caller's transaction if the voice signal substantially matches the database of known fraudulent voice signatures, adding the caller's voice signal to the database of known fraudulent voice signatures if the voice signal does not substantially match a separate speaker verification database and received additional information is not verified.

    Abstract translation: 公开的是系统,方法和计算机可读介质,用于将包含不常用语音单词的客户语音输入与已知欺诈语音签名的数据库进行比较,并持续更新数据库以减少身份盗用的风险。 方法实施例包括将接收到的语音信号与已知欺诈性语音签名的数据库进行比较,如果语音信号基本上与已知欺诈性语音签名的数据库匹配,则拒绝主叫方的交易,如果呼叫者的话音信号加到已知的欺诈语音签名的数据库中, 语音信号基本上不匹配单独的扬声器验证数据库,并且未验证所接收的附加信息。

    On-Demand language translation for television programs
    60.
    发明授权
    On-Demand language translation for television programs 有权
    电视节目的按需语言翻译

    公开(公告)号:US08589146B2

    公开(公告)日:2013-11-19

    申请号:US12772580

    申请日:2010-05-03

    Abstract: A method, a system and a machine-readable medium are provided for an on demand translation service. A translation module including at least one language pair module for translating a source language to a target language may be made available for use by a subscriber. The subscriber may be charged a fee for use of the requested on demand translation service or may be provided use of the on demand translation service for free in exchange for displaying commercial messages to the subscriber. A video signal may be received including information in the source language, which may be obtained as text from the video signal and may be translated from the source language to the target language by use of the translation module. Translated information, based on the translated text, may be added into the received video signal. The video signal including the translated information in the target language may be sent to a display device.

    Abstract translation: 为按需翻译服务提供方法,系统和机器可读介质。 包括用于将源语言翻译成目标语言的至少一个语言对模块的翻译模块可以被用户使用。 用户可能会收取使用所请求的按需翻译服务的费用,或者可以免费使用按需翻译服务,以便向用户显示商业消息。 可以接收包括源语言的信息的视频信号,其可以从视频信号获取为文本,并且可以通过使用翻译模块从源语言翻译成目标语言。 基于翻译文本的翻译信息可以被添加到接收的视频信号中。 可以将包括目标语言的翻译信息的视频信号发送到显示装置。

Patent Agency Ranking