Identification of topics for online discussions based on language patterns
    1.
    发明授权
    Identification of topics for online discussions based on language patterns 有权
    基于语言模式识别在线讨论的主题

    公开(公告)号:US07739261B2

    公开(公告)日:2010-06-15

    申请号:US11763282

    申请日:2007-06-14

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30731 G06Q30/02

    摘要: A topic identification system identifies topics of online discussions by iteratively identifying topic words or keywords of the online discussions and identifying language patterns associated with those keywords. The topic identification system starts out with an initial set of keywords and identifies language patterns that each include a keyword. The topic identification system then uses the identified language patterns to identify additional keywords of the online discussion that match the patterns. The topic identification system then again identifies language patterns using the keywords including the newly identified keywords. The topic identification system may repeat the process of identifying language patterns and keywords until a termination criterion is satisfied.

    摘要翻译: 主题识别系统通过迭代地识别在线讨论的主题或关键字并识别与这些关键字相关联的语言模式来识别在线讨论的主题。 主题识别系统以一组初始关键字开始,并识别每个关键字的语言模式。 然后,主题识别系统使用所识别的语言模式来识别与模式匹配的在线讨论的附加关键字。 然后,主题识别系统再次使用包括新确定的关键字的关键字来识别语言模式。 主题识别系统可以重复识别语言模式和关键字的过程,直到满足终止标准。

    USER ADVERTISEMENT CLICK BEHAVIOR MODELING
    2.
    发明申请
    USER ADVERTISEMENT CLICK BEHAVIOR MODELING 有权
    用户广告点击行为建模

    公开(公告)号:US20090299967A1

    公开(公告)日:2009-12-03

    申请号:US12131126

    申请日:2008-06-02

    IPC分类号: G06F7/06 G06F17/30

    CPC分类号: G06Q30/02

    摘要: Described herein is technology for, among other things, mining similar user clusters based on user advertisement click behaviors. The technology involves methods and systems for mining similar user clusters based on log data available on an online advertising platform. By building a user linkage representation based on one or more attributes from the log data, the similar user clusters can be harvested in more efficient manner.

    摘要翻译: 这里描述的是用于基于用户广告点击行为挖掘类似用户群的技术。 该技术涉及基于在线广告平台上可用的日志数据挖掘类似用户群集的方法和系统。 通过基于日志数据中的一个或多个属性构建用户链接表示,可以以更有效的方式收集类似的用户群集。

    ADVERTISEMENT APPROVAL BASED ON TRAINING DATA
    3.
    发明申请
    ADVERTISEMENT APPROVAL BASED ON TRAINING DATA 审中-公开
    基于培训数据的广告批准

    公开(公告)号:US20080300971A1

    公开(公告)日:2008-12-04

    申请号:US11755523

    申请日:2007-05-30

    IPC分类号: G06Q30/00

    摘要: A system for determining whether to approve a target document (e.g., advertisement) is provided. The system trains a classifier using tuples of words from appropriate documents and tuples of words from inappropriate documents. To approve a target document, the system identifies tuples of words of the target document. The system then applies the classifier to the identified tuples to classify the document as being appropriate or inappropriate. If the document is classified as appropriate, the system automatically approves the document.

    摘要翻译: 提供用于确定是否批准目标文档(例如,广告)的系统。 系统使用适当文件的单词组和不适当文件的单词元组来训练分类器。 要批准目标文档,系统会标识目标文档的单词元组。 然后,系统将分类器应用于所识别的元组,以将文档分类为合适或不合适。 如果文档被分类为适当的,系统将自动批准文档。

    CLICK-THROUGH LOG MINING
    4.
    发明申请
    CLICK-THROUGH LOG MINING 有权
    点击通过日志采矿

    公开(公告)号:US20080208841A1

    公开(公告)日:2008-08-28

    申请号:US11870359

    申请日:2007-10-10

    IPC分类号: G06F17/30

    摘要: Click-through log mining is described. Raw search click-through log data is processed to generate ordered query keywords, utilizing an algorithm to expand user-submitted keywords to include high frequency user queries, managing the keywords for a keyword expansion file, analyzing the algorithm performance on a bidding criteria, and identifying related phrases with similar page-click behaviors for advertisements.

    摘要翻译: 描述了点击式日志挖掘。 处理原始搜索点击后日志数据以生成有序查询关键字,利用算法来扩展用户提交的关键字以包括高频用户查询,管理关键字扩展文件的关键字,以出价标准分析算法性能;以及 识别与广告相似的页面点击行为的相关短语。

    Keyword usage score based on frequency impulse and frequency weight
    5.
    发明授权
    Keyword usage score based on frequency impulse and frequency weight 失效
    基于频率冲击和频率权重的关键词使用得分

    公开(公告)号:US07644075B2

    公开(公告)日:2010-01-05

    申请号:US11756740

    申请日:2007-06-01

    IPC分类号: G06F17/30

    摘要: A method and system for assessing keyword usage based on frequency of usage of the keywords during various periods is provided. A keyword usage measurement system is provided with the frequency of keywords during various periods. The measurement system then calculates a recent usage score for a keyword by combining a frequency impulse score for the keyword with a frequency weight for the keyword. The frequency impulse score for a keyword indicates whether a recent change in the frequency of the keyword has occurred. The frequency weight for a keyword indicates a recent measure of the frequency of the keyword.

    摘要翻译: 提供了一种基于各种期间关键词使用频率来评估关键字使用的方法和系统。 关键字使用测量系统在不同时期提供关键字的频率。 然后,测量系统通过将关键字的频率脉冲得分与该关键字的频率权重组合来计算关键字的最近使用分数。 关键字的频率脉冲得分指示是否发生了关键字的频率的最近的改变。 关键字的频率权重表示最近对关键字频率的度量。

    Advertiser monetization modeling
    6.
    发明授权
    Advertiser monetization modeling 有权
    广告商营利建模

    公开(公告)号:US08117050B2

    公开(公告)日:2012-02-14

    申请号:US12131124

    申请日:2008-06-02

    IPC分类号: G06Q40/00 G06Q30/00 G01C21/34

    摘要: Embodiments of the claimed subject matter provide a method and system for modeling advertiser monetization. The claimed subject matter provides a method and system from which an advertisement may be evaluated according to various metrics to determine a quality relative to other advertisements. The relative quality considers the content of the advertisement, the performance of the advertisement and the history of the advertiser's bidding behavior.One embodiment of the claimed subject matter is implemented as a method for advertiser monetization modeling. One or more advertisements are received from one or more advertisers. The quality of the advertisement(s) is defined according to certain metrics, such as the quality of the content of the advertisement, the quality of the past and estimated future performance of the advertisement and the history of bidding behavior of the advertiser. After the respective quality of the advertisement(s) is determined, the advertisement(s) is ranked with other advertisements according to the determined quality.

    摘要翻译: 所要求保护的主题的实施例提供了用于对广告商获利进行建模的方法和系统。 所要求保护的主题提供了一种方法和系统,从该方法和系统可以根据各种度量来评估广告以确定相对于其他广告的质量。 相对质量考虑广告的内容,广告的表现以及广告商的投标行为的历史。 所要求保护的主题的一个实施例被实现为广告商获利建模的方法。 从一个或多个广告商接收一个或多个广告。 广告的质量根据广告内容的质量,过去的质量以及广告的未来预测以及广告主的投标行为的历史等某些指标来定义。 在确定了广告的相应质量之后,根据所确定的质量对广告进行其他广告的排序。

    Abbreviation expansion based on learned weights
    7.
    发明授权
    Abbreviation expansion based on learned weights 有权
    基于学习权重的缩写扩展

    公开(公告)号:US07848918B2

    公开(公告)日:2010-12-07

    申请号:US11538770

    申请日:2006-10-04

    IPC分类号: G06F17/27 G06F17/21 G10L21/00

    CPC分类号: G06F17/28

    摘要: A method and system for identifying expansions of abbreviations using learned weights is provided. An abbreviation system generates features for various expansions of an abbreviation and generates a score indicating the likelihood that an expansion is a correct expansion of the abbreviation. A expansion with the same number of words as letters in the abbreviation is more likely in general to be a correct expansion than an expansion with more or fewer words. The abbreviation system calculates a score based on a weighted combination of the features. The abbreviation system learns the weights for the features from training data of abbreviations, candidate expansions, and scores for the candidate expansions.

    摘要翻译: 提供了一种用于使用学习的权重来识别缩写的扩展的方法和系统。 缩写系统产生缩写的各种扩展的特征,并生成表示扩展是缩写的正确扩展的可能性的分数。 与缩写中的字母相同数量的单词的扩展通常可能是具有更多或更少单词的扩展的正确扩展。 缩写系统基于特征的加权组合来计算得分。 缩写系统从候选扩展的缩写,候选扩展和分数的训练数据中学习特征的权重。

    IDENTIFICATION OF TOPICS FOR ONLINE DISCUSSIONS BASED ON LANGUAGE PATTERNS
    8.
    发明申请
    IDENTIFICATION OF TOPICS FOR ONLINE DISCUSSIONS BASED ON LANGUAGE PATTERNS 有权
    基于语言模式的在线讨论主题的识别

    公开(公告)号:US20080313180A1

    公开(公告)日:2008-12-18

    申请号:US11763282

    申请日:2007-06-14

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30731 G06Q30/02

    摘要: A topic identification system identifies topics of online discussions by iteratively identifying topic words or keywords of the online discussions and identifying language patterns associated with those keywords. The topic identification system starts out with an initial set of keywords and identifies language patterns that each include a keyword. The topic identification system then uses the identified language patterns to identify additional keywords of the online discussion that match the patterns. The topic identification system then again identifies language patterns using the keywords including the newly identified keywords. The topic identification system may repeat the process of identifying language patterns and keywords until a termination criterion is satisfied.

    摘要翻译: 主题识别系统通过迭代地识别在线讨论的主题或关键字并识别与这些关键字相关联的语言模式来识别在线讨论的主题。 主题识别系统以一组初始关键字开始,并识别每个关键字的语言模式。 然后,主题识别系统使用所识别的语言模式来识别与模式匹配的在线讨论的附加关键字。 然后,主题识别系统再次使用包括新确定的关键字的关键字来识别语言模式。 主题识别系统可以重复识别语言模式和关键字的过程,直到满足终止标准。

    ABBREVIATION EXPANSION BASED ON LEARNED WEIGHTS
    9.
    发明申请
    ABBREVIATION EXPANSION BASED ON LEARNED WEIGHTS 有权
    基于知识权重的缩小扩张

    公开(公告)号:US20080086297A1

    公开(公告)日:2008-04-10

    申请号:US11538770

    申请日:2006-10-04

    IPC分类号: G06F17/28

    CPC分类号: G06F17/28

    摘要: A method and system for identifying expansions of abbreviations using learned weights is provided. An abbreviation system generates features for various expansions of an abbreviation and generates a score indicating the likelihood that an expansion is a correct expansion of the abbreviation. A expansion with the same number of words as letters in the abbreviation is more likely in general to be a correct expansion than an expansion with more or fewer words. The abbreviation system calculates a score based on a weighted combination of the features. The abbreviation system learns the weights for the features from training data of abbreviations, candidate expansions, and scores for the candidate expansions.

    摘要翻译: 提供了一种用于使用学习的权重来识别缩写的扩展的方法和系统。 缩写系统产生缩写的各种扩展的特征,并生成表示扩展是缩写的正确扩展的可能性的分数。 与缩写中的字母相同数量的单词的扩展通常可能是具有更多或更少单词的扩展的正确扩展。 缩写系统基于特征的加权组合来计算得分。 缩写系统从候选扩展的缩写,候选扩展和分数的训练数据中学习特征的权重。

    Predicting keyword monetization
    10.
    发明授权
    Predicting keyword monetization 有权
    预测关键字营利

    公开(公告)号:US08682839B2

    公开(公告)日:2014-03-25

    申请号:US12131125

    申请日:2008-06-02

    IPC分类号: G06F17/30 G06Q40/00

    摘要: Embodiments of the claimed subject matter provide a method and system for predicting bidding keyword monetization. The claimed subject matter provides a method and system with which the value of a keyword for the purpose of relevant online advertisement may be evaluated according to various metrics to determine a bidding landscape for use in advertising campaigns. The value of the keyword considers certain attributes related to the monetization of the keyword.One embodiment of the claimed subject matter is implemented as a method for predicting keyword monetization for one or more keyword-advertisement relationships. Historical data for the one or more keyword-advertisement relationships is referenced and used to generate a global model of the one or more keyword-advertisement relationship. The relationships are then evaluated according to a time-series analysis, which parses the data from the historical data and the global model to create predictions for the keyword monetization according to the keyword-advertisement relationships.

    摘要翻译: 所要求保护的主题的实施例提供了用于预测投标关键字货币化的方法和系统。 所要求保护的主题提供了一种方法和系统,其中可以根据各种度量来评估用于相关在线广告的关键字的价值,以确定用于广告活动的投标景观。 该关键字的值考虑与关键字获利相关的特定属性。 所要求保护的主题的一个实施例被实现为用于预测一个或多个关键字 - 广告关系的关键字获利的方法。 引用一个或多个关键字 - 广告关系的历史数据,并用于生成一个或多个关键字 - 广告关系的全局模型。 然后根据时间序列分析来评估关系,该时间序列分析从历史数据和全球模型中分析数据,以根据关键字 - 广告关系创建关键字营利的预测。