CLICK-THROUGH LOG MINING
    1.
    发明申请
    CLICK-THROUGH LOG MINING 有权
    点击通过日志采矿

    公开(公告)号:US20080208841A1

    公开(公告)日:2008-08-28

    申请号:US11870359

    申请日:2007-10-10

    IPC分类号: G06F17/30

    摘要: Click-through log mining is described. Raw search click-through log data is processed to generate ordered query keywords, utilizing an algorithm to expand user-submitted keywords to include high frequency user queries, managing the keywords for a keyword expansion file, analyzing the algorithm performance on a bidding criteria, and identifying related phrases with similar page-click behaviors for advertisements.

    摘要翻译: 描述了点击式日志挖掘。 处理原始搜索点击后日志数据以生成有序查询关键字,利用算法来扩展用户提交的关键字以包括高频用户查询,管理关键字扩展文件的关键字,以出价标准分析算法性能;以及 识别与广告相似的页面点击行为的相关短语。

    User query mining for advertising matching
    2.
    发明授权
    User query mining for advertising matching 有权
    用户查询挖掘广告匹配

    公开(公告)号:US08285745B2

    公开(公告)日:2012-10-09

    申请号:US11849136

    申请日:2007-08-31

    IPC分类号: G06F17/30

    摘要: Systems and methods to determine relevant keywords from a user's search query sessions are disclosed. The described method includes identifying search session logs of a user, segmenting the search session logs into one or more search sessions. After the segmentation, the search sessions are analyzed to compose a list of semantically relevant keyword sets including at least a first keyword set and a second keyword set. The described method further includes determining a semantic relevance between the first and second keyword sets according to the frequency at which the first and second keyword sets are reported in the query results and displaying one or more semantically high relevant keyword sets after being filtered by a threshold.

    摘要翻译: 公开了从用户的搜索查询会话确定相关关键词的系统和方法。 所描述的方法包括识别用户的搜索会话日志,将搜索会话日志分割成一个或多个搜索会话。 在分割之后,分析搜索会话以构成包括至少第一关键词集合和第二关键字集合的语义相关关键字集合的列表。 所描述的方法还包括根据在查询结果中报告第一和第二关键字集合的频率来确定第一和第二关键字集合之间的语义相关性,并且在被阈值过滤之后显示一个或多个语义上相关的关键字集合 。

    USER QUERY MINING FOR ADVERTISING MATCHING
    3.
    发明申请
    USER QUERY MINING FOR ADVERTISING MATCHING 有权
    用户查询采购广告匹配

    公开(公告)号:US20090063461A1

    公开(公告)日:2009-03-05

    申请号:US11849136

    申请日:2007-08-31

    IPC分类号: G06F7/06 G06F17/30

    摘要: Systems and methods to determine relevant keywords from a user's search query sessions are disclosed. The described method includes identifying search session logs of a user, segmenting the search session logs into one or more search sessions. After the segmentation, the search sessions are analyzed to compose a list of semantically relevant keyword sets including at least a first keyword set and a second keyword set. The described method further includes determining a semantic relevance between the first and second keyword sets according to the frequency at which the first and second keyword sets are reported in the query results and displaying one or more semantically high relevant keyword sets after being filtered by a threshold.

    摘要翻译: 公开了从用户的搜索查询会话确定相关关键词的系统和方法。 所描述的方法包括识别用户的搜索会话日志,将搜索会话日志分割成一个或多个搜索会话。 在分割之后,分析搜索会话以构成包括至少第一关键词集合和第二关键字集合的语义相关关键字集合的列表。 所描述的方法还包括根据在查询结果中报告第一和第二关键字集合的频率来确定第一和第二关键字集合之间的语义相关性,并且在被阈值过滤之后显示一个或多个语义上相关的关键字集合 。

    Identification of topics for online discussions based on language patterns
    4.
    发明授权
    Identification of topics for online discussions based on language patterns 有权
    基于语言模式识别在线讨论的主题

    公开(公告)号:US07739261B2

    公开(公告)日:2010-06-15

    申请号:US11763282

    申请日:2007-06-14

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30731 G06Q30/02

    摘要: A topic identification system identifies topics of online discussions by iteratively identifying topic words or keywords of the online discussions and identifying language patterns associated with those keywords. The topic identification system starts out with an initial set of keywords and identifies language patterns that each include a keyword. The topic identification system then uses the identified language patterns to identify additional keywords of the online discussion that match the patterns. The topic identification system then again identifies language patterns using the keywords including the newly identified keywords. The topic identification system may repeat the process of identifying language patterns and keywords until a termination criterion is satisfied.

    摘要翻译: 主题识别系统通过迭代地识别在线讨论的主题或关键字并识别与这些关键字相关联的语言模式来识别在线讨论的主题。 主题识别系统以一组初始关键字开始,并识别每个关键字的语言模式。 然后,主题识别系统使用所识别的语言模式来识别与模式匹配的在线讨论的附加关键字。 然后,主题识别系统再次使用包括新确定的关键字的关键字来识别语言模式。 主题识别系统可以重复识别语言模式和关键字的过程,直到满足终止标准。

    ADVERTISEMENT APPROVAL BASED ON TRAINING DATA
    5.
    发明申请
    ADVERTISEMENT APPROVAL BASED ON TRAINING DATA 审中-公开
    基于培训数据的广告批准

    公开(公告)号:US20080300971A1

    公开(公告)日:2008-12-04

    申请号:US11755523

    申请日:2007-05-30

    IPC分类号: G06Q30/00

    摘要: A system for determining whether to approve a target document (e.g., advertisement) is provided. The system trains a classifier using tuples of words from appropriate documents and tuples of words from inappropriate documents. To approve a target document, the system identifies tuples of words of the target document. The system then applies the classifier to the identified tuples to classify the document as being appropriate or inappropriate. If the document is classified as appropriate, the system automatically approves the document.

    摘要翻译: 提供用于确定是否批准目标文档(例如,广告)的系统。 系统使用适当文件的单词组和不适当文件的单词元组来训练分类器。 要批准目标文档,系统会标识目标文档的单词元组。 然后,系统将分类器应用于所识别的元组,以将文档分类为合适或不合适。 如果文档被分类为适当的,系统将自动批准文档。

    IDENTIFICATION OF TOPICS FOR ONLINE DISCUSSIONS BASED ON LANGUAGE PATTERNS
    6.
    发明申请
    IDENTIFICATION OF TOPICS FOR ONLINE DISCUSSIONS BASED ON LANGUAGE PATTERNS 有权
    基于语言模式的在线讨论主题的识别

    公开(公告)号:US20080313180A1

    公开(公告)日:2008-12-18

    申请号:US11763282

    申请日:2007-06-14

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30731 G06Q30/02

    摘要: A topic identification system identifies topics of online discussions by iteratively identifying topic words or keywords of the online discussions and identifying language patterns associated with those keywords. The topic identification system starts out with an initial set of keywords and identifies language patterns that each include a keyword. The topic identification system then uses the identified language patterns to identify additional keywords of the online discussion that match the patterns. The topic identification system then again identifies language patterns using the keywords including the newly identified keywords. The topic identification system may repeat the process of identifying language patterns and keywords until a termination criterion is satisfied.

    摘要翻译: 主题识别系统通过迭代地识别在线讨论的主题或关键字并识别与这些关键字相关联的语言模式来识别在线讨论的主题。 主题识别系统以一组初始关键字开始,并识别每个关键字的语言模式。 然后,主题识别系统使用所识别的语言模式来识别与模式匹配的在线讨论的附加关键字。 然后,主题识别系统再次使用包括新确定的关键字的关键字来识别语言模式。 主题识别系统可以重复识别语言模式和关键字的过程,直到满足终止标准。

    KEYWORD USAGE SCORE BASED ON FREQUENCY IMPULSE AND FREQUENCY WEIGHT
    7.
    发明申请
    KEYWORD USAGE SCORE BASED ON FREQUENCY IMPULSE AND FREQUENCY WEIGHT 失效
    基于频率和频率的关键字使用分数

    公开(公告)号:US20080301117A1

    公开(公告)日:2008-12-04

    申请号:US11756740

    申请日:2007-06-01

    IPC分类号: G06F7/76 G06F17/30

    摘要: A method and system for assessing keyword usage based on frequency of usage of the keywords during various periods is provided. A keyword usage measurement system is provided with the frequency of keywords during various periods. The measurement system then calculates a recent usage score for a keyword by combining a frequency impulse score for the keyword with a frequency weight for the keyword. The frequency impulse score for a keyword indicates whether a recent change in the frequency of the keyword has occurred. The frequency weight for a keyword indicates a recent measure of the frequency of the keyword.

    摘要翻译: 提供了一种基于各种期间关键词使用频率来评估关键字使用的方法和系统。 关键字使用测量系统在不同时期提供关键字的频率。 然后,测量系统通过将关键字的频率脉冲得分与该关键字的频率权重组合来计算关键字的最近使用分数。 关键字的频率脉冲得分指示是否发生了关键字的频率的最近的改变。 关键字的频率权重表示最近对关键字频率的度量。

    Keyword usage score based on frequency impulse and frequency weight
    8.
    发明授权
    Keyword usage score based on frequency impulse and frequency weight 失效
    基于频率冲击和频率权重的关键词使用得分

    公开(公告)号:US07644075B2

    公开(公告)日:2010-01-05

    申请号:US11756740

    申请日:2007-06-01

    IPC分类号: G06F17/30

    摘要: A method and system for assessing keyword usage based on frequency of usage of the keywords during various periods is provided. A keyword usage measurement system is provided with the frequency of keywords during various periods. The measurement system then calculates a recent usage score for a keyword by combining a frequency impulse score for the keyword with a frequency weight for the keyword. The frequency impulse score for a keyword indicates whether a recent change in the frequency of the keyword has occurred. The frequency weight for a keyword indicates a recent measure of the frequency of the keyword.

    摘要翻译: 提供了一种基于各种期间关键词使用频率来评估关键字使用的方法和系统。 关键字使用测量系统在不同时期提供关键字的频率。 然后,测量系统通过将关键字的频率脉冲得分与该关键字的频率权重组合来计算关键字的最近使用分数。 关键字的频率脉冲得分指示是否发生了关键字的频率的最近的改变。 关键字的频率权重表示最近对关键字频率的度量。

    Block tracking mechanism for web personalization
    9.
    发明申请
    Block tracking mechanism for web personalization 有权
    网站个性化的块跟踪机制

    公开(公告)号:US20080281834A1

    公开(公告)日:2008-11-13

    申请号:US11801404

    申请日:2007-05-09

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30861

    摘要: Described is a technology by which blocks of web pages may be selected, such as for building a user-personalized web page containing selected blocks. A selection mechanism, such as a browser toolbar add-on, provides a user interface for selecting blocks, and records information about selected blocks. A block tracking mechanism (e.g., a daemon program) uses the information to locate selected blocks of the web pages, including when the web page containing the block is updated with respect to content and/or layout. The block tracking mechanism may update a local gadget that when invoked, such as by browsing to a particular web page, which shows updated versions of the block on a personalized web page. Blocks may be efficiently located by processing trees representing web pages into reduced trees, and then by performing a minimum distance mapping algorithm on the reduced trees.

    摘要翻译: 描述了可以选择网页块的技术,诸如用于构建包含所选块的用户个性化网页。 诸如浏览器工具栏附件的选择机制提供用于选择块的用户界面,并且记录关于所选块的信息。 块跟踪机制(例如,守护程序)使用该信息来定位网页的所选块,包括当包含块的网页相对于内容和/或布局被更新时。 块跟踪机制可以更新当调用时​​的本地小工具,诸如通过浏览到特定网页,其显示个性化网页上块的更新版本。 可以通过将表示网页的树处理成缩小的树,然后通过在缩小的树上执行最小距离映射算法来有效地定位块。

    Efficient retrieval algorithm by query term discrimination
    10.
    发明授权
    Efficient retrieval algorithm by query term discrimination 有权
    通过查询词辨别的有效检索算法

    公开(公告)号:US07925644B2

    公开(公告)日:2011-04-12

    申请号:US12038652

    申请日:2008-02-27

    IPC分类号: G06F17/30 G06F7/00

    CPC分类号: G06F17/30675 G06Q10/10

    摘要: A method and system for use in information retrieval includes, for each of a plurality of terms, selecting a predetermined number of top scoring documents for the term to form a corresponding document set for the term. When a plurality of terms are received, optionally as a query, the system ranks, using an inverse document frequency algorithm, the plurality of terms for importance based on the document sets for the plurality of terms. Then a number of ranked terms are selected based on importance and a union set is formed based on the document sets associated with the selected number of ranked terms.

    摘要翻译: 用于信息检索的方法和系统包括对于多个术语中的每一个,为术语选择预定数量的最高评分文档以形成用于该术语的相应文档集合。 当接收到多个术语时,可选地作为查询,系统使用逆文档频率算法基于多个术语的文档集来排列多个重要术语。 然后,基于重要性选择多个排名项,并且基于与所选择的排序项数相关联的文档集合形成联合集合。