Block tracking mechanism for web personalization
    1.
    Granted patent
    Status: Active

    Publication No.: US07818330B2

    Publication date: 2010-10-19

    Application No.: US11801404

    Filing date: 2007-05-09

    IPC classification: G06F7/00

    CPC classification: G06F17/30861

    Abstract: Described is a technology by which blocks of web pages may be selected, such as for building a user-personalized web page containing the selected blocks. A selection mechanism, such as a browser toolbar add-on, provides a user interface for selecting blocks and records information about the selected blocks. A block tracking mechanism (e.g., a daemon program) uses the information to locate the selected blocks of the web pages, including when the web page containing a block is updated with respect to content and/or layout. The block tracking mechanism may update a local gadget that, when invoked, such as by browsing to a particular web page, shows updated versions of the block on a personalized web page. Blocks may be efficiently located by processing trees representing web pages into reduced trees, and then by performing a minimum distance mapping algorithm on the reduced trees.
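
    Illustrative sketch (not taken from the patent): the abstract names two steps, reducing the page trees and then running a minimum distance mapping on the reduced trees. The Python below shows one way those steps could fit together. The `Node` class, the `NOISE_TAGS` set, and the simplified top-down edit distance are all assumptions made for illustration; the patent's actual reduction rules and distance measure are not given in this listing.

```python
from dataclasses import dataclass, field
from typing import Iterator, List

# Hypothetical set of tags treated as layout noise and spliced out of the tree.
NOISE_TAGS = {"span", "b", "i", "font", "br"}


@dataclass
class Node:
    tag: str
    children: List["Node"] = field(default_factory=list)


def reduce_tree(node: Node) -> Node:
    """Copy the tree, promoting the children of noise nodes to their parent."""
    kept: List[Node] = []
    for child in node.children:
        reduced = reduce_tree(child)
        if reduced.tag in NOISE_TAGS:
            kept.extend(reduced.children)
        else:
            kept.append(reduced)
    return Node(node.tag, kept)


def size(node: Node) -> int:
    """Number of nodes in the subtree."""
    return 1 + sum(size(c) for c in node.children)


def distance(a: Node, b: Node) -> int:
    """Simplified top-down edit distance between two ordered trees."""
    cost = 0 if a.tag == b.tag else 1
    m, n = len(a.children), len(b.children)
    # Align the two child sequences; deleting or inserting a child costs its size.
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        dp[i][0] = dp[i - 1][0] + size(a.children[i - 1])
    for j in range(1, n + 1):
        dp[0][j] = dp[0][j - 1] + size(b.children[j - 1])
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            dp[i][j] = min(
                dp[i - 1][j] + size(a.children[i - 1]),
                dp[i][j - 1] + size(b.children[j - 1]),
                dp[i - 1][j - 1] + distance(a.children[i - 1], b.children[j - 1]),
            )
    return cost + dp[m][n]


def subtrees(node: Node) -> Iterator[Node]:
    """Yield every subtree, root first."""
    yield node
    for child in node.children:
        yield from subtrees(child)


def locate_block(recorded: Node, page: Node) -> Node:
    """Return the subtree of the reduced page closest to the recorded block."""
    target = reduce_tree(recorded)
    return min(subtrees(reduce_tree(page)), key=lambda cand: distance(target, cand))
```

    In this sketch, a block recorded as `div[h2, ul[li, li]]` would still be found after the page wraps the list in a `span` and adds a third `li`, because the wrapper is removed during reduction and the extra item adds only a small distance.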


    Creating home pages based on user-selected information of web pages
    2.
    Granted patent
    Status: Active

    Publication No.: US07594013B2

    Publication date: 2009-09-22

    Application No.: US11136029

    Filing date: 2005-05-24

    IPC classification: G06F15/173

    CPC classification: G06F17/3089

    Abstract: A method of creating a personal home page containing information of interest assembled from various web sites. The method includes partitioning web pages into web blocks. Users may collect web blocks from different web pages and use those web blocks to define a dynamic personal home page. In addition, the web blocks may be tracked so that content in the personal home page is updated based on corresponding changes in the original web page.


    Ranking online advertisements using retailer and product reputations
    3.
    Patent application
    Status: Under examination (published)

    Publication No.: US20080288348A1

    Publication date: 2008-11-20

    Application No.: US11803461

    Filing date: 2007-05-15

    IPC classification: G06Q30/00

    Abstract: A method for ranking online advertisements using retailer reputation and product reputation. In one implementation, a query may be received. Advertisements may be selected by determining a level of relevance between the query and each advertisement and selecting the advertisements whose level of relevance is above a predetermined level. A predicted reputation for a retailer and a predicted reputation for a product may be retrieved for each of the selected advertisements. The selected advertisements may then be ranked based on the predicted reputation of the retailer and the predicted reputation of the product. The ranking may be accomplished by calculating a ranking score for each selected advertisement based on the retailer's predicted reputation and the product's predicted reputation. The selected advertisements may then be displayed according to the ranking.
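
    Illustrative sketch (not taken from the patent): the select-then-rank flow in the abstract can be written in a few lines of Python. The abstract does not say how the two reputations are combined into a ranking score, so the weighted sum below, the `retailer_weight` parameter, the `Ad` fields, and the default threshold are all assumptions.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Ad:
    ad_id: str
    relevance: float            # relevance to the query, assumed precomputed
    retailer_reputation: float  # predicted reputation of the retailer
    product_reputation: float   # predicted reputation of the product


def rank_ads(ads: List[Ad], min_relevance: float = 0.5,
             retailer_weight: float = 0.5) -> List[Ad]:
    """Keep ads above the relevance threshold, then sort by a reputation score."""
    selected = [ad for ad in ads if ad.relevance > min_relevance]

    def score(ad: Ad) -> float:
        # Hypothetical scoring rule: a weighted sum of the two reputations.
        return (retailer_weight * ad.retailer_reputation
                + (1.0 - retailer_weight) * ad.product_reputation)

    return sorted(selected, key=score, reverse=True)
```

    Note that relevance acts only as a filter in this sketch; once an advertisement passes the threshold, its position depends on reputation alone.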


    Scalable probabilistic latent semantic analysis
    5.
    Patent application
    Status: Active

    Publication No.: US20070239431A1

    Publication date: 2007-10-11

    Application No.: US11392763

    Filing date: 2006-03-30

    IPC classification: G06F17/27

    CPC classification: G06F17/2785

    Abstract: A scalable two-pass probabilistic latent semantic analysis (PLSA) methodology is disclosed that may perform more efficiently, and in some cases more accurately, than traditional PLSA, especially where large and/or sparse data sets are provided for analysis. The improved methodology can greatly reduce the storage and/or computational costs of training a PLSA model. In the first pass of the two-pass methodology, objects are clustered into groups, and PLSA is performed on the groups instead of the original individual objects. In the second pass, the conditional probability of a latent class, given an object, is obtained. This may be done by extending the training results of the first pass. During the second pass, the most likely latent classes for each object are identified.
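
    Illustrative sketch (not taken from the patent): the two passes could be arranged as below in plain Python. Several things are assumed: the group assignment is supplied as input (the clustering method is not given in this listing), the first pass uses a textbook EM fit of PLSA on the merged group counts, and the second pass "extends" those results by folding each object in against the fixed word distributions. The function names, iteration counts, and smoothing constants are hypothetical.

```python
import random
from typing import Dict, Hashable, List, Sequence

Counts = Dict[str, int]  # word -> count for one object (or one group)


def normalize(values: Sequence[float]) -> List[float]:
    total = sum(values)
    return [v / total for v in values]


def fit_plsa(rows: List[Counts], vocab: List[str], k: int,
             iters: int = 50, seed: int = 0) -> List[Dict[str, float]]:
    """Fit P(w|z) with EM on the given rows; returns one word distribution per class."""
    rng = random.Random(seed)
    p_w_z = [dict(zip(vocab, normalize([rng.random() + 0.1 for _ in vocab])))
             for _ in range(k)]
    p_z_d = [normalize([rng.random() + 0.1 for _ in range(k)]) for _ in rows]
    for _ in range(iters):
        new_w = [{w: 1e-9 for w in vocab} for _ in range(k)]  # small smoothing
        for d, row in enumerate(rows):
            new_z = [1e-9] * k
            for w, n in row.items():
                # E-step: posterior P(z | d, w)
                post = normalize([p_w_z[z][w] * p_z_d[d][z] for z in range(k)])
                for z in range(k):
                    new_w[z][w] += n * post[z]
                    new_z[z] += n * post[z]
            p_z_d[d] = normalize(new_z)  # M-step for P(z|d)
        # M-step for P(w|z)
        p_w_z = [dict(zip(vocab, normalize([nw[w] for w in vocab]))) for nw in new_w]
    return p_w_z


def fold_in(row: Counts, p_w_z: List[Dict[str, float]], iters: int = 20) -> List[float]:
    """Second pass: estimate P(z | object) while keeping P(w|z) fixed."""
    k = len(p_w_z)
    p_z = [1.0 / k] * k
    for _ in range(iters):
        new_z = [1e-9] * k
        for w, n in row.items():
            post = normalize([p_w_z[z][w] * p_z[z] for z in range(k)])
            for z in range(k):
                new_z[z] += n * post[z]
        p_z = normalize(new_z)
    return p_z


def two_pass_plsa(objects: List[Counts], groups: List[Hashable], k: int) -> List[List[float]]:
    """Pass 1: PLSA on merged group counts. Pass 2: fold in each object."""
    vocab = sorted({w for obj in objects for w in obj})
    merged: Dict[Hashable, Counts] = {}
    for obj, g in zip(objects, groups):
        row = merged.setdefault(g, {})
        for w, n in obj.items():
            row[w] = row.get(w, 0) + n
    p_w_z = fit_plsa(list(merged.values()), vocab, k)
    return [fold_in(obj, p_w_z) for obj in objects]


def most_likely_class(p_z: List[float]) -> int:
    """Index of the latent class with the highest probability."""
    return max(range(len(p_z)), key=p_z.__getitem__)
```

    The saving in this arrangement comes from the first pass: EM runs over one row per group rather than one row per object, and the per-object second pass only updates a length-`k` vector.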


    Efficient retrieval algorithm by query term discrimination
    6.
    Granted patent
    Status: Active

    Publication No.: US07925644B2

    Publication date: 2011-04-12

    Application No.: US12038652

    Filing date: 2008-02-27

    IPC classification: G06F17/30 G06F7/00

    CPC classification: G06F17/30675 G06Q10/10

    Abstract: A method and system for use in information retrieval include, for each of a plurality of terms, selecting a predetermined number of top-scoring documents for the term to form a corresponding document set for the term. When a plurality of terms is received, optionally as a query, the system ranks the terms by importance, using an inverse document frequency algorithm, based on the document sets for those terms. A number of the ranked terms are then selected based on importance, and a union set is formed from the document sets associated with the selected terms.
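
    Illustrative sketch (not taken from the patent): the three steps in the abstract (per-term document sets, IDF ranking of query terms, union of the sets for the top terms) might look like this in Python. The precomputed `scores` table, the separate `doc_freq` statistics, and the `log(N / df)` form of IDF are assumptions; the listing does not state how document frequency is derived from the document sets.

```python
import math
from typing import Dict, List, Set


def build_document_sets(scores: Dict[str, Dict[str, float]],
                        top_k: int) -> Dict[str, Set[str]]:
    """For each term, keep the top_k highest-scoring document ids."""
    return {
        term: set(sorted(docs, key=docs.get, reverse=True)[:top_k])
        for term, docs in scores.items()
    }


def candidate_documents(query_terms: List[str],
                        doc_sets: Dict[str, Set[str]],
                        doc_freq: Dict[str, int],
                        total_docs: int,
                        num_terms: int = 2) -> Set[str]:
    """Rank query terms by IDF, keep the top num_terms, and union their document sets."""
    known = [t for t in query_terms if t in doc_sets and doc_freq.get(t, 0) > 0]
    ranked = sorted(known,
                    key=lambda t: math.log(total_docs / doc_freq[t]),
                    reverse=True)
    union: Set[str] = set()
    for term in ranked[:num_terms]:
        union |= doc_sets[term]
    return union
```

    Because only the rarest query terms contribute document sets, a common term such as "cheap" in a query like "cheap flights tokyo" would add no candidates of its own in this sketch.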


    Scalable probabilistic latent semantic analysis
    7.
    Granted patent
    Status: Active

    Publication No.: US07844449B2

    Publication date: 2010-11-30

    Application No.: US11392763

    Filing date: 2006-03-30

    IPC classification: G06F17/27

    CPC classification: G06F17/2785

    Abstract: A scalable two-pass probabilistic latent semantic analysis (PLSA) methodology is disclosed that may perform more efficiently, and in some cases more accurately, than traditional PLSA, especially where large and/or sparse data sets are provided for analysis. The improved methodology can greatly reduce the storage and/or computational costs of training a PLSA model. In the first pass of the two-pass methodology, objects are clustered into groups, and PLSA is performed on the groups instead of the original individual objects. In the second pass, the conditional probability of a latent class, given an object, is obtained. This may be done by extending the training results of the first pass. During the second pass, the most likely latent classes for each object are identified.


    Efficient retrieval algorithm by query term discrimination
    8.
    Granted patent
    Status: Active

    Publication No.: US07822752B2

    Publication date: 2010-10-26

    Application No.: US11804627

    Filing date: 2007-05-18

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F17/30675

    Abstract: Described is an efficient retrieval mechanism that quickly locates documents (e.g., corresponding to online advertisements) based on query term discrimination. A topmost subset (e.g., two) of the search terms is selected according to ranked importance, e.g., as ranked by inverse document frequency. The topmost terms are then used to narrow the number of rows of an inverted query index that are searched to find document identifiers and associated scores, such as scores computed offline by a BM25 algorithm. For example, for each document identifier of each important term, a fast search within each row of the narrowed subset (rows that also contain that document identifier) may be performed by comparing document identifiers to jump a pointer within each other row, followed by a binary search to locate a particular document. The scores of the set of particular documents may then be used to rank their relative importance for returning as results.
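
    Illustrative sketch (not taken from the patent): the "jump a pointer, then binary search" lookup can be approximated with a galloping search over sorted posting rows. This is a stand-in technique, not necessarily the patent's pointer scheme. The row layout (parallel lists of sorted document ids and precomputed scores), the choice of the shortest row as the driver, the requirement that a document appear in every selected row, and the summing of scores are all assumptions.

```python
from bisect import bisect_left
from typing import Dict, List, Tuple

Row = Tuple[List[int], List[float]]  # (sorted document ids, matching scores)


def gallop_find(ids: List[int], target: int, start: int) -> int:
    """Index of the first id >= target at or after start.

    The pointer jumps ahead in doubling steps, then a binary search
    finishes inside the last jumped-over window.
    """
    lo = hi = start
    step = 1
    while hi < len(ids) and ids[hi] < target:
        lo = hi + 1
        hi += step
        step *= 2
    return bisect_left(ids, target, lo, min(hi, len(ids)))


def intersect_and_rank(rows: Dict[str, Row],
                       important_terms: List[str]) -> List[Tuple[int, float]]:
    """Find documents present in every selected row and rank them by summed score."""
    driver, *others = sorted(important_terms, key=lambda t: len(rows[t][0]))
    pointers = {t: 0 for t in others}
    results: List[Tuple[int, float]] = []
    driver_ids, driver_scores = rows[driver]
    for doc_id, score in zip(driver_ids, driver_scores):
        total = score
        for term in others:
            ids, scores = rows[term]
            i = gallop_find(ids, doc_id, pointers[term])
            pointers[term] = i  # never move backwards: ids are visited in order
            if i == len(ids) or ids[i] != doc_id:
                break
            total += scores[i]
        else:
            results.append((doc_id, total))
    return sorted(results, key=lambda r: r[1], reverse=True)
```

    Since each row's pointer only advances, a full pass over the driver row touches every other row at most once from left to right, with the doubling steps skipping long runs of non-matching identifiers.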


    Adaptive grouping in a file network
    9.
    Granted patent
    Status: Active

    Publication No.: US07634471B2

    Publication date: 2009-12-15

    Application No.: US11392760

    Filing date: 2006-03-30

    IPC classification: G06F17/30

    Abstract: Extraction of semantic information and generation of semantic attributes allow for improved organization and management of data. Semantic attributes are generated automatically, eliminating the need for manual entry of attribute information. A semantic file network may further be constructed from similarities between files, computed from the semantic attribute information. Semantic links representing a semantic relationship may be built between similar or relevant files. In addition, user operations and user operation patterns may also be considered in building the file network. Semantic attributes and information may further facilitate browsing the file system as well as improve the accuracy and speed of queries.
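
    Illustrative sketch (not taken from the patent): one simple way to build semantic links from attribute similarity is to compare the attribute sets of each pair of files. The use of Jaccard similarity, the `threshold` value, and the representation of attributes as sets of keywords are assumptions; the listing does not say which similarity measure is used, and user operation patterns are left out of this sketch.

```python
from itertools import combinations
from typing import Dict, List, Set, Tuple


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Share of attributes the two sets have in common."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def build_semantic_links(attributes: Dict[str, Set[str]],
                         threshold: float = 0.3) -> List[Tuple[str, str, float]]:
    """Link every pair of files whose attribute similarity reaches the threshold."""
    links: List[Tuple[str, str, float]] = []
    for f1, f2 in combinations(sorted(attributes), 2):
        sim = jaccard(attributes[f1], attributes[f2])
        if sim >= threshold:
            links.append((f1, f2, sim))
    return links
```

    Comparing all pairs is quadratic in the number of files, so a real system would likely need an index or a blocking step; the sketch is meant only to show how attribute similarity turns into links.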


    WEBPAGE BLOCK TRACKING GADGET
    10.
    Patent application
    Status: Under examination (published)

    Publication No.: US20080215997A1

    Publication date: 2008-09-04

    Application No.: US12038687

    Filing date: 2008-02-27

    IPC classification: G06F3/048

    CPC classification: G06F3/0481

    Abstract: An exemplary web browser system includes a selection module for selecting a webpage block and recording information about the selected webpage block; a tracking module for tracking changes to a selected webpage block based at least in part on the recorded information for that webpage block; and a display module for displaying a selected webpage block, wherein the tracking module updates the display module as to changes to the selected webpage block. Various other exemplary systems, methods, and devices are also disclosed.
