RETRIEVAL OF STRUCTURED DOCUMENTS
    72.
    Patent application
    Status: In force

    Publication number: US20060155690A1

    Publication date: 2006-07-13

    Application number: US11277344

    Filing date: 2006-03-23

    Applicants: Ji-Rong Wen, Hang Cui

    Inventors: Ji-Rong Wen, Hang Cui

    IPC classification: G06F17/30

    Abstract: This disclosure relates to querying a database containing a plurality of structured documents for a search term. Structured documents that do not include the search term are filtered out during an initial search. Matched structured documents, that is, those that do contain the search term, are evaluated by ranking their individual elements based on how well each element matches the search term, and the ranking is indicated to the user so that the individual elements can be accessed by the user.
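
The two stages in this abstract (filter out documents without the term, then rank the elements of the remaining documents) can be illustrated with a minimal Python sketch. It is not the patented method: the dictionary representation of a document, the function names `count_term` and `search_elements`, and the use of a raw term count as the match score are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical representation: a structured document maps element names to text.
Document = Dict[str, str]


def count_term(text: str, term: str) -> int:
    """Count whole-word, case-insensitive occurrences of term in text."""
    return text.lower().split().count(term.lower())


def search_elements(docs: Dict[str, Document], term: str) -> List[Tuple[str, str, int]]:
    """Return (doc_id, element_name, score) triples, best match first."""
    ranked = []
    for doc_id, elements in docs.items():
        # Initial search: drop documents that never mention the term.
        if not any(count_term(text, term) for text in elements.values()):
            continue
        # Rank the individual elements of each matched document.
        for name, text in elements.items():
            score = count_term(text, term)  # assumed scoring: raw term count
            if score > 0:
                ranked.append((doc_id, name, score))
    # Highest score first; ties broken by document id and element name.
    ranked.sort(key=lambda r: (-r[2], r[0], r[1]))
    return ranked
```

Calling `search_elements(docs, "xml")` returns element-level hits rather than whole documents, which is the point the abstract makes about presenting ranked elements to the user.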

    Method and system for troubleshooting a misconfiguration of a computer system based on product support services information
    74.
    Patent application
    Status: In force

    Publication number: US20060025962A1

    Publication date: 2006-02-02

    Application number: US10899939

    Filing date: 2004-07-27

    IPC classification: G06F15/00

    CPC classification: G06Q10/10

    Abstract: A method and system for ranking possible causes of a component exhibiting a certain behavior is provided. In one embodiment, a troubleshooting system ranks candidate configuration parameters that may be causing a software application to exhibit an undesired behavior using support information relating to problems resulting from the settings of configuration parameters. The support information may be collected from problem reports generated by product support services personnel when troubleshooting problems that users encounter with the application. Based on analysis of the support information, the troubleshooting system ranks the candidate configuration parameters by how likely each is to be causing the application to exhibit the undesired behavior.

    Method and system for indexing and searching databases
    75.
    Patent application
    Status: In force

    Publication number: US20050256865A1

    Publication date: 2005-11-17

    Application number: US10846776

    Filing date: 2004-05-14

    IPC classification: G06F17/30 G06F7/00

    Abstract: A search system generates an index for databases by generatively sampling the databases and uses that index to identify and formulate queries for searching the databases. The generated index is referred to as a domain-attribute index and contains a domain-level index and site-level indexes. A site-level index for a database maps site attributes to distinct attribute values within the database. The domain-level index for a domain maps attribute values to database and site attribute pairs that contain those attribute values. To generate a site-level index for a database within a certain domain, the search system starts out with an initial set of the sample data for that domain. The search system generates sampling queries based on the sample data and submits the sampling queries to a database. The search system updates the site-level index based on the sampling results and uses the results to generate more sampling queries.
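
The sampling loop and the two index levels described in this abstract can be sketched in Python as follows. This is a simplified illustration, not the patented system: the `query_db` callback, the `max_queries` cap, the record format, and both function names are hypothetical, and every newly seen attribute value is assumed to be usable as the next sampling query.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, Set, Tuple

Record = Dict[str, str]  # assumed record format: site attribute -> attribute value


def build_site_index(
    query_db: Callable[[str], Iterable[Record]],  # hypothetical keyword-search interface
    seeds: Iterable[str],
    max_queries: int = 100,
) -> Dict[str, Set[str]]:
    """Site-level index: site attribute -> distinct values seen while sampling."""
    site_index: Dict[str, Set[str]] = defaultdict(set)
    pending = list(seeds)  # initial sample data for the domain
    seen = set(pending)
    issued = 0
    while pending and issued < max_queries:
        value = pending.pop(0)
        issued += 1
        for record in query_db(value):  # submit one sampling query
            for attr, val in record.items():
                site_index[attr].add(val)  # update the site-level index
                if val not in seen:  # new values become further sampling queries
                    seen.add(val)
                    pending.append(val)
    return dict(site_index)


def build_domain_index(
    site_indexes: Dict[str, Dict[str, Set[str]]]
) -> Dict[str, Set[Tuple[str, str]]]:
    """Domain-level index: attribute value -> {(database, site attribute)} pairs."""
    domain_index: Dict[str, Set[Tuple[str, str]]] = defaultdict(set)
    for db_name, site_index in site_indexes.items():
        for attr, values in site_index.items():
            for val in values:
                domain_index[val].add((db_name, attr))
    return dict(domain_index)
```

Looking up a value in the domain-level index then tells a caller which databases, and which attribute in each, are worth querying for that value.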

    Query Reformulation Using Post-Execution Results Analysis
    76.
    Patent application
    Status: Under examination (published)

    Publication number: US20130086024A1

    Publication date: 2013-04-04

    Application number: US13248894

    Filing date: 2011-09-29

    IPC classification: G06F17/30

    CPC classification: G06F16/951 G06F16/3338

    Abstract: Systems, methods, devices, and media are described to facilitate the training and employing of a three-class classifier for post-execution search query reformulation. In some embodiments, the classifier is trained through a supervised learning process, based on a training set of queries mined from a query log. Query reformulation candidates are determined for each query in the training set, and searches are performed using each reformulation candidate and the un-reformulated training query. The resulting document lists are analyzed to determine ranking and topic drift features, and to calculate a quality classification. The features and classification for each reformulation candidate are used to train the classifier in an offline mode. In some embodiments, the classifier is employed in an online mode to dynamically perform query reformulation on user-submitted queries.

    Retrieval of structured documents
    77.
    Granted patent
    Status: In force

    Publication number: US08046370B2

    Publication date: 2011-10-25

    Application number: US12211793

    Filing date: 2008-09-16

    Applicants: Ji-Rong Wen, Hang Cui

    Inventors: Ji-Rong Wen, Hang Cui

    IPC classification: G06F7/00 G06F17/30

    Abstract: This disclosure relates to querying a database containing a plurality of structured documents for a search term. Structured documents that do not include the search term are filtered out during an initial search. Matched structured documents, that is, those that do contain the search term, are evaluated by ranking their individual elements based on how well each element matches the search term, and the ranking is indicated to the user so that the individual elements can be accessed by the user.

    Using Anchor Text With Hyperlink Structures for Web Searches
    78.
    Patent application
    Status: In force

    Publication number: US20110238644A1

    Publication date: 2011-09-29

    Application number: US12748903

    Filing date: 2010-03-29

    IPC classification: G06F3/14 G06F17/30

    CPC classification: G06F17/30887

    Abstract: This document describes tools for adjusting anchor text weight to provide more relevant search engine results. Specifically, these tools take advantage of a site-relationship model to consider relationships not only between an anchor text source site and a destination page but also relationships between multiple anchor text source sites to improve web searches. Consideration of these relationships aids in determining a new anchor text weight, which in turn results in more relevant search results.

    Interactive System for Extracting Data from a Website
    79.
    Patent application
    Status: Under examination (published)

    Publication number: US20110191381A1

    Publication date: 2011-08-04

    Application number: US12696061

    Filing date: 2010-01-29

    IPC classification: G06F17/30

    CPC classification: G06F16/00

    Abstract: Described is a technology for efficiently labeling a webpage. A wrapper tool labels records of a webpage at the record level. If an existing wrapper is appropriate for labeling a record, the wrapper tool automatically labels that record. For unlabeled records, the tool provides a user interface to label those records, and updates the set of existing wrappers with a new wrapper that is generated based upon the labeling operation; the new wrapper is then applied to any unlabeled records if appropriate for those records. As a result, a user typically needs to label only relatively few records, with the wrappers generated for those records automatically used to label the other unlabeled records of the webpage.
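
The label-once, reuse-everywhere loop in this abstract can be illustrated with a small Python sketch. It is a toy model under stated assumptions, not the described tool: a record is a tuple of field strings, a "wrapper" is just a tuple of labels keyed by field count, and `ask_user` is a hypothetical callback standing in for the labeling user interface.

```python
from typing import Callable, Dict, List, Tuple

Record = Tuple[str, ...]  # assumed: one extracted record as a tuple of field strings
Labels = Tuple[str, ...]  # assumed wrapper form: one label per field position


def label_page(
    records: List[Record],
    ask_user: Callable[[Record], Labels],  # hypothetical stand-in for the labeling UI
) -> Tuple[List[Dict[str, str]], int]:
    """Label every record; return the labeled records and the number of user prompts."""
    wrappers: Dict[int, Labels] = {}  # toy wrapper set, keyed by number of fields
    labeled: List[Dict[str, str]] = []
    prompts = 0
    for record in records:
        shape = len(record)
        if shape not in wrappers:
            # No existing wrapper fits: the user labels this record once,
            # and a new wrapper is generated from that labeling.
            wrappers[shape] = ask_user(record)
            prompts += 1
        # An appropriate wrapper exists, so the record is labeled automatically.
        labeled.append(dict(zip(wrappers[shape], record)))
    return labeled, prompts
```

The prompt count stays at the number of distinct record layouts rather than the number of records, which mirrors the abstract's claim that only a few records need manual labels.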

    Community mining based on core objects and affiliated objects
    80.
    Granted patent
    Status: Lapsed

    Publication number: US07885960B2

    Publication date: 2011-02-08

    Application number: US10624759

    Filing date: 2003-07-22

    IPC classification: G06F17/30 G06F7/00

    CPC classification: G06F17/30873 G06F17/30864

    Abstract: In community mining based on core objects and affiliated objects, a set of core objects for a community of objects are identified from a plurality of objects. The community is expanded, based on the set of core objects, to include a set of affiliated objects. According to one aspect, a model of a community of objects is obtained by grouping a first collection of a plurality of objects into a center portion, and grouping a second collection of the plurality of objects into one or more concentric portions around the center portion. The groupings of the first and second collections of the objects are identified as the community of objects.
