Method and system for identifying an author of a paper
    61.
    Invention application
    Status: Pending (published)

    Publication No.: US20060059121A1

    Publication date: 2006-03-16

    Application No.: US10930617

    Filing date: 2004-08-31

    IPC classification: G06F17/30 G06K9/62

    CPC classification: G06F16/313 G06F16/38

    Abstract: A system that identifies a person associated with a document is provided. The system retrieves a name associated with a document and reduces the name to a canonical form. The system then compares the canonical form of the name to the canonical form of the names of known persons. If a match is not found, then the system indicates that the person whose name is associated with the document is a previously unknown person. If a match is found, then the system compares attributes of the document with attributes of documents associated with the matching known person. If those attributes are similar, then the system indicates that the person whose name is associated with the document is the matching known person. Otherwise, the system indicates that the person whose name is associated with the document is a previously unknown person.
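
The decision flow in this abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the canonical form used here (lowercase surname plus first initial), the keyword-set attributes, the Jaccard similarity measure, and the `threshold` value are all hypothetical choices, since the abstract does not specify any of them.

```python
import re

def canonical_name(name):
    """Reduce a name to an assumed canonical form: 'surname first-initial'."""
    name = name.strip()
    if "," in name:                          # "Surname, Given" form
        surname, given = [p.strip() for p in name.split(",", 1)]
    else:                                    # "Given [Middle] Surname" form
        parts = name.split() or [""]
        surname, given = parts[-1], " ".join(parts[:-1])
    surname = re.sub(r"[^a-z]", "", surname.lower())
    initial = re.sub(r"[^a-z]", "", given.lower())[:1]
    return f"{surname} {initial}".strip()

def jaccard(a, b):
    """Set overlap in [0, 1]; an assumed stand-in for attribute similarity."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if a | b else 0.0

def identify_author(doc, known_persons, threshold=0.2):
    """Return the id of the matching known person, or None if previously unknown."""
    key = canonical_name(doc["author"])
    for person_id, person in known_persons.items():
        if canonical_name(person["name"]) != key:
            continue                         # canonical names differ: no match
        # Names match: compare document attributes with the person's documents.
        if jaccard(doc["keywords"], person["keywords"]) >= threshold:
            return person_id
    return None                              # treated as a previously unknown person
```

A document by "John A. Smith" about web search would resolve to a known "Smith, John" who writes about search, while a "J. Smith" paper with unrelated keywords would be treated as a new person.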

    Method and system for summarizing a document
    62.
    Invention application
    Status: In force

    Publication No.: US20060036596A1

    Publication date: 2006-02-16

    Application No.: US10918242

    Filing date: 2004-08-13

    IPC classification: G06F17/30

    CPC classification: G06F17/30705 G06F17/30719

    Abstract: A method and system for calculating the significance of a sentence within a document is provided. The summarization system calculates the significance of the sentences of a document and selects the most significant sentences as the summary of the document. The summarization system calculates the significance of a sentence based on the "important" words of the document that are contained within the sentence. The summarization system calculates the importance of words of the document using various scoring techniques and then combines the scores to classify a word as important or not important. The summarization system can then be used to identify significant sentences of the document based on the important words that a sentence contains and select significant sentences as a summary of the document.
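
As a rough illustration of this pipeline, the sketch below scores words with two simple techniques, combines the scores, labels the top-scoring words as important, and ranks sentences by how many important words they contain. The two scoring techniques (term frequency and earliest position), the 0.7/0.3 weights, the `top_fraction` cutoff, and the stopword list are assumptions made for the example; the abstract only says that "various scoring techniques" are combined.

```python
import re
from collections import Counter

STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "was", "it",
             "for", "on", "that", "this", "with", "as", "by"}

def split_sentences(text):
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

def tokenize(sentence):
    return re.findall(r"[a-z']+", sentence.lower())

def important_words(sentences, top_fraction=0.3):
    """Combine two illustrative scores and label the top-scoring words important."""
    tokens = [[w for w in tokenize(s) if w not in STOPWORDS] for s in sentences]
    freq = Counter(w for toks in tokens for w in toks)
    if not freq:
        return set()
    max_freq = max(freq.values())
    first_seen = {}
    for i, toks in enumerate(tokens):
        for w in toks:
            first_seen.setdefault(w, i)      # index of first sentence using w
    n = len(sentences)
    combined = {w: 0.7 * freq[w] / max_freq + 0.3 * (1 - first_seen[w] / n)
                for w in freq}
    k = max(1, int(len(combined) * top_fraction))
    ranked = sorted(combined, key=lambda w: (-combined[w], w))
    return set(ranked[:k])

def summarize(text, num_sentences=1):
    """Pick the sentences containing the most important words, in document order."""
    sentences = split_sentences(text)
    important = important_words(sentences)

    def significance(s):
        return sum(1 for w in tokenize(s) if w in important)

    top = sorted(range(len(sentences)),
                 key=lambda i: (-significance(sentences[i]), i))[:num_sentences]
    return [sentences[i] for i in sorted(top)]
```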

    Method and system for classifying display pages using summaries
    64.
    Invention application
    Status: In force

    Publication No.: US20050246410A1

    Publication date: 2005-11-03

    Application No.: US10836319

    Filing date: 2004-04-30

    IPC classification: G06F17/30 G06F15/16

    CPC classification: G06F17/30719 G06F17/30864

    Abstract: A method and system for classifying display pages based on automatically generated summaries of display pages. A web page classification system uses a web page summarization system to generate summaries of web pages. The summary of a web page may include the sentences of the web page that are most closely related to the primary topic of the web page. The summarization system may combine the benefits of multiple summarization techniques to identify the sentences of a web page that represent the primary topic of the web page. Once the summary is generated, the classification system may apply conventional classification techniques to the summary to classify the web page. The classification system may use conventional classification techniques such as a Naïve Bayesian classifier or a support vector machine to identify the classifications of a web page based on the summary generated by the summarization system.
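
The summarize-then-classify idea can be sketched as follows. The summarizer here is a deliberately crude stand-in (it keeps the sentence whose words are most frequent on the page), and the classifier is a small multinomial Naive Bayes with add-one smoothing written from scratch. Both are assumptions for illustration; the abstract names Naive Bayes and support vector machines only as examples of conventional classifiers and does not describe the summarizer in this detail.

```python
import math
import re
from collections import Counter, defaultdict

def tokenize(text):
    return re.findall(r"[a-z]+", text.lower())

def summarize(page_text, num_sentences=1):
    """Stand-in summarizer: keep the sentences richest in frequent page words."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", page_text) if s.strip()]
    freq = Counter(tokenize(page_text))
    ranked = sorted(sentences, key=lambda s: -sum(freq[w] for w in tokenize(s)))
    return " ".join(ranked[:num_sentences])

class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def __init__(self):
        self.word_counts = defaultdict(Counter)
        self.class_counts = Counter()
        self.vocab = set()

    def fit(self, texts, labels):
        for text, label in zip(texts, labels):
            words = tokenize(text)
            self.word_counts[label].update(words)
            self.class_counts[label] += 1
            self.vocab.update(words)
        return self

    def predict(self, text):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label in self.class_counts:
            total_words = sum(self.word_counts[label].values())
            score = math.log(self.class_counts[label] / total_docs)
            for w in tokenize(text):
                count = self.word_counts[label][w]
                score += math.log((count + 1) / (total_words + len(self.vocab)))
            if score > best_score:
                best_label, best_score = label, score
        return best_label

def classify_page(page_text, classifier):
    """Classify a page from its summary rather than from its full text."""
    return classifier.predict(summarize(page_text))
```

Classifying the summary instead of the full page means navigation text and other off-topic sentences contribute less to the decision, which is the motivation the abstract gives for the approach.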

    Mining service requests for product support
    65.
    Invention application
    Status: Pending (published)

    Publication No.: US20050234973A1

    Publication date: 2005-10-20

    Application No.: US10826160

    Filing date: 2004-04-15

    CPC classification: G06N5/00 G06N5/02

    Abstract: Systems and methods for mining service requests for product support are described. In one aspect, unstructured service requests are converted to one or more structured answer objects. Each structured answer object includes hierarchically structured historic problem diagnosis data. In view of a product problem description, a set of the one or more structured answer data objects is identified. Each structured solution data object in the set includes term(s) and/or phrase(s) related to the product problem description. Historic and hierarchically structured problem diagnosis data from the set is provided to an end-user for product problem diagnosis.
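
A minimal sketch of the two stages described here, conversion and lookup, is given below. Everything concrete in it is hypothetical: the `Symptom:` / `Cause:` / `Resolution:` labels in the raw request text, the `AnswerObject` fields, and the term-overlap matching are assumptions chosen to make the example runnable, not details taken from the application.

```python
import re
from dataclasses import dataclass, field

@dataclass
class AnswerObject:
    product: str
    diagnosis: dict = field(default_factory=dict)   # section name -> text

# Assumed section labels inside an otherwise free-text service request.
SECTION_RE = re.compile(r"^(Symptom|Cause|Resolution):\s*(.*)$", re.IGNORECASE)

def to_answer_object(product, raw_text):
    """Convert one unstructured service request into a structured answer object."""
    diagnosis = {}
    for line in raw_text.splitlines():
        m = SECTION_RE.match(line.strip())
        if m:
            diagnosis[m.group(1).lower()] = m.group(2)
    return AnswerObject(product, diagnosis)

def terms(text):
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def find_answers(problem_description, answers):
    """Return answer objects sharing terms with the description, best match first."""
    query = terms(problem_description)
    matches = []
    for answer in answers:
        overlap = query & terms(" ".join(answer.diagnosis.values()))
        if overlap:
            matches.append((len(overlap), answer))
    matches.sort(key=lambda m: -m[0])
    return [answer for _, answer in matches]
```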

    Method and system for determining similarity of objects based on heterogeneous relationships
    66.
    Invention application
    Status: Lapsed

    Publication No.: US20050256833A1

    Publication date: 2005-11-17

    Application No.: US10846949

    Filing date: 2004-05-14

    IPC classification: G06F17/30 G06F7/00

    Abstract: A method and system is provided for measuring the similarity of objects based on their relationships with objects of the same type and of different types, and on the similarities of those related objects to other objects. In one embodiment, the similarity system defines intra-type and inter-type similarity functions for each type of object. The similarity system may combine the intra-type and inter-type similarity functions for a certain type into an overall similarity function for that type. After defining the similarity functions, the similarity system collects attribute values for the objects, which may include relationship data between objects of the same type, referred to as intra-type relationships, and relationships between objects of different types, referred to as inter-type relationships. After collecting the attribute values for the objects, the similarity system solves the intra-type and inter-type similarity functions by iteratively calculating the similarities for the objects until the similarities converge on a solution.
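
The iterate-until-convergence step can be illustrated with a SimRank-style fixed-point computation. This is a stand-in, not the application's similarity functions: the averaging form of the intra-type and inter-type terms, the mixing weight `alpha`, the `decay` factor, and the data layout are all assumptions made for the sketch.

```python
from itertools import product

def avg_pair_sim(neigh_a, neigh_b, sim):
    """Average similarity over all pairs drawn from two neighbour sets."""
    if not neigh_a or not neigh_b:
        return 0.0
    total = sum(sim[x][y] for x, y in product(neigh_a, neigh_b))
    return total / (len(neigh_a) * len(neigh_b))

def solve_similarity(objects, intra, inter, alpha=0.5, decay=0.8,
                     tol=1e-6, max_iter=100):
    """Iterate intra-type and inter-type similarities until they converge.

    objects: {type: [ids]}
    intra:   {type: {id: set of related ids of the same type}}
    inter:   {type: (other_type, {id: set of related ids of other_type})}
    """
    sim = {t: {a: {b: 1.0 if a == b else 0.0 for b in ids} for a in ids}
           for t, ids in objects.items()}
    for _ in range(max_iter):
        new_sim, delta = {}, 0.0
        for t, ids in objects.items():
            other_t, rel = inter[t]
            new_sim[t] = {}
            for a in ids:
                new_sim[t][a] = {}
                for b in ids:
                    if a == b:
                        value = 1.0
                    else:
                        s_intra = avg_pair_sim(intra[t].get(a, ()),
                                               intra[t].get(b, ()), sim[t])
                        s_inter = avg_pair_sim(rel.get(a, ()),
                                               rel.get(b, ()), sim[other_t])
                        # Overall similarity: weighted mix of both terms.
                        value = decay * (alpha * s_intra + (1 - alpha) * s_inter)
                    new_sim[t][a][b] = value
                    delta = max(delta, abs(value - sim[t][a][b]))
        sim = new_sim
        if delta < tol:                      # similarities have converged
            break
    return sim
```

With queries and pages as the two object types, two queries that lead to the same page become similar through the inter-type term even though they share no words, which is the kind of effect the abstract describes.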

    Search engine enhancement using mined implicit links
    67.
    Granted invention patent
    Status: In force

    Publication No.: US08312035B2

    Publication date: 2012-11-13

    Application No.: US12505426

    Filing date: 2009-07-17

    IPC classification: G06F7/00 G06F17/30

    Abstract: An implicit links enhancement system and method for search engines that generates implicit links obtained from mining user access logs to facilitate enhanced local searching of web sites and intranets. Embodiments of the implicit links search enhancement system and method include extracting implicit links by mining users' access patterns and then using a modified link analysis algorithm to re-rank search results obtained from traditional search engines. More specifically, embodiments of the method include extracting implicit links from a user access log, generating an implicit links graph from the extracted implicit links, and computing page rankings using the implicit links graph. The implicit links are extracted from the log using a two-item sequential pattern mining technique. Search results obtained from a search engine are re-ranked based on an implicit links analysis performed using an updated implicit links graph, a modified re-ranking formula, and at least one re-ranking technique.
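
One plausible reading of the "two-item sequential pattern mining" step is sketched below: count ordered page pairs that occur close together within a browsing session and keep the pairs that recur often enough. The session format (an ordered list of URLs per user visit), the `window` size, and the `min_support` threshold are assumptions for the example and are not taken from the patent.

```python
from collections import Counter

def extract_implicit_links(sessions, window=2, min_support=2):
    """Mine ordered page pairs (src, dst) that recur across sessions.

    sessions:    list of page sequences, one per user session (assumed format)
    window:      how many following pages may pair with each page
    min_support: minimum number of sessions a pair must appear in
    """
    pair_counts = Counter()
    for pages in sessions:
        seen = set()                         # count each pair once per session
        for i, src in enumerate(pages):
            for dst in pages[i + 1 : i + 1 + window]:
                if src != dst and (src, dst) not in seen:
                    seen.add((src, dst))
                    pair_counts[(src, dst)] += 1
    return {pair: n for pair, n in pair_counts.items() if n >= min_support}
```

The surviving pairs, with their support counts as weights, form the edges of an implicit links graph.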

    SEARCH ENGINE ENHANCEMENT USING MINED IMPLICIT LINKS
    68.
    Invention application
    Status: In force

    Publication No.: US20100023508A1

    Publication date: 2010-01-28

    Application No.: US12505426

    Filing date: 2009-07-17

    IPC classification: G06F17/30 G06F17/00

    Abstract: An implicit links enhancement system and method for search engines that generates implicit links obtained from mining user access logs to facilitate enhanced local searching of web sites and intranets. Embodiments of the implicit links search enhancement system and method include extracting implicit links by mining users' access patterns and then using a modified link analysis algorithm to re-rank search results obtained from traditional search engines. More specifically, embodiments of the method include extracting implicit links from a user access log, generating an implicit links graph from the extracted implicit links, and computing page rankings using the implicit links graph. The implicit links are extracted from the log using a two-item sequential pattern mining technique. Search results obtained from a search engine are re-ranked based on an implicit links analysis performed using an updated implicit links graph, a modified re-ranking formula, and at least one re-ranking technique.

    Implicit links search enhancement system and method for search engines using implicit links generated by mining user access patterns
    69.
    Granted invention patent
    Status: In force

    Publication No.: US07584181B2

    Publication date: 2009-09-01

    Application No.: US10676794

    Filing date: 2003-09-30

    IPC classification: G06F7/00 G06F17/30

    Abstract: An implicit links enhancement system and method for search engines that generates implicit links obtained from mining user access logs to facilitate enhanced local searching of web sites and intranets. The implicit links search enhancement system and method include extracting implicit links by mining users' access patterns and then using a modified link analysis algorithm to re-rank search results obtained from traditional search engines. More specifically, the implicit links search enhancement method includes extracting implicit links from a user access log, generating an implicit links graph from the extracted implicit links, and computing page rankings using the implicit links graph. The implicit links are extracted from the log using a two-item sequential pattern mining technique. Search results obtained from a search engine are re-ranked based on an implicit links analysis performed using an updated implicit links graph, a modified re-ranking formula, and at least one re-ranking technique.
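
The ranking and re-ranking stages can be sketched as a weighted PageRank over the implicit links graph followed by a blend with the search engine's own relevance scores. The abstract does not give the modified link analysis algorithm or the re-ranking formula, so the standard PageRank power iteration and the linear blend with `weight` used here are assumed substitutes, and the edge-weight dictionary format is hypothetical.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Weighted PageRank by power iteration.

    links: {(src, dst): weight}, e.g. support counts of mined implicit links.
    """
    nodes = sorted({n for pair in links for n in pair})
    out_weight = {n: 0.0 for n in nodes}
    for (src, _), w in links.items():
        out_weight[src] += w
    rank = {n: 1.0 / len(nodes) for n in nodes}
    for _ in range(iterations):
        # Rank held by pages without out-links is spread evenly over all pages.
        dangling = sum(rank[n] for n in nodes if out_weight[n] == 0)
        new_rank = {n: (1 - damping) / len(nodes) + damping * dangling / len(nodes)
                    for n in nodes}
        for (src, dst), w in links.items():
            new_rank[dst] += damping * rank[src] * w / out_weight[src]
        rank = new_rank
    return rank

def rerank(results, rank, weight=0.5):
    """Blend engine relevance (assumed in [0, 1]) with normalised PageRank.

    results: [(url, relevance)] as returned by a search engine (assumed format).
    """
    top = max(rank.values(), default=0.0) or 1.0

    def blended(item):
        url, relevance = item
        return weight * relevance + (1 - weight) * rank.get(url, 0.0) / top

    return sorted(results, key=blended, reverse=True)
```

In this sketch a page that users frequently reach from other pages can move ahead of a page with slightly higher text relevance, which is the intended effect of re-ranking with implicit links.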

    Implicit links search enhancement system and method for search engines using implicit links generated by mining user access patterns
    70.
    Invention application
    Status: In force

    Publication No.: US20050071465A1

    Publication date: 2005-03-31

    Application No.: US10676794

    Filing date: 2003-09-30

    IPC classification: G06F15/173 G06F17/30

    Abstract: An implicit links enhancement system and method for search engines that generates implicit links obtained from mining user access logs to facilitate enhanced local searching of web sites and intranets. The implicit links search enhancement system and method include extracting implicit links by mining users' access patterns and then using a modified link analysis algorithm to re-rank search results obtained from traditional search engines. More specifically, the implicit links search enhancement method includes extracting implicit links from a user access log, generating an implicit links graph from the extracted implicit links, and computing page rankings using the implicit links graph. The implicit links are extracted from the log using a two-item sequential pattern mining technique. Search results obtained from a search engine are re-ranked based on an implicit links analysis performed using an updated implicit links graph, a modified re-ranking formula, and at least one re-ranking technique.
