Automatic word-cloud generation
    1.
    发明授权
    Automatic word-cloud generation 有权
    自动字云生成

    公开(公告)号:US08892554B2

    公开(公告)日:2014-11-18

    申请号:US13113110

    申请日:2011-05-23

    IPC分类号: G06F17/30 G06F7/00 G06F17/24

    摘要: Method, system, and computer program product for automatic generation of a word-cloud for a content item are provided. The method includes: extracting terms from a content item using statistical selection criteria; weighting a term by a probability that the term is used as a tag; and generating a visual representation of terms with enhanced representation of terms according to the weighting. Weighting a term by a probability that the term is used as a tag may include determining the relative frequency of the term in a folksonomy of tag terms for a domain.

    摘要翻译: 提供了用于自动生成内容项的单词云的方法,系统和计算机程序产品。 该方法包括:使用统计选择标准从内容项提取术语; 以术语用作标签的概率加权项; 以及根据加权产生具有增强的术语表示的术语的视觉表示。 以术语用作标签的概率来加权术语可以包括确定域的标签术语的民间学习中术语的相对频率。

    AUTOMATIC WOD-CLOUD GENERATION
    2.
    发明申请
    AUTOMATIC WOD-CLOUD GENERATION 有权
    自动生成云生成

    公开(公告)号:US20120303637A1

    公开(公告)日:2012-11-29

    申请号:US13113110

    申请日:2011-05-23

    IPC分类号: G06F17/30 G06F7/00

    摘要: Method, system, and computer program product for automatic generation of a word-cloud for a content item are provided. The method includes: extracting terms from a content item using statistical selection criteria; weighting a term by a probability that the term is used as a tag; and generating a visual representation of terms with enhanced representation of terms according to the weighting. Weighting a term by a probability that the term is used as a tag may include determining the relative frequency of the term in a folksonomy of tag terms for a domain.

    摘要翻译: 提供了用于自动生成内容项的单词云的方法,系统和计算机程序产品。 该方法包括:使用统计选择标准从内容项提取术语; 以术语用作标签的概率加权项; 以及根据加权产生具有增强的术语表示的术语的视觉表示。 以术语用作标签的概率来加权术语可以包括确定域的标签术语的民间学习中术语的相对频率。

    POPULARITY PREDICTION OF USER-GENERATED CONTENT
    3.
    发明申请
    POPULARITY PREDICTION OF USER-GENERATED CONTENT 审中-公开
    用户生成内容的人气预测

    公开(公告)号:US20110302103A1

    公开(公告)日:2011-12-08

    申请号:US12795684

    申请日:2010-06-08

    IPC分类号: G06Q99/00 G06Q50/00

    CPC分类号: G06Q10/10 G06Q30/0282

    摘要: A method, system, and computer program product for popularity prediction of user-generated content are provided. The method includes measuring the novelty of a user-generated content and predicting the popularity of the user-generated content based on the measured novelty. Predicting the popularity of the user-generated content includes: extracting basic features of the user-generated content; measuring novelty features of the user-generated content; and predicting the popularity based on the basic features and novelty features. Measuring the novelty of a user-generated content includes one or more of: measuring a relative novelty of the user-generated content with respect to the contribution history of the same user in a given time period; measuring a relative novelty of the user-generated content with respect to user-generated content of other users in a given time period; and measuring a relative novelty of the user-generated content with respect to the references by other users to the user-generated content.

    摘要翻译: 提供了一种用于用户生成内容的普及预测的方法,系统和计算机程序产品。 该方法包括测量用户生成的内容的新颖性,并且基于所测量的新颖性来预测用户生成的内容的普及度。 预测用户生成内容的普及性包括:提取用户生成的内容的基本特征; 测量用户生成内容的新奇特征; 并根据基本特征和新颖特征预测人气。 测量用户产生的内容的新颖性包括以下一个或多个:在给定时间段内相对于同一用户的贡献历史测量用户生成的内容的相对新颖性; 测量在给定时间段内用户生成的内容相对于其他用户的用户生成内容的相对新颖性; 并且测量用户生成的内容相对于其他用户对用户生成的内容的引用的相对新颖性。

    MEASURING WEB SITE SATISFACTION OF INFORMATION NEEDS
    4.
    发明申请
    MEASURING WEB SITE SATISFACTION OF INFORMATION NEEDS 失效
    衡量网站对信息需求的满意度

    公开(公告)号:US20110106799A1

    公开(公告)日:2011-05-05

    申请号:US12987181

    申请日:2011-01-10

    IPC分类号: G06F17/30

    摘要: A method, system, and computer program product for measuring web site satisfaction of information needs are provided. The method includes: selecting a page for analysis; generating a page profile in the form of a list of keywords representing the page; generating a page traffic profile in the form of lists of keywords representing information needs of users, wherein the page traffic profile is generated from keywords used by users to visit the page; determining the success of users' visits to the page; and analyzing whether a page satisfies users' information needs by applying a distance measure between the keywords of the page profile and the keywords of the page traffic profile and combining the distance measure result with a success rate of the keywords.

    摘要翻译: 提供了一种用于衡量信息需求的网站满意度的方法,系统和计算机程序产品。 该方法包括:选择页面进行分析; 以表示页面的关键字的列表的形式生成页面简档; 以表示用户的信息需求的关键词列表的形式生成页面流量简档,其中页面流量简档是由用户访问页面所使用的关键字生成的; 确定用户访问页面的成功; 以及通过在页面简档的关键字和页面流量简档的关键字之间应用距离度量来分析页面是否满足用户信息需求,并且将距离测量结果与关键字的成功率相结合。

    Indexing and searching entity-relationship data
    5.
    发明授权
    Indexing and searching entity-relationship data 有权
    索引和搜索实体关系数据

    公开(公告)号:US08751505B2

    公开(公告)日:2014-06-10

    申请号:US13417248

    申请日:2012-03-11

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30604

    摘要: Method, system, and computer program product for indexing and searching entity-relationship data are provided. The method includes: defining a logical document model for entity-relationship data including: representing an entity as a document containing the entity's searchable content and metadata; dually representing the entity as a document and as a category; and representing each relationship instance for the entity as a category set that contains categories of all participating entities in the relationship. The method also includes: translating entity-relationship data into the logical document model; and indexing the entity-relationship data of the populated logical document model as an inverted index. The method may include searching indexed entity-relationship data using a faceted search, wherein the categories are all categories required for supporting faceted navigation.

    摘要翻译: 提供了索引和搜索实体关系数据的方法,系统和计算机程序产品。 该方法包括:定义用于实体关系数据的逻辑文档模型,包括:将实体表示为包含该实体的可搜索内容和元数据的文档; 将实体双重表示为文件和类别; 并将实体的每个关系实例表示为包含关系中所有参与实体的类别的类别集合。 该方法还包括:将实体关系数据转换为逻辑文档模型; 并将填充的逻辑文档模型的实体关系数据索引为反向索引。 该方法可以包括使用分面搜索搜索索引的实体关系数据,其中类别是支持分面导航所需的所有类别。

    Method and system for using social bookmarks
    6.
    发明授权
    Method and system for using social bookmarks 有权
    使用社交书签的方法和系统

    公开(公告)号:US08266157B2

    公开(公告)日:2012-09-11

    申请号:US12550376

    申请日:2009-08-30

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30

    摘要: A method and system for using social bookmarks wherein a social bookmark is a triplet of the entities of user, document, and tag. The method including: collecting multiple bookmarks; representing the bookmarks as a three-dimensional space or matrix of the number of times a user u, used tag t to bookmark document d; measuring the similarity of two entities of the same type; and using the similarity to weight bookmarks or entities. The weightings may be used to provide a measure of a usefulness of a bookmark for describing a document for retrieval purposes. Two-dimensions of the bookmark space may also be used to predict the third-dimension.

    摘要翻译: 一种用于使用社交书签的方法和系统,其中社交书签是用户,文档和标签的实体的三元组。 该方法包括:收集多个书签; 将书签代表用户u的三维空间或矩阵,使用标签t来书签文件d; 测量相同类型的两个实体的相似度; 并使用相似度加权书签或实体。 权重可用于提供用于描述用于检索目的的文档的书签的有用性的量度。 书签空间的二维还可以用于预测第三维。

    Method and system for maintaining profiles of information channels
    7.
    发明授权
    Method and system for maintaining profiles of information channels 有权
    维护信息通道的方法和系统

    公开(公告)号:US07970739B2

    公开(公告)日:2011-06-28

    申请号:US12111972

    申请日:2008-04-30

    IPC分类号: G06F7/00

    CPC分类号: G06F17/30867 H04L69/14

    摘要: A method and system are provided for maintaining profiles of information channels available on the Web, wherein the information channels are accessed via pull-only protocols. The method includes monitoring one or more channels by a channel pull action at a monitoring rate, wherein the monitoring rate is determined for the one or more channels based on the number of update events in a previous time period. The method may optimally include filtering the update events in the time period by a novelty measure, wherein the filtering disregards events that do not include significant novel information. The monitoring rate is adapted based on reinforcement learning applying iterative learning rules over time.

    摘要翻译: 提供了一种用于维护在Web上可用的信息信道的简档的方法和系统,其中通过仅拉协议访问信息信道。 该方法包括以监视速率通过信道拉动操作监视一个或多个信道,其中基于前一时间段内的更新事件的数量来确定针对一个或多个信道的监视速率。 该方法可以最佳地包括通过新颖度量来对该时间段内的更新事件进行过滤,其中过滤忽略不包括重要新颖信息的事件。 基于强化学习,随着时间的推移应用迭代学习规则,对监测率进行了调整。

    METHOD AND SYSTEM FOR IMPROVED QUERY EXPANSION IN FACETED SEARCH
    8.
    发明申请
    METHOD AND SYSTEM FOR IMPROVED QUERY EXPANSION IN FACETED SEARCH 审中-公开
    用于在面向搜索中改进查询扩展的方法和系统

    公开(公告)号:US20110125764A1

    公开(公告)日:2011-05-26

    申请号:US12626642

    申请日:2009-11-26

    IPC分类号: G06F17/30

    CPC分类号: G06F16/3338 G06F16/332

    摘要: A method and system for improved query expansion in faceted search are provided. The method includes: receiving a search query; expanding the search query to obtain query expansion terms; and receiving a facet selection for the search query. A facet profile is retrieved in the form of collected important terms for the facet; and the query expansion terms are weighted by comparing them to the facet profile. The query expansion terms are re-ranked and the method includes executing the re-weighted query expansion terms whilst filtering for the facet.

    摘要翻译: 提供了一种用于改进多面搜索中查询扩展的方法和系统。 该方法包括:接收搜索查询; 扩展搜索查询以获取查询扩展条款; 并接收搜索查询的小平面选择。 以收集的重要术语的形式检索小平面; 并且通过将查询扩展项与小平面轮廓进行比较来加权查询扩展项。 查询扩展术语被重新排序,并且该方法包括执行重新加权的查询扩展项,同时对小平面进行过滤。

    Method and System of Prioritising Operations On Network Objects
    9.
    发明申请
    Method and System of Prioritising Operations On Network Objects 审中-公开
    网络对象优先操作优先级的方法与系统

    公开(公告)号:US20100281035A1

    公开(公告)日:2010-11-04

    申请号:US12432808

    申请日:2009-04-30

    IPC分类号: G06F17/30 G06N7/02 G06Q99/00

    CPC分类号: G06Q50/01 G06F16/951

    摘要: A method and system for prioritising operations on network objects are provided. The method includes gathering Web 2.0 available relationship data on the relationships between network entities, wherein network entities are network users and network objects. The relationship data for a network entity is analysed and a first relative score is determined based on the relationship data. For a network object, a second relative score is determined which is a dynamic score based on user interactions with the network object and formed using the first relative scores of network entities interacting with the object. The method then prioritizes an operation on a network object using the second relative score.

    摘要翻译: 提供了一种用于对网络对象进行优先级操作的方法和系统。 该方法包括收集关于网络实体之间的关系的Web 2.0可用关系数据,其中网络实体是网络用户和网络对象。 分析网络实体的关系数据,并且基于关系数据确定第一相对分数。 对于网络对象,确定第二相对分数,其是基于与网络对象的用户交互的动态分数,并且使用与对象交互的网络实体的第一相对分数形成。 该方法然后使用第二相对分数对网络对象的操作进行优先级排序。

    INDEXING AND SEARCHING ENTITY-RELATIONSHIP DATA
    10.
    发明申请
    INDEXING AND SEARCHING ENTITY-RELATIONSHIP DATA 有权
    指数和搜索实体关系数据

    公开(公告)号:US20130238631A1

    公开(公告)日:2013-09-12

    申请号:US13417248

    申请日:2012-03-11

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30604

    摘要: Method, system, and computer program product for indexing and searching entity-relationship data are provided. The method includes: defining a logical document model for entity-relationship data including: representing an entity as a document containing the entity's searchable content and metadata; dually representing the entity as a document and as a category; and representing each relationship instance for the entity as a category set that contains categories of all participating entities in the relationship. The method also includes: translating entity-relationship data into the logical document model; and indexing the entity-relationship data of the populated logical document model as an inverted index. The method may include searching indexed entity-relationship data using a faceted search, wherein the categories are all categories required for supporting faceted navigation.

    摘要翻译: 提供了索引和搜索实体关系数据的方法,系统和计算机程序产品。 该方法包括:定义用于实体关系数据的逻辑文档模型,包括:将实体表示为包含该实体的可搜索内容和元数据的文档; 将实体双重表示为文件和类别; 并将实体的每个关系实例表示为包含关系中所有参与实体的类别的类别集合。 该方法还包括:将实体关系数据转换为逻辑文档模型; 并将填充的逻辑文档模型的实体关系数据索引为反向索引。 该方法可以包括使用分面搜索搜索索引的实体关系数据,其中类别是支持分面导航所需的所有类别。