Method and apparatus for tracking a change in a collection of web documents
    1.
    发明授权
    Method and apparatus for tracking a change in a collection of web documents 有权
    跟踪Web文档集合中的变化的方法和装置

    公开(公告)号:US08886660B2

    公开(公告)日:2014-11-11

    申请号:US12027316

    申请日:2008-02-07

    IPC分类号: G06F17/30

    CPC分类号: G06F17/3089

    摘要: A method and an apparatus for tracking changes in a collection of web documents, for example, provided by a web site. The web documents are retrieved at a first assigned point in time and a second assigned point in time. Then a similarity measure for a combination of a retrieved web document at a first assigned point in time and a retrieved web document at a second assigned point in time is calculated for determining pairs of corresponding web documents. By comparing said calculated similarity measure of a pair of corresponding web documents with predetermined thresholds for the similarity measure a change in the content of the corresponding web document between the first assigned point in time and second assigned point in time is detected. Instead of referring to identifiers like URLs for web pages the content similarities of web pages are considered. The proposed strategy facilitates the work of marketing analysts.

    摘要翻译: 用于跟踪网站文档集合中的变化的方法和装置,例如由网站提供的。 在第一个分配的时间点和第二个分配的时间点检索网络文档。 然后,计算用于在第一分配时间点检索到的web文档和在第二分配时间点的检索到的web文档的组合的相似性度量,以确定对应的web文档对。 通过将一对相应的web文档的所述计算的相似性度量与相似性度量的预定阈值进行比较,检测在第一分配时间点和第二指定时间点之间的对应web文档的内容的变化。 不考虑网页的URL等标识符,而是考虑网页的内容相似性。 拟议的战略有助于营销分析师的工作。

    Method and apparatus for tracking a change in a collection of web documents
    2.
    发明申请
    Method and apparatus for tracking a change in a collection of web documents 有权
    跟踪Web文档集合中的变化的方法和装置

    公开(公告)号:US20090204595A1

    公开(公告)日:2009-08-13

    申请号:US12027316

    申请日:2008-02-07

    IPC分类号: G06F17/30

    CPC分类号: G06F17/3089

    摘要: A method and an apparatus for tracking changes in a collection of web documents, for example, provided by a web site. The web documents are retrieved at a first assigned point in time and a second assigned point in time. Then a similarity measure for a combination of a retrieved web document at a first assigned point in time and a retrieved web document at a second assigned point in time is calculated for determining pairs of corresponding web documents. By comparing said calculated similarity measure of a pair of corresponding web documents with predetermined thresholds for the similarity measure a change in the content of the corresponding web document between the first assigned point in time and second assigned point in time is detected. Instead of referring to identifiers like URLs for web pages the content similarities of web pages are considered. The proposed strategy facilitates the work of marketing analysts.

    摘要翻译: 用于跟踪网站文档集合中的变化的方法和装置,例如由网站提供的。 在第一个分配的时间点和第二个分配的时间点检索网络文档。 然后,计算用于在第一分配时间点检索到的web文档和在第二分配时间点的检索到的web文档的组合的相似性度量,以确定对应的web文档对。 通过将一对相应的web文档的所述计算的相似性度量与相似性度量的预定阈值进行比较,检测在第一分配时间点和第二指定时间点之间的对应web文档的内容的变化。 不考虑网页的URL等标识符,而是考虑网页的内容相似性。 拟议的战略有助于营销分析师的工作。

    Method and apparatus for comparing entities
    3.
    发明申请
    Method and apparatus for comparing entities 审中-公开
    用于比较实体的方法和装置

    公开(公告)号:US20090198593A1

    公开(公告)日:2009-08-06

    申请号:US12012181

    申请日:2008-01-31

    IPC分类号: G06Q30/00

    CPC分类号: G06Q30/02 G06Q30/0601

    摘要: A method and apparatus for comparing entities, such as companies or trademarks, based on available pricing and feature information regarding goods or services provided by the companies. Web services, e.g. price finding robots, are employed to obtain pricing and feature information for similar products from different companies. Products having similar features but derive from different companies can be grouped to clusters and be analyzed with respect to average pricing levels. Data formats for pricing and feature information stemming from different web services may be automatically retrieved and mapped into a common format. The method allows to evaluate systematic deviations in the company's pricing policies or to detect if companies do not have a matching product portfolio. One can also estimate the prestige or acceptance of a company or brand as a function of prices tolerated by the underlying market. This may facilitate marketing strategies.

    摘要翻译: 根据公司提供的商品或服务的可用定价和特征信息,比较公司或商标等实体的方法和装置。 Web服务,例如 价格寻找机器人,被用来获取来自不同公司的类似产品的定价和特征信息。 具有相似特征但来自不同公司的产品可以分组到集群中,并根据平均价格水平进行分析。 来自不同网络服务的定价和特征信息的数据格式可以自动检索并映射成通用格式。 该方法允许评估公司定价政策的系统偏差,或者检测公司是否没有匹配的产品组合。 人们还可以估计一个公司或品牌的威信或接受程度,作为潜在市场容忍的价格的函数。 这可能有助于营销策略。

    METHOD AND SYSTEM FOR ESTIMATING A SENTIMENT FOR AN ENTITY
    4.
    发明申请
    METHOD AND SYSTEM FOR ESTIMATING A SENTIMENT FOR AN ENTITY 有权
    用于估算实体感知的方法和系统

    公开(公告)号:US20090216524A1

    公开(公告)日:2009-08-27

    申请号:US12023085

    申请日:2008-02-26

    IPC分类号: G06F17/27 G06F17/21

    CPC分类号: G06F17/2785

    摘要: A method for estimating a sentiment conveyed by the content of information sources towards an entity is presented. The sentiment is obtained with respect to a query context that may be specified, e.g. by specific terms or expressions, like a product or service name. A sentiment dictionary having a plurality of sentiment terms is provided, wherein each sentiment term has assigned a sentiment value, and at least one of said sentiment terms is associated to a group context. Text documents are screened for occurrences of sentiment terms that are associated to a group context corresponding to the query context. Calculating a sentiment score value is performed as a function of the occurrences of sentiment terms having a similar or same group context as the query context. The method may be carried out automatically without manual analysis of the actual semantic content of the text documents under consideration.

    摘要翻译: 提出了一种估计信息源内容向实体传递的情绪的方法。 相对于可以指定的查询上下文获得情绪,例如, 通过具体的术语或表达,如产品或服务名称。 提供具有多个情绪条件的情绪词典,其中每个情绪词语已经分配了情感值,并且所述情绪词中的至少一个与组语境相关联。 对与查询上下文相对应的组上下文相关联的情绪术语的出现来筛选文本文档。 作为与具有与查询上下文相似或相同的组上下文的情绪项的出现的函数来执行计算情绪评分值。 该方法可以自动执行,而无需手动分析正在考虑的文本文档的实际语义内容。

    Method and system for estimating a sentiment for an entity
    5.
    发明授权
    Method and system for estimating a sentiment for an entity 有权
    用于估计实体情绪的方法和系统

    公开(公告)号:US08239189B2

    公开(公告)日:2012-08-07

    申请号:US12023085

    申请日:2008-02-26

    IPC分类号: G06F17/27 G06F17/28 G06F17/30

    CPC分类号: G06F17/2785

    摘要: A method for estimating a sentiment conveyed by the content of information sources towards an entity is presented. The sentiment is obtained with respect to a query context that may be specified, e. g. by specific terms or expressions, like a product or service name. A sentiment dictionary having a plurality of sentiment terms is provided, wherein each sentiment term has assigned a sentiment value, and at least one of said sentiment terms is associated to a group context. Text documents are screened for occurrences of sentiment terms that are associated to a group context corresponding to the query context. Calculating a sentiment score value is performed as a function of the occurrences of sentiment terms having a similar or same group context as the query context. The method may be carried out automatically without manual analysis of the actual semantic content of the text documents under consideration.

    摘要翻译: 提出了一种估计信息源内容向实体传递的情绪的方法。 对于可以指定的查询上下文获得情绪,例如, G。 通过具体的术语或表达,如产品或服务名称。 提供具有多个情绪条件的情绪词典,其中每个情绪词语已经分配了情感值,并且所述情绪词中的至少一个与组语境相关联。 对与查询上下文相对应的组上下文相关联的情绪术语的出现来筛选文本文档。 作为与具有与查询上下文相似或相同的组上下文的情绪项的出现的函数来执行计算情绪评分值。 该方法可以自动执行,而无需手动分析正在考虑的文本文档的实际语义内容。

    Method for estimating a prestige of an entity
    6.
    发明授权
    Method for estimating a prestige of an entity 有权
    估计实体信誉的方法

    公开(公告)号:US07895212B2

    公开(公告)日:2011-02-22

    申请号:US12069169

    申请日:2008-02-07

    IPC分类号: G06F17/30

    CPC分类号: G06Q10/10

    摘要: A method and an apparatus for estimating a prestige of an entity, as for example a firm, company or name, is disclosed wherein a score value is assigned to an entity as a function of an occurrence of terms associated with said entity in search results. The search results are obtained by searching an information space such as the internet. This enables, for example, companies or divisions, to infer their public standing from an analysis of search results obtained through internet search engines. It is possible to compare a plurality of entities with respect to each other in an automated fashion.

    摘要翻译: 公开了一种用于估计实体(例如公司,公司或名称)的声誉的方法和装置,其中将得分值作为在搜索结果中与所述实体相关联的术语的发生的函数分配给实体。 通过搜索诸如因特网的信息空间来获得搜索结果。 这使得例如公司或部门能够通过互联网搜索引擎获得的搜索结果的分析来推断他们的公共地位。 可以以自动方式相对于彼此来比较多个实体。

    Method for estimating a prestige of an entity
    7.
    发明申请
    Method for estimating a prestige of an entity 有权
    估计实体信誉的方法

    公开(公告)号:US20090150378A1

    公开(公告)日:2009-06-11

    申请号:US12069169

    申请日:2008-02-07

    IPC分类号: G06F17/30

    CPC分类号: G06Q10/10

    摘要: A method and an apparatus for estimating a prestige of an entity, as for example a firm, company or name, is disclosed wherein a score value is assigned to an entity as a function of an occurrence of terms associated with said entity in search results. The search results are obtained by searching an information space such as the internet. This enables, for example, companies or divisions, to infer their public standing from an analysis of search results obtained through internet search engines. It is possible to compare a plurality of entities with respect to each other in an automated fashion.

    摘要翻译: 公开了一种用于估计实体(例如公司,公司或名称)的声誉的方法和装置,其中将得分值作为在搜索结果中与所述实体相关联的术语的发生的函数分配给实体。 通过搜索诸如因特网的信息空间来获得搜索结果。 这使得例如公司或部门能够通过互联网搜索引擎获得的搜索结果的分析来推断他们的公共地位。 可以以自动方式相对于彼此来比较多个实体。