System and method for visualizing the temporal evolution of object metadata
    1.
    发明申请
    System and method for visualizing the temporal evolution of object metadata 有权
    用于可视化对象元数据的时间演化的系统和方法

    公开(公告)号:US20070283290A1

    公开(公告)日:2007-12-06

    申请号:US11437415

    申请日:2006-05-19

    IPC分类号: G06F17/30

    摘要: An improved system and method for selecting and visualizing object metadata evolving over time is provided. An application may generate a visualization depicting the temporal evolution of metadata describing objects in an object store over a plurality of time intervals. The application may switch between a visualization of object metadata flowing like a river or cascading like a waterfall over time. A ranked list of metadata items may be determined for some pre-selected intervals during a pre-processing step. Then at runtime when a request may be received for providing a ranked list of metadata items for a query interval, a combination of time intervals from the pre-selected time intervals may be determined that cover the query time interval, and the ranked lists of metadata items for each time interval in the combination of time intervals that cover the query time interval may be aggregated and output for visualization.

    摘要翻译: 提供了一种改进的系统和方法,用于随着时间的推移来选择和可视化对象元数据。 应用可以生成描绘在多个时间间隔中描述对象存储中的对象的元数据的时间演变的可视化。 应用程序可以在像河流一样流动的对象元数据的可视化或瀑布之间随着时间的流逝而进行切换。 可以在预处理步骤期间为某些预选间隔确定元数据项目的排名列表。 然后在运行时,当可以接收到用于为查询间隔提供元数据项的排序列表的请求时,可以确定来自预先选择的时间间隔的时间间隔的组合,其覆盖查询时间间隔,并且排列的元数据列表 涵盖查询时间间隔的时间间隔组合的每个时间间隔的项目可以被聚合并输出以用于可视化。

    Method and Apparatus for Detecting and Explaining Bursty Stream Events in Targeted Groups
    2.
    发明申请
    Method and Apparatus for Detecting and Explaining Bursty Stream Events in Targeted Groups 有权
    用于检测和解释目标组中突发事件的方法和装置

    公开(公告)号:US20090157651A1

    公开(公告)日:2009-06-18

    申请号:US11958913

    申请日:2007-12-18

    IPC分类号: G06F17/30 G06F7/00

    CPC分类号: G06F17/30864

    摘要: A method and apparatus are provided for detecting and explaining bursty stream events in targeted groups. In one example, the method includes receiving validated bursty events, finding explanatory data sources having relevant bursty events that are relevant to the validated bursty events, wherein the explanatory sources explain the presence of the validated bursty events, correlating the validated bursty events to the relevant bursty events of the explanatory data sources to obtain burst results, and sending the burst results to a burst database that is accessible to an end user.

    摘要翻译: 提供了一种用于检测和解释目标组中的突发流事件的方法和装置。 在一个示例中,该方法包括接收经过验证的突发事件,找到具有与经验证的突发事件相关的相关突发事件的说明性数据源,其中解释源解释存在经验证的突发事件,将验证的突发事件与相关的突发事件相关联 解释数据源的突发事件以获得突发结果,并将突发结果发送到最终用户可访问的突发数据库。

    System and method for selecting object metadata evolving over time
    3.
    发明申请
    System and method for selecting object metadata evolving over time 有权
    随着时间的推移,用于选择对象元数据的系统和方法

    公开(公告)号:US20070271270A1

    公开(公告)日:2007-11-22

    申请号:US11437425

    申请日:2006-05-19

    IPC分类号: G06F7/00

    CPC分类号: G06F17/30864

    摘要: An improved system and method for selecting and visualizing object metadata evolving over time is provided. An application may generate a visualization depicting the temporal evolution of metadata describing objects in an object store over a plurality of time intervals. The application may switch between a visualization of object metadata flowing like a river or cascading like a waterfall over time. A ranked list of metadata items may be determined for some pre-selected intervals during a pre-processing step. Then at runtime when a request may be received for providing a ranked list of metadata items for a query interval, a combination of time intervals from the pre-selected time intervals may be determined that cover the query time interval, and the ranked lists of metadata items for each time interval in the combination of time intervals that cover the query time interval may be aggregated and output for visualization.

    摘要翻译: 提供了一种改进的系统和方法,用于随着时间的推移来选择和可视化对象元数据。 应用可以生成描绘在多个时间间隔中描述对象存储中的对象的元数据的时间演变的可视化。 应用程序可以在像河流一样流动的对象元数据的可视化或瀑布之间随着时间的流逝而进行切换。 可以在预处理步骤期间为某些预选间隔确定元数据项目的排名列表。 然后在运行时,当可以接收到用于为查询间隔提供元数据项的排序列表的请求时,可以确定来自预先选择的时间间隔的时间间隔的组合,其覆盖查询时间间隔,并且排列的元数据列表 涵盖查询时间间隔的时间间隔组合的每个时间间隔的项目可以被聚合并输出以用于可视化。

    Extracting informative phrases from unstructured text
    4.
    发明申请
    Extracting informative phrases from unstructured text 有权
    从非结构化文本中提取信息短语

    公开(公告)号:US20070067289A1

    公开(公告)日:2007-03-22

    申请号:US11231075

    申请日:2005-09-20

    申请人: Jasmine Novak

    发明人: Jasmine Novak

    IPC分类号: G06F17/30

    摘要: Disclosed is a method of extracting informative phrases from a full corpus of documents. An index of phrases contained in the full corpus of documents is built. Then, a user specifies a subset of text to analyze. The subset may be defined as: (1) all paragraphs or sentences containing terms selected as defining a subject; (2) all documents in a category; (3) all documents written within a date range; and/or (3) all documents matching a Boolean query of terms. Once the subset is specified, it is analyzed to extract informative phrases. Specifically, the index is queried to retrieve all phrases within the subset. The number of times each of the phases occurs in the subset and in the corpus is counted. Each phrase contained in the subset is scored according to informativeness based on a comparison of a likelihood that the phrase occurs in the subset and a likelihood that the phrase occurs in the corpus as a whole. Only those phrases having an informativeness score above a predetermined value are considered highly informative and extracted.

    摘要翻译: 公开了一种从全文的文档中提取信息性短语的方法。 建立了文件全集中包含的短语索引。 然后,用户指定要分析的文本的子集。 该子集可以定义为:(1)包含选定为定义主题的术语的所有段落或句子; (2)类别中的所有文件; (3)在日期范围内编写的所有文件; 和/或(3)所有符合条件的布尔查询的文档。 一旦子集被指定,就会对其进行分析以提取信息性短语。 具体来说,查询索引以检索子集内的所有短语。 每个阶段发生在子集和语料库中的次数被计数。 基于在短语发生在子集中的可能性与该短语出现在语料库中作为整体的可能性的比较,根据信息性对包含在该子集中的每个短语进行评分。 只有具有高于预定值的信息分数的那些短语被认为是高度信息和提取的。

    System, service, and method for predicting sales from online public discussions
    5.
    发明申请
    System, service, and method for predicting sales from online public discussions 有权
    用于从在线公众讨论预测销售的系统,服务和方法

    公开(公告)号:US20070027741A1

    公开(公告)日:2007-02-01

    申请号:US11191776

    申请日:2005-07-27

    IPC分类号: G07G1/00

    CPC分类号: G06Q30/02 G06Q30/0202

    摘要: A sales prediction system predicts sales from online public discussions. The system utilizes manually or automatically formulated predicates to capture subsets of postings in online public discussions. The system predicts spikes in sales rank based on online chatter. The system comprises automated algorithms that predict spikes in sales rank given a time series of counts of online discussions such as blog postings. The system utilizes a stateless model of customer behavior based on a series of states of excitation that are increasingly likely to lead to a purchase decision. The stateless model of customer behavior yields a predictor of sales rank spikes that is significantly more accurate than conventional techniques operating on sales rank data alone.

    摘要翻译: 销售预测系统预测在线公众讨论的销售。 系统利用手动或自动制定的谓词来捕获在线公开讨论中的帖子。 该系统基于在线聊天预测销售额的高峰。 该系统包括自动算法,用于预测在线讨论的时间序列(如博客帖子)的销售排名。 该系统基于一系列越来越有可能导致购买决定的激励状态,利用客户行为的无状态模型。 客户行为的无状态模型产生了销售排名尖峰的预测指标,这比传统的技术仅针对销售排名数据进行操作。

    System, method and service for ranking search results using a modular scoring system
    6.
    发明申请
    System, method and service for ranking search results using a modular scoring system 有权
    使用模块化评分系统对搜索结果进行排名的系统,方法和服务

    公开(公告)号:US20050262050A1

    公开(公告)日:2005-11-24

    申请号:US10841391

    申请日:2004-05-07

    IPC分类号: G06F7/00 G06F17/30

    摘要: A modular scoring system using rank aggregation merges search results into an ordered list of results using many different features of documents. The ranking functions of the present system can easily be customized to the needs of a particular corpus or collection of users such as an intranet. Rank aggregation is independent of the underlying score distributions between the different factors, and can be applied to merge any set of ranking functions. Rank aggregation holds the advantage of combining the influence of many different heuristic factors in a robust way to produce high-quality results for queries. The modular scoring system combines factors such as indegree, page ranking, URL length, proximity to the root server of an intranet, etc, to form a single ordering on web pages that closely obeys the individual orderings, but also mediates between the collective wisdom of individual heuristics.

    摘要翻译: 使用秩聚合的模块评分系统将搜索结果合并到使用许多不同文档特征的结果的有序列表中。 本系统的排名功能可以根据特定语料库或诸如内部网的用户集合的需要进行定制。 排名聚合独立于不同因素之间的底层分数分布,并且可以应用于合并任何一组排名函数。 排名聚合具有将许多不同启发式因素的影响结合在一起的优势,可以为查询产生高质量的结果。 模块化评分系统结合诸如不分级,页面排序,URL长度,与内部网的根服务器的接近度等因素,在严格遵循单个排序的网页上形成单一排序,而且还在集体智慧之间进行调解 个人启发式

    Detecting relationships in unstructured text
    8.
    发明授权
    Detecting relationships in unstructured text 有权
    检测非结构化文本中的关系

    公开(公告)号:US08001144B2

    公开(公告)日:2011-08-16

    申请号:US12056048

    申请日:2008-03-26

    申请人: Jasmine Novak

    发明人: Jasmine Novak

    IPC分类号: G06F17/30 G06F17/27

    CPC分类号: G06F17/278 G06F17/30731

    摘要: Disclosed are embodiments of a system and a method for detecting relationships described in unstructured text-based electronic documents. The system and method incorporate the use of an input file that contains one or more text patterns that represent particular relationships. The text patterns each include regular text expressions that describe the particular relationship and slots for the location of each entity in that relationship. Document(s) are selected by a user and scanned by a proper noun tagger that identifies and tags every occurrence of proper names within the document(s). Then, a pattern matcher scans the document(s) to match text patterns. If a text pattern is matched within a document a relationship detector extracts all pairs of proper names found in the slots for each matched text pattern. The output from the relationship detector includes the names for each entity in the relationship, the type of relationship, and the identity of the document and the location of the sentence describing the relationship in the document.

    摘要翻译: 公开了用于检测非结构化基于文本的电子文档中描述的关系的系统和方法的实施例。 系统和方法结合使用包含表示特定关系的一个或多个文本模式的输入文件。 每个文本模式都包括常规文本表达式,描述该关系中每个实体的位置的特定关系和位置。 文档由用户选择并由专有名词标签器进行扫描,该标签器标识和标记文档中每个出现的专有名称。 然后,模式匹配器扫描文档以匹配文本模式。 如果在文档中匹配文本模式,则关系检测器提取针对每个匹配的文本模式在时隙中找到的所有正确名称对。 关系检测器的输出包括关系中每个实体的名称,关系的类型以及文档的身份以及描述文档中关系的句子的位置。

    ENRICHED DOCUMENT REPRESENTATIONS USING AGGREGATED ANCHOR TEXT
    9.
    发明申请
    ENRICHED DOCUMENT REPRESENTATIONS USING AGGREGATED ANCHOR TEXT 审中-公开
    使用聚集的锚固文本增强文档表示

    公开(公告)号:US20100318533A1

    公开(公告)日:2010-12-16

    申请号:US12482377

    申请日:2009-06-10

    IPC分类号: G06F17/00 G06F17/30

    CPC分类号: G06F16/958

    摘要: A system and method for aggregating anchor text over the web graph and using the aggregated anchor text to enrich document representations. For a target page, its internal inlinks, which point to the target page and are within the site containing the target page, are identified first. Then external anchors that point to the internal inlinks from pages outside of the site are identified. Anchor text of the external anchors are collected, weighted, stored, and used to enrich document presentations. The method not only reduces the number of pages with no anchor text, but also adds lines of anchor text to URLs.

    摘要翻译: 一种用于在网络图上聚合锚文本并使用聚合锚文本来丰富文档表示的系统和方法。 对于目标页面,首先标识其指向目标页面并且在包含目标页面的站点内的内部链接。 然后识别指向站点外部的内部链接的外部锚点。 收集,加权,存储和使用外部锚点的锚文本来丰富文档演示。 该方法不仅减少了没有锚文本的页面数,而且还向URL添加了一些锚文本。

    System and method for selecting object metadata evolving over time
    10.
    发明授权
    System and method for selecting object metadata evolving over time 有权
    随着时间的推移,用于选择对象元数据的系统和方法

    公开(公告)号:US07739275B2

    公开(公告)日:2010-06-15

    申请号:US11437425

    申请日:2006-05-19

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30864

    摘要: An improved system and method for selecting and visualizing object metadata evolving over time is provided. An application may generate a visualization depicting the temporal evolution of metadata describing objects in an object store over a plurality of time intervals. The application may switch between a visualization of object metadata flowing like a river or cascading like a waterfall over time. A ranked list of metadata items may be determined for some pre-selected intervals during a pre-processing step. Then at runtime when a request may be received for providing a ranked list of metadata items for a query interval, a combination of time intervals from the pre-selected time intervals may be determined that cover the query time interval, and the ranked lists of metadata items for each time interval in the combination of time intervals that cover the query time interval may be aggregated and output for visualization.

    摘要翻译: 提供了一种改进的系统和方法,用于随着时间的推移来选择和可视化对象元数据。 应用可以生成描绘在多个时间间隔中描述对象存储中的对象的元数据的时间演变的可视化。 应用程序可以在像河流一样流动的对象元数据的可视化或瀑布之间随着时间的流逝而进行切换。 可以在预处理步骤期间为某些预选间隔确定元数据项目的排名列表。 然后在运行时,当可以接收到用于为查询间隔提供元数据项的排序列表的请求时,可以确定来自预先选择的时间间隔的时间间隔的组合,其覆盖查询时间间隔,并且排列的元数据列表 涵盖查询时间间隔的时间间隔组合的每个时间间隔的项目可以被聚合并输出以用于可视化。