Methods and apparatus for clustering evolving data streams through online and offline components
    11.
    发明申请
    Methods and apparatus for clustering evolving data streams through online and offline components 有权
    通过在线和离线组件对不断发展的数据流进行聚类的方法和装置

    公开(公告)号:US20050038769A1

    公开(公告)日:2005-02-17

    申请号:US10641951

    申请日:2003-08-14

    IPC分类号: G06F17/30 G06F7/00

    摘要: A technique of clustering data of a data stream is provided. Online statistics are first created from the data stream. Offline processing of the online statistics is then performed when offline processing either required or desired. Online statistics may be created through the reception of data points from the data stream and the formation and updating of data groups. Offline processing may be performed by reclustering groups of data points around sampled data points and reporting the newly formed clusters.

    摘要翻译: 提供了一种数据流数据聚类技术。 在线统计信息首先从数据流创建。 然后,当离线处理需要或需要时,执行脱机处理在线统计信息。 可以通过从数据流接收数据点以及数据组的形成和更新来创建在线统计。 离线处理可以通过重新聚集采样数据点周围的数据点组并报告新形成的簇来执行。

    System and method for scalable processing of multi-way data stream correlations
    12.
    发明申请
    System and method for scalable processing of multi-way data stream correlations 失效
    用于多路数据流相关性的可扩展处理的系统和方法

    公开(公告)号:US20070288635A1

    公开(公告)日:2007-12-13

    申请号:US11417838

    申请日:2006-05-04

    IPC分类号: G06F15/173

    摘要: A computer implemented method, apparatus, and computer usable program code for processing multi-way stream correlations. Stream data are received for correlation. A task is formed for continuously partitioning a multi-way stream correlation workload into smaller workload pieces. Each of the smaller workload pieces may be processed by a single host. The stream data are sent to different hosts for correlation processing.

    摘要翻译: 一种用于处理多路流相关性的计算机实现的方法,装置和计算机可用程序代码。 接收流数据进行相关。 形成一个任务,用于将多路流相关工作负载连续划分成较小的工作负载。 每个较小的工作负载片段可以由单个主机处理。 流数据被发送到不同的主机进行相关处理。

    Method and apparatus for analyzing community evolution in graph data streams
    13.
    发明申请
    Method and apparatus for analyzing community evolution in graph data streams 失效
    用于分析图形数据流中的社区进化的方法和装置

    公开(公告)号:US20070288465A1

    公开(公告)日:2007-12-13

    申请号:US11243727

    申请日:2005-10-05

    IPC分类号: G06F7/00

    CPC分类号: G06Q10/00

    摘要: Improved techniques are disclosed for detecting patterns of interaction among a set of entities and analyzing community evolution in a stream environment. By way of example, a technique for processing data from a data stream includes the following steps/operations. A data point of the data stream representing an interaction event is obtained. An interaction graph is updated on-line based on the data point representing the interaction event. The updated interaction graph is stored in a nonvolatile memory. An interaction evolution is determined off-line from the updated interaction graph stored in the nonvolatile memory.

    摘要翻译: 公开了用于检测一组实体之间的交互模式并分析流环境中的社区进化的改进的技术。 作为示例,用于从数据流处理数据的技术包括以下步骤/操作。 获得表示交互事件的数据流的数据点。 基于表示交互事件的数据点,在线更新交互图。 更新的交互图存储在非易失性存储器中。 从存储在非易失性存储器中的更新的交互图中离线确定交互演进。

    Systems and methods for providing real-time classification of continuous data streatms
    14.
    发明申请
    Systems and methods for providing real-time classification of continuous data streatms 有权
    提供连续数据维护的实时分类的系统和方法

    公开(公告)号:US20070043565A1

    公开(公告)日:2007-02-22

    申请号:US11208893

    申请日:2005-08-22

    IPC分类号: G10L15/06

    CPC分类号: G10L15/063 G10L17/00

    摘要: Systems and methods are provided for real-time classification of streaming data. In particular, systems and methods for real-time classification of continuous data streams implement micro-clustering methods for offline and online processing of training data to build and dynamically update training models that are used for classification, as well as incrementally clustering the data over contiguous segments of a continuous data stream (in real-time) into a plurality of micro-clusters from which target profiles are constructed which define/model the behavior of the data in individual segments of the data stream.

    摘要翻译: 提供了系统和方法,用于流式传输数据的实时分类。 特别地,用于连续数据流的实时分类的系统和方法实现用于离线和在线处理训练数据的微聚类方法,以构建和动态地更新用于分类的训练模型,以及在连续数据上逐渐聚类数据 将连续数据流的段(实时)分割成多个微群集,从中构建目标简档,其定义/模拟数据流的各个段中的数据的行为。

    Method and apparatus for adaptive load shedding

    公开(公告)号:US20060195599A1

    公开(公告)日:2006-08-31

    申请号:US11068137

    申请日:2005-02-28

    IPC分类号: G06F15/16

    CPC分类号: H04L49/90

    摘要: One embodiment of the present method and apparatus adaptive load shedding includes receiving at least one data stream (comprising a plurality of tuples, or data items) into a first sliding window of memory. A subset of tuples from the received data stream is then selected for processing in accordance with at least one data stream operation, such as a data stream join operation. Tuples that are not selected for processing are ignored. The number of tuples selected and the specific tuples selected depend at least in part on a variety of dynamic parameters, including the rate at which the data stream (and any other processed data streams) is received, time delays associated with the received data stream, a direction of a join operation performed on the data stream and the values of the individual tuples with respect to an expected output.

    Method, system, and storage medium for implementing a multi-stage, multi-classification sales opportunity modeling system
    16.
    发明申请
    Method, system, and storage medium for implementing a multi-stage, multi-classification sales opportunity modeling system 审中-公开
    用于实施多阶段,多分类销售机会建模系统的方法,系统和存储介质

    公开(公告)号:US20060106666A1

    公开(公告)日:2006-05-18

    申请号:US10988666

    申请日:2004-11-15

    IPC分类号: G06F17/30

    摘要: A method for implementing a multi-stage, multi-classification sales opportunity modeling system. The method includes receiving operational data relating to past sales activities and receiving parameters identified as being relevant in determining a likelihood of whether exploitation of a sales opportunity will be successful. The method also includes generating a multi-stage model by applying the operational data and the parameters to an analytic engine for evaluating different factors affecting success of the sales opportunity.

    摘要翻译: 一种实现多阶段多分类销售机会建模系统的方法。 该方法包括接收与过去的销售活动相关的操作数据,并且接收被确定为与确定销售机会的利用是否成功的可能性相关的参数。 该方法还包括通过将操作数据和参数应用于分析引擎来生成多阶段模型,以评估影响销售机会成功的不同因素。

    System and method for graph indexing
    17.
    发明申请
    System and method for graph indexing 失效
    图索引的系统和方法

    公开(公告)号:US20060036564A1

    公开(公告)日:2006-02-16

    申请号:US10835729

    申请日:2004-04-30

    申请人: Xifeng Yan Philip Yu

    发明人: Xifeng Yan Philip Yu

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30247 G06Q30/0201

    摘要: Techniques for graph indexing are provided. In one aspect, a method for indexing graphs in a database, the graphs comprising graphic data, comprises the following steps. Frequent subgraphs among one or more of the graphs in the database are identified, the frequent subgraphs appearing in at least a threshold number of the graphs in the database. One or more of the frequent subgraphs are used to create an index of the graphs in the database.

    摘要翻译: 提供图索引技术。 一方面,一种用于索引数据库中的图形的方法,包括图形数据的图形包括以下步骤。 识别数据库中一个或多个图形中的频繁子图,频繁的子图出现在数据库中至少阈值数量的图形中。 一个或多个频繁子图用于创建数据库中图形的索引。

    Systems and methods for sequential modeling in less than one sequential scan
    18.
    发明申请
    Systems and methods for sequential modeling in less than one sequential scan 失效
    在不到一次顺序扫描中进行顺序建模的系统和方法

    公开(公告)号:US20060026110A1

    公开(公告)日:2006-02-02

    申请号:US10903336

    申请日:2004-07-30

    IPC分类号: G06F15/18

    CPC分类号: G06N99/005 Y10S707/99931

    摘要: Most recent research of scalable inductive learning on very large streaming dataset focuses on eliminating memory constraints and reducing the number of sequential data scans. However, state-of-the-art algorithms still require multiple scans over the data set and use sophisticated control mechanisms and data structures. There is discussed herein a general inductive learning framework that scans the dataset exactly once. Then, there is proposed an extension based on Hoeffding's inequality that scans the dataset less than once. The proposed frameworks are applicable to a wide range of inductive learners.

    摘要翻译: 对最大流式数据集的可伸缩归纳学习的最新研究着重于消除记忆限制并减少顺序数据扫描的次数。 然而,最先进的算法仍然需要对数据集进行多次扫描,并使用复杂的控制机制和数据结构。 这里讨论了一般的归纳学习框架,该框架一次扫描数据集。 然后,提出了一种基于Hoeffding不等式的扩展,可以扫描数据集不止一次。 提出的框架适用于广泛的归纳学习者。

    Methods and apparatus for dynamic classification of data in evolving data stream
    19.
    发明申请
    Methods and apparatus for dynamic classification of data in evolving data stream 失效
    在进化数据流中数据动态分类的方法和装置

    公开(公告)号:US20060004754A1

    公开(公告)日:2006-01-05

    申请号:US10881036

    申请日:2004-06-30

    IPC分类号: G06F7/00

    摘要: A technique for classifying data from a test data stream is provided. A stream of training data having class labels is received. One or more class-specific clusters of the training data are determined and stored. At least one test instance of the test data stream is classified using the one or more class-specific clusters.

    摘要翻译: 提供了一种从测试数据流中分类数据的技术。 接收具有类标签的训练数据流。 确定并存储训练数据的一个或多个类特定的簇。 测试数据流的至少一个测试实例使用一个或多个类特定簇进行分类。

    Secured method and apparatus for selling and distributing software and related services
    20.
    发明申请
    Secured method and apparatus for selling and distributing software and related services 审中-公开
    销售和分销软件及相关服务的安全方法和装置

    公开(公告)号:US20050108170A1

    公开(公告)日:2005-05-19

    申请号:US10715287

    申请日:2003-11-17

    IPC分类号: G06Q30/00 G06F17/60

    摘要: A method for distributing and utilizing software is provided. In the method of distribution, a software application is provided on a hardware device by a manufacturer of the software application, wherein the software application is executable on the hardware device. The hardware device is enclosed within a box and distributed. The manufacturer provides continued services for the software application, wherein the hardware device is connectable between at least one end user's computer and the manufacturer. The hardware device is adapted to provide the continued services via a communication link between the hardware device and the manufacturer.

    摘要翻译: 提供了一种分发和利用软件的方法。 在分发方法中,由软件应用的制造商在硬件设备上提供软件应用,其中软件应用可在硬件设备上执行。 硬件设备封装在一个盒子内并分发。 制造商为软件应用程序提供持续的服务,其中硬件设备可在至少一个最终用户的计算机和制造商之间连接。 硬件设备适于通过硬件设备和制造商之间的通信链路来提供持续的服务。