IDENTIFYING INCONSISTENCIES IN OBJECT SIMILARITIES FROM MULTIPLE INFORMATION SOURCES
    11.
    发明申请
    IDENTIFYING INCONSISTENCIES IN OBJECT SIMILARITIES FROM MULTIPLE INFORMATION SOURCES 有权
    识别来自多个信息来源的对象类似中的不符合

    公开(公告)号:US20130151543A1

    公开(公告)日:2013-06-13

    申请号:US13316178

    申请日:2011-12-09

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30598 G11B27/32

    摘要: A horizontal anomaly detection method includes receiving at plurality of objects described in a plurality of information sources, wherein each individual information source captures a plurality of similarity relationships between the objects, combining the information sources to determine a similarity matrix whose entries represent quantitative scores of similarity between pairs of the objects, and identifying at least one horizontal anomaly of the objects within the similarity matrix, wherein the horizontal anomalies are anomalous relationships across the plurality of information sources.

    摘要翻译: 水平异常检测方法包括在多个信息源中描述的多个对象处接收,其中每个单独的信息源捕获对象之间的多个相似关系,组合信息源以确定其条目表示相似度的定量分数的相似性矩阵 在所述对象之间,并且识别所述相似性矩阵内的对象的至少一个水平异常,其中所述水平异常是所述多个信息源之间的异常关系。

    System and method for scalable cost-sensitive learning
    13.
    发明授权
    System and method for scalable cost-sensitive learning 有权
    可扩展成本敏感学习的系统和方法

    公开(公告)号:US07904397B2

    公开(公告)日:2011-03-08

    申请号:US12690502

    申请日:2010-01-20

    IPC分类号: G06F15/18 G06N3/00 G06N3/12

    CPC分类号: G06N99/005

    摘要: A method (and structure) for processing an inductive learning model for a dataset of examples, includes dividing the dataset of examples into a plurality of subsets of data and generating, using a processor on a computer, a learning model using examples of a first subset of data of the plurality of subsets of data. The learning model being generated for the first subset comprises an initial stage of an evolving aggregate learning model (ensemble model) for an entirety of the dataset, the ensemble model thereby providing an evolving estimated learning model for the entirety of the dataset if all the subsets were to be processed. The generating of the learning model using data from a subset includes calculating a value for at least one parameter that provides an objective indication of an adequacy of a current stage of the ensemble model.

    摘要翻译: 一种用于处理实例的数据集的感应学习模型的方法(和结构),包括将示例的数据集划分成多个数据子集,并使用计算机上的处理器生成使用第一子集的示例的学习模型 的多个数据子集的数据。 为第一子集生成的学习模型包括用于整个数据集的演进聚合学习模型(集合模型)的初始阶段,从而为整个数据集提供演进的估计学习模型,如果所有子集 被处理。 使用来自子集的数据生成学习模型包括计算至少一个参数的值,所述参数提供对所述集合模型的当前阶段的充分性的客观指示。

    System and method for sequence-based subspace pattern clustering
    14.
    发明授权
    System and method for sequence-based subspace pattern clustering 失效
    基于序列的子空间模式聚类的系统和方法

    公开(公告)号:US07565346B2

    公开(公告)日:2009-07-21

    申请号:US10858541

    申请日:2004-05-31

    IPC分类号: G06F17/30

    CPC分类号: G06K9/6215 Y10S707/99936

    摘要: Unlike traditional clustering methods that focus on grouping objects with similar values on a set of dimensions, clustering by pattern similarity finds objects that exhibit a coherent pattern of rise and fall in subspaces. Pattern-based clustering extends the concept of traditional clustering and benefits a wide range of applications, including e-Commerce target marketing, bioinformatics (large scale scientific data analysis), and automatic computing (web usage analysis), etc. However, state-of-the-art pattern-based clustering methods (e.g., the pCluster algorithm) can only handle datasets of thousands of records, which makes them inappropriate for many real-life applications. Furthermore, besides the huge data volume, many data sets are also characterized by their sequentiality, for instance, customer purchase records and network event logs are usually modeled as data sequences. Hence, it becomes important to enable pattern-based clustering methods i) to handle large datasets, and ii) to discover pattern similarity embedded in data sequences. There is presented herein a novel method that offers this capability.

    摘要翻译: 与传统的集群方法不同,传统的集群方法集中在对一组维度上具有类似值的对象进行分组,通过模式相似性进行聚类可以找到在子空间中呈现一致的上升和下降模式的对象。 基于模式的群集扩展了传统群集的概念,受益于广泛的应用,包括电子商务目标营销,生物信息学(大规模科学数据分析)和自动计算(Web使用分析)等。然而,状态 基于图案的聚类方法(例如,pCluster算法)只能处理数千条记录的数据集,这使得它们不适合许多现实生活中的应用。 此外,除了巨大的数据量之外,许多数据集的特征还在于它们的顺序性,例如,客户购买记录和网络事件日志通常被建模为数据序列。 因此,重要的是启用基于图案的聚类方法i)处理大数据集,以及ii)发现嵌入在数据序列中的模式相似性。 这里提供了一种提供这种能力的新颖方法。

    Systems and methods for sequential modeling in less than one sequential scan
    15.
    发明申请
    Systems and methods for sequential modeling in less than one sequential scan 失效
    在不到一次顺序扫描中进行顺序建模的系统和方法

    公开(公告)号:US20060026110A1

    公开(公告)日:2006-02-02

    申请号:US10903336

    申请日:2004-07-30

    IPC分类号: G06F15/18

    CPC分类号: G06N99/005 Y10S707/99931

    摘要: Most recent research of scalable inductive learning on very large streaming dataset focuses on eliminating memory constraints and reducing the number of sequential data scans. However, state-of-the-art algorithms still require multiple scans over the data set and use sophisticated control mechanisms and data structures. There is discussed herein a general inductive learning framework that scans the dataset exactly once. Then, there is proposed an extension based on Hoeffding's inequality that scans the dataset less than once. The proposed frameworks are applicable to a wide range of inductive learners.

    摘要翻译: 对最大流式数据集的可伸缩归纳学习的最新研究着重于消除记忆限制并减少顺序数据扫描的次数。 然而,最先进的算法仍然需要对数据集进行多次扫描,并使用复杂的控制机制和数据结构。 这里讨论了一般的归纳学习框架,该框架一次扫描数据集。 然后,提出了一种基于Hoeffding不等式的扩展,可以扫描数据集不止一次。 提出的框架适用于广泛的归纳学习者。

    CHARGE SHARING SYSTEM AND METHOD OF LCOS DISPLAY
    17.
    发明申请
    CHARGE SHARING SYSTEM AND METHOD OF LCOS DISPLAY 审中-公开
    充电共享系统和LCOS显示方法

    公开(公告)号:US20120050245A1

    公开(公告)日:2012-03-01

    申请号:US13265447

    申请日:2010-09-16

    IPC分类号: G09G3/36 G06F3/038

    摘要: In the field of a liquid crystal on silicon (LCoS) display, a charge sharing system and a charge sharing method of an LCoS display are provided. The system includes: a column driving circuit, a row driving circuit, a pixel matrix, a control circuit, a gamma reference voltage circuit, and a first switching module. The column driving circuit includes a second switching module, a buffer, a shift register, a latch, and a digital-to-analog (D/A) converter. The second switching module is serially connected between the D/A converter and the buffer. The control circuit simultaneously outputs a signal for controlling turn-off of the buffer, a signal for controlling turn-off of a switch in the second switching module, and a signal for controlling turn-off of the gamma reference voltage circuit. The control circuit outputs a signal for controlling turn-on of a switch in the first switching module. The control circuit outputs a signal for controlling the row driving circuit. The control circuit simultaneously outputs a signal for controlling turn-on of the buffer, a signal for controlling turn-on of the switch in the second switching module, and a signal for controlling turn-on of the gamma reference voltage circuit.

    摘要翻译: 在液晶硅(LCoS)显示领域,提供LCoS显示的电荷共享系统和电荷共享方法。 该系统包括:列驱动电路,行驱动电路,像素矩阵,控制电路,伽马参考电压电路和第一开关模块。 列驱动电路包括第二开关模块,缓冲器,移位寄存器,锁存器和数模(D / A)转换器。 第二开关模块串联连接在D / A转换器和缓冲器之间。 控制电路同时输出用于控制缓冲器的关断的信号,用于控制第二开关模块中的开关的关断的信号和用于控制伽马参考电压电路的关断的信号。 控制电路输出用于控制第一开关模块中的开关的接通的信号。 控制电路输出用于控制行驱动电路的信号。 控制电路同时输出用于控制缓冲器的接通的信号,用于控制第二开关模块中的开关的导通的信号和用于控制伽马参考电压电路的导通的信号。

    Method and system for predicting resource usage of reusable stream processing elements
    18.
    发明授权
    Method and system for predicting resource usage of reusable stream processing elements 有权
    用于预测可重用流处理元素资源使用的方法和系统

    公开(公告)号:US07941387B2

    公开(公告)日:2011-05-10

    申请号:US11935079

    申请日:2007-11-05

    IPC分类号: G06F17/00 G06N5/02

    CPC分类号: G06N99/005

    摘要: A method is provided for generating a resource function estimate of resource usage by an instance of a processing element configured to consume zero or more input data streams in a stream processing system having a set of available resources that comprises receiving at least one specified performance metric for the zero or more input data streams and a processing power of the set of available resources, wherein one specified performance metric is stream rate; generating a multi-part signature of executable-specific information for the processing element and a multi-part signature of context-specific information for the instance; accessing a database of resource functions to identify a static resource function corresponding to the executable-specific information and a context-dependent resource function corresponding to the context-specific information; combining the static resource function and the context-dependent resource function to form a composite resource function for the instance; and applying the resource function to the at least one specified performance metric and the processing power to generate the resource function estimate of the at least one specified performance metric for processing by the instance.

    摘要翻译: 提供了一种用于通过被配置为在具有一组可用资源的流处理系统中消耗零个或多个输入数据流的处理元件的实例来生成资源使用的资源功能估计的方法,所述流处理系统包括:一组可用资源,其包括接收至少一个指定的性能度量 零个或多个输入数据流和可用资源集合的处理能力,其中一个指定的性能度量是流速率; 生成用于处理元件的可执行特定信息的多部分签名和该实例的上下文特定信息的多部分签名; 访问资源功能的数据库以识别与所述可执行特定信息相对应的静态资源功能以及与所述上下文特定信息相对应的与上下文相关的资源功能; 结合静态资源功能和上下文相关资源功能,形成实例的复合资源功能; 以及将所述资源功能应用于所述至少一个指定的性能度量和所述处理能力,以生成所述至少一个指定的性能度量的资源功能估计,以供所述实例处理。

    SELF-POWERED IN-PIPE FLUID METER AND PIPING NETWORK COMPRISING A PLURALITY OF SUCH FLUID METERS
    19.
    发明申请
    SELF-POWERED IN-PIPE FLUID METER AND PIPING NETWORK COMPRISING A PLURALITY OF SUCH FLUID METERS 失效
    自供电的管内流量计和包含这种流量计的多重流量的管网

    公开(公告)号:US20100085211A1

    公开(公告)日:2010-04-08

    申请号:US12376733

    申请日:2007-08-06

    IPC分类号: G08B1/00

    CPC分类号: G01N33/18 G01F1/10 G01F15/063

    摘要: A self-powered in-pipe fluid meter to be mounted inside of a pipe carrying a fluid therein. The fluid meter comprises at least one sensing unit capable of measuring one or more parameters of the fluid inside of the pipe; a telemetric data transmission unit capable of telemetrically transmitting data including a measured fluid parameter to a host terminal and/or another fluid meter; and at least one fluid-driven power source unit capable of generating power from the fluid flow within the pipe and supplying power to the sensing unit and/or the transmission unit.

    摘要翻译: 一种自供电的管内流量计,安装在其中携带流体的管道的内部。 所述流量计包括能够测量所述管内部的流体的一个或多个参数的至少一个传感单元; 遥测数据传输单元,其能够将包括测量的流体参数的数据遥测地传送到主机终端和/或另一个流量计; 以及至少一个流体驱动的电源单元,其能够从管道内的流体流产生功率,并向感测单元和/或传输单元供电。

    METHOD AND APPARATUS FOR STRUCTURAL DATA CLASSIFICATION
    20.
    发明申请
    METHOD AND APPARATUS FOR STRUCTURAL DATA CLASSIFICATION 有权
    用于结构数据分类的方法和装置

    公开(公告)号:US20090319457A1

    公开(公告)日:2009-12-24

    申请号:US12141251

    申请日:2008-06-18

    IPC分类号: G06N5/02

    CPC分类号: G06N99/005

    摘要: Techniques for classifying structural data with skewed distribution are disclosed. By way of example, a method classifying structural input data comprises a computer system performing the following steps. Multiple classifiers are constructed, wherein each classifier is constructed on a subset of training data, using one or more selected composite features from the subset of training data. A consensus among the multiple classifiers is computed in accordance with a voting scheme such that at least a portion of the structural input data is assigned to a particular class in accordance with the computed consensus. Such techniques for structured data classification are capable of handling skewed class distribution and partial feature coverage issues.

    摘要翻译: 公开了分布具有偏斜分布的结构数据的技术。 作为示例,分类结构输入数据的方法包括执行以下步骤的计算机系统。 构建多个分类器,其中使用来自训练数据的子集的一个或多个选定的复合特征,在训练数据的子集上构建每个分类器。 根据投票方案计算多个分类器之间的共识,使得至少一部分结构输入数据根据所计算的一致性被分配给特定类别。 这种用于结构化数据分类的技术能够处理倾斜的类分布和部分特征覆盖问题。