METHOD AND APPARATUS FOR GENERATING A MEDIA COMPILATION BASED ON CRITERIA BASED SAMPLING
    101.
    Invention application
    Status: In force

    Publication number: US20140122485A1

    Publication date: 2014-05-01

    Application number: US13665457

    Filing date: 2012-10-31

    IPC classification: G06F17/30

    Abstract: An approach is provided for initiating generation of a media compilation based on one or more sampling criteria. A sampling platform determines at least one subset of one or more media items captured of at least one event. The sampling platform also partitions the at least one subset of the one or more media items into one or more bins and generates at least one compilation of the at least one subset of the one or more items based, at least in part, on whether the one or more media items in the one or more bins at least substantially meet one or more sampling criteria.

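    The binning and sampling steps described in this abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented method: the MediaItem fields, the fixed-width time bins, and the minimum-quality criterion are all hypothetical, since the abstract does not specify how bins or sampling criteria are defined.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class MediaItem:
    # Hypothetical fields; the abstract does not define an item schema.
    item_id: str
    timestamp: float  # seconds since the start of the event
    quality: float    # assumed score in [0, 1]


def partition_into_bins(items, bin_width):
    """Group items into fixed-width time bins (an assumed binning rule)."""
    bins = defaultdict(list)
    for item in items:
        bins[int(item.timestamp // bin_width)].append(item)
    return dict(bins)


def compile_media(items, bin_width=10.0, min_quality=0.5):
    """Take the best item from each bin that meets the assumed criterion."""
    compilation = []
    bins = partition_into_bins(items, bin_width)
    for index in sorted(bins):
        candidates = [i for i in bins[index] if i.quality >= min_quality]
        if candidates:
            compilation.append(max(candidates, key=lambda i: i.quality))
    return compilation


items = [
    MediaItem("a", 1.0, 0.9),
    MediaItem("b", 4.0, 0.6),
    MediaItem("c", 12.0, 0.3),
    MediaItem("d", 25.0, 0.7),
]
selected_ids = [item.item_id for item in compile_media(items)]
```

    In this sketch a bin whose items all fall below the criterion contributes nothing to the compilation, which is one possible reading of "at least substantially meet".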

    Method and apparatus for adding a database partition
    103.
    Granted patent
    Status: In force

    Publication number: US08209306B2

    Publication date: 2012-06-26

    Application number: US13180866

    Filing date: 2011-07-12

    IPC classification: G06F17/30

    Abstract: A data repository system and method are provided. A method in accordance with an embodiment includes an operation that can be used to port data from one or more existing database partitions to new database partitions according to a minimally progressive hash. The method can be used to increase the overall size of databases while a system runs hot, with little or no downtime.

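    The abstract does not define the "minimally progressive hash", so the Python sketch below substitutes a different, well-known technique with a similar goal: rendezvous (highest-random-weight) hashing. When a partition is added, the only rows that move are those now owned by the new partition. The partition names and row keys are hypothetical.

```python
import hashlib


def _score(key, partition):
    """Deterministic pseudo-random score for a (key, partition) pair."""
    digest = hashlib.sha256(f"{partition}:{key}".encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def assign(key, partitions):
    """Rendezvous hashing: the partition with the highest score owns the key."""
    return max(partitions, key=lambda p: _score(key, p))


keys = [f"row-{n}" for n in range(1000)]
old_partitions = ["p0", "p1", "p2"]
new_partitions = old_partitions + ["p3"]

before = {k: assign(k, old_partitions) for k in keys}
after = {k: assign(k, new_partitions) for k in keys}

# Only rows whose owner changed need to be ported to the new partition.
moved = [k for k in keys if before[k] != after[k]]
```

    Because existing partitions never exchange rows with each other under this scheme, the data that has to be ported while the system stays online is limited to the new partition's share.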

    Optimization technique for dealing with data skew on foreign key joins
    104.
    Granted patent
    Status: In force

    Publication number: US08078610B2

    Publication date: 2011-12-13

    Application number: US12055535

    Filing date: 2008-03-26

    Applicant: Stephen Molini

    Inventor: Stephen Molini

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F17/30466 Y10S707/968

    Abstract: A method for determining when a database system query optimizer should employ join skew avoidance steps. The method includes dynamically calculating the worst-case anticipated frequency distribution for a particular relation along a particular set of join column(s) at query execution time. The calculated frequency distribution value is compared to a skew threshold, the skew threshold representing the number of rows on the same distinct value that would lead to avoidable processing inefficiencies. It is then determined that the database system query optimizer should employ join skew avoidance steps if the calculated frequency distribution value exceeds the skew threshold.

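    The threshold test in this abstract can be illustrated with a short Python sketch. It assumes the worst-case frequency is simply the largest number of rows sharing one distinct join-key value, counted directly from in-memory rows; a real optimizer would work from statistics, and the abstract does not say how the value is derived. The function names and sample rows are hypothetical.

```python
from collections import Counter


def worst_case_frequency(rows, join_columns):
    """Largest number of rows sharing one distinct join-key value."""
    counts = Counter(tuple(row[c] for c in join_columns) for row in rows)
    return max(counts.values(), default=0)


def should_avoid_skew(rows, join_columns, skew_threshold):
    """True when the worst-case frequency exceeds the skew threshold."""
    return worst_case_frequency(rows, join_columns) > skew_threshold


# Eight orders for customer 1, one each for customers 2 and 3.
orders = [{"customer_id": 1}] * 8 + [{"customer_id": 2}, {"customer_id": 3}]
use_skew_avoidance = should_avoid_skew(orders, ["customer_id"], skew_threshold=5)
```

    The comparison is strict, matching the abstract's wording that avoidance steps apply when the calculated value "exceeds" the threshold.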

    Methods and apparatus for reducing storage size

    Publication number: US07885988B2

    Publication date: 2011-02-08

    Application number: US11774333

    Filing date: 2007-07-06

    IPC classification: G06F17/00

    Abstract: Prediction-based compression engines are spoon-fed with sequentially efficiently compressible (SEC) streams of input data that make it possible for the compression engines to more efficiently compress or otherwise compact the incoming data than would be possible with streams of input data accepted on a TV-raster scan basis. Various techniques are disclosed for intentionally forming SEC input data streams. Among these are the tight packing of alike files or fragments into concatenation suitcases and the decomposition of files into substantially predictably consistent (SPC) fragments or segments that are routed to different suitcases according to their type. In a graphics-directed embodiment, image frames are partitioned into segment areas that are internally SPC and multidirectional walks (i.e., U-turning walks) are defined in the segment areas where these defined walks are traced during compression and also during decompression. A variety of pre-compression data transformation methods are disclosed for causing apparently random data sequences to appear more compressibly alike to each other. The methods are usable in systems that permit substantially longer times for data compaction operations than for data decompaction operations.
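
    The contrast between a TV-raster scan and a U-turning walk can be sketched in Python as follows. This assumes the simplest possible U-turning walk, a serpentine traversal that reverses direction on every row of a rectangular segment; the patent's actual walk definitions are not given in the abstract, and the function names are hypothetical.

```python
def u_turn_walk(width, height):
    """Yield (row, col) cells row by row, reversing direction on each row."""
    for row in range(height):
        cols = range(width) if row % 2 == 0 else range(width - 1, -1, -1)
        for col in cols:
            yield (row, col)


def raster_walk(width, height):
    """Yield (row, col) cells left to right on every row (TV-raster order)."""
    for row in range(height):
        for col in range(width):
            yield (row, col)


def max_step(path):
    """Largest Manhattan distance between consecutive cells of a walk."""
    return max(
        abs(r1 - r0) + abs(c1 - c0)
        for (r0, c0), (r1, c1) in zip(path, path[1:])
    )


walk = list(u_turn_walk(3, 2))
raster = list(raster_walk(3, 2))
```

    In the serpentine walk every consecutive pair of samples is adjacent, whereas the raster order jumps back across the segment at each row end, which is one plausible reason a predictor would see a smoother input stream.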

    Geometrization for pattern recognition, data analysis, data merging, and multiple criteria decision making
    106.
    Granted patent
    Status: In force

    Publication number: US07885966B2

    Publication date: 2011-02-08

    Application number: US11737063

    Filing date: 2007-04-18

    Applicant: Abel Wolman

    Inventor: Abel Wolman

    IPC classification: G06F17/30

    Abstract: An analyzer/classifier/synthesizer/prioritizing tool for data comprises use of an admissible geometrization process with data transformed and partitioned by an input process into one or more input matrices and one or more partition classes and one or more scale groups. The data to be analyzed/classified/synthesized/prioritized is processed by an admissible geometrization technique such as 2-partition modified individual differences multidimensional scaling (2p-IDMDS) to produce at least a measure of geometric fit. Using the measure of geometric fit and possibly other 2p-IDMDS output, a back end process analyzes, synthesizes, classifies, and prioritizes data through patterns, structure, and relations within the data.


    Apparatus and methods for displaying and determining dependency relationships among subsystems in a computer software system
    107.
    Granted patent
    Status: In force

    Publication number: US07822795B2

    Publication date: 2010-10-26

    Application number: US12017196

    Filing date: 2008-01-21

    IPC classification: G06F17/30

    Abstract: A method is provided for managing, in a computer system, design of a database system having a set of schemata. The method includes, in a first computer process, extracting dependencies from the database system and identifying the set of schemata. The method further includes, for each specific schema in the set of schemata, creating in a second computer process a partition that, in turn, contains a further partition for each element of the specific schema, so as to establish a hierarchy of partitions in accordance with the structure of the set of schemata. The method also includes storing a representation of the database system including subsystems, dependency relationships among the subsystems, and the hierarchy of partitions. Finally, the method includes providing a graphical output from the computer system, based on the stored representation, in which appears a display of the subsystems in a hierarchy of partitions within a dependency structure matrix, such matrix graphically indicating the dependency relationships among subsystems. Related apparatus and computer products are also provided.

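    A dependency structure matrix over a two-level partition hierarchy, as described in this abstract, can be sketched in Python as follows. The schema names, the "schema.element" naming, and the convention that a row depends on a column are assumptions made for illustration; the abstract does not fix any of them.

```python
def flatten_partitions(schemata):
    """List every schema element as 'schema.element', grouped by schema."""
    return [
        f"{schema}.{element}"
        for schema, elements in schemata.items()
        for element in elements
    ]


def build_dsm(subsystems, dependencies):
    """Square 0/1 matrix; cell [i][j] is 1 if subsystem i depends on j."""
    index = {name: i for i, name in enumerate(subsystems)}
    matrix = [[0] * len(subsystems) for _ in subsystems]
    for source, target in dependencies:
        matrix[index[source]][index[target]] = 1
    return matrix


# Hypothetical schemata: each schema is a partition holding its elements.
schemata = {"sales": ["orders", "customers"], "hr": ["employees"]}
subsystems = flatten_partitions(schemata)
dependencies = [
    ("sales.orders", "sales.customers"),
    ("sales.orders", "hr.employees"),
]
dsm = build_dsm(subsystems, dependencies)
```

    Keeping elements of the same schema adjacent in the row and column order is what lets a display draw each schema as a block along the matrix diagonal.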

    USER DEFINED DATA PARTITIONING (UDP) - GROUPING OF DATA BASED ON COMPUTATION MODEL
    108.
    Invention application
    Status: In force

    Publication number: US20100192148A1

    Publication date: 2010-07-29

    Application number: US12358995

    Filing date: 2009-01-23

    Abstract: Methods, systems, and computer program products are provided for generating application-aware data partitioning to support parallel computing. A label for a user defined data partitioning (UDP) key is generated by a labeling process to configure data partitions of original data. The UDP is labeled by the labeling process to include at least one key property excluded from the original data. The data partitions are evenly distributed to co-locate and balance the data partitions and corresponding computations performed by computational servers. A data record of the data partitions is retrieved by performing an all-node parallel search of the computational servers using the UDP key.


    Data summarization
    109.
    Granted patent
    Status: In force

    Publication number: US07747624B2

    Publication date: 2010-06-29

    Application number: US10424850

    Filing date: 2003-04-29

    IPC classification: G06F17/30

    Abstract: A database management system provides the capability to perform cluster analysis and provides improved performance in model building and data mining, good integration with the various databases throughout the enterprise, and flexible specification and adjustment of the models being built, but which provides data mining functionality that is accessible to users having limited data mining expertise and which provides reductions in development times and costs for data mining projects. A database management system for in-database clustering comprises a first data table and a second data table, each data table including a plurality of rows of data, means for building a clustering model using the first data table using a portion of the first data table, wherein the portion of the first data table is selected by partitioning, density summarization, or active sampling of the first data table, and means for applying the clustering model using the second data table to generate apply output data.


    Method and system for managing data transaction requests
    110.
    Granted patent
    Status: In force

    Publication number: US07650338B2

    Publication date: 2010-01-19

    Application number: US10562459

    Filing date: 2004-05-12

    IPC classification: G06F17/30

    Abstract: A method and system is provided to process data transactions in a data store including a plurality of databases. The system may comprise a computer interface module to receive a data transaction request from at least one requesting computer and a data store interface module to interface the system to the plurality of databases. The system also includes a data access layer defining an abstraction layer to identify at least one database of the plurality of databases. The data transaction request may be an object orientated request and the plurality of databases may be horizontally distributed wherein the data access layer defines an object orientated abstraction layer between the computer interface module and the plurality of databases. In one embodiment a data dependent routing module is provided that generates a query to a database that is identified based on content of the data in the data transaction request.

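    Data-dependent routing across horizontally distributed databases, as described in this abstract, can be sketched in Python as follows. The DataAccessLayer class, the modulo rule on a user_id field, the users table, and the database names are all hypothetical; the abstract only states that the target database is identified from the content of the request.

```python
class DataAccessLayer:
    """Abstraction layer that hides which database holds a given record."""

    def __init__(self, databases):
        self._databases = list(databases)

    def route(self, request):
        """Pick a database from the request content (assumed modulo rule)."""
        return self._databases[request["user_id"] % len(self._databases)]

    def build_query(self, request):
        """Return the target database plus a parameterized query for it."""
        database = self.route(request)
        sql = "SELECT * FROM users WHERE user_id = ?"
        return database, sql, (request["user_id"],)


layer = DataAccessLayer(["db0", "db1", "db2"])
target, sql, params = layer.build_query({"user_id": 7})
```

    The caller never names a database: it submits a request object, and the layer derives both the destination and the query from the data in that request.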