Data Services for Enterprises Leveraging Search System Data Assets
    2.
    发明申请
    Data Services for Enterprises Leveraging Search System Data Assets 审中-公开
    企业数据服务利用搜索系统数据资产

    公开(公告)号:US20130346464A1

    公开(公告)日:2013-12-26

    申请号:US13527601

    申请日:2012-06-20

    IPC分类号: G06F15/16

    CPC分类号: G06Q10/10

    摘要: A data service system is described herein which processes raw data assets from at least one network-accessible system (such as a search system), to produce processed data assets. Enterprise applications can then leverage the processed data assets to perform various environment-specific tasks. In one implementation, the data service system can generate any of: synonym resources for use by an enterprise application in providing synonyms for specified terms associated with entities; augmentation resources for use by an enterprise application in providing supplemental information for specified seed information; and spelling-correction resources for use by an enterprise application in providing spelling information for specified terms, and so on.

    摘要翻译: 本文描述了一种数据服务系统,其处理来自至少一个网络可访问系统(例如搜索系统)的原始数据资产以产生处理的数据资产。 企业应用程序可以利用已处理的数据资产来执行各种环境特定任务。 在一个实现中,数据服务系统可以生成以下任何一种:供企业应用使用的同义词资源,为与实体相关联的指定术语提供同义词; 增加资源供企业应用用于提供指定种子信息的补充信息; 以及企业应用程序为指定的术语提供拼写信息的拼写纠正资源等。

    Foreign-Key Detection
    3.
    发明申请
    Foreign-Key Detection 有权
    外键检测

    公开(公告)号:US20110208748A1

    公开(公告)日:2011-08-25

    申请号:US12709508

    申请日:2010-02-21

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30306

    摘要: This patent application relates to foreign-key detection. One implementation obtains a set of data tables. This implementation automatically determines foreign-key relationships of columns from separate tables of the set.

    摘要翻译: 本专利申请涉及外键检测。 一个实现获得一组数据表。 此实现将自动确定集合的不同表中的列的外键关系。

    Foreign-key detection
    4.
    发明授权
    Foreign-key detection 有权
    外键检测

    公开(公告)号:US08386529B2

    公开(公告)日:2013-02-26

    申请号:US12709508

    申请日:2010-02-21

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30306

    摘要: This patent application relates to foreign-key detection. One implementation obtains a set of data tables. This implementation automatically determines foreign-key relationships of columns from separate tables of the set.

    摘要翻译: 本专利申请涉及外键检测。 一个实现获得一组数据表。 此实现将自动确定集合的不同表中的列的外键关系。

    Solar light
    5.
    外观设计

    公开(公告)号:USD938638S1

    公开(公告)日:2021-12-14

    申请号:US29728175

    申请日:2020-03-17

    申请人: Zhimin Chen

    设计人: Zhimin Chen

    PRODUCT SYNTHESIS FROM MULTIPLE SOURCES
    6.
    发明申请
    PRODUCT SYNTHESIS FROM MULTIPLE SOURCES 有权
    多源产品合成

    公开(公告)号:US20110264598A1

    公开(公告)日:2011-10-27

    申请号:US12764676

    申请日:2010-04-21

    IPC分类号: G06Q10/00 G06Q30/00

    摘要: Methods and systems for automatically synthesizing product information from multiple data sources into an on-line catalog are disclosed, and in particular, for automatically synthesizing the product information based on attribute-value pairs. Information for a product may be obtained, via entity extraction, feed ingestion, and other mechanisms, from a plurality of structured and unstructured data sources having different taxonomies and schemas. Product information may additionally or alternatively be obtained or derived based on popularity data. The product information may be cleansed, segmented and normalized. The product information may be clustered so closest products, attribute names and attribute values are associated. A representative value for an attribute name may be determined, and the on-line catalog may be updated so that entries are comprehensive, meaningful and useful to a catalog user. Updates from at least 500 million different data sources may be scheduled to occur as frequently as several times daily.

    摘要翻译: 公开了用于将产品信息从多个数据源自动合成到在线目录中的方法和系统,特别地,用于基于属性值对自动合成产品信息。 可以通过实体提取,饲料摄取和其他机制从具有不同分类和模式的多个结构化和非结构化数据源获得信息。 产品信息可以另外地或替代地基于流行度数据获得或导出。 产品信息可以被清洁,分段和归一化。 产品信息可能被聚集,因此最接近的产品,属性名称和属性值相关联。 可以确定属性名称的代表值,并且可以更新在线目录,使得条目对目录用户是全面的,有意义的和有用的。 可能会安排从至少5亿个不同数据源进行更新,频繁发生,每天多次。

    Method, system and computer program for inserting records into a database
    7.
    发明授权
    Method, system and computer program for inserting records into a database 有权
    用于将记录插入数据库的方法,系统和计算机程序

    公开(公告)号:US07860845B2

    公开(公告)日:2010-12-28

    申请号:US11781841

    申请日:2007-07-23

    IPC分类号: G06F7/00

    摘要: For a data processing system having memory for storing a database, a method, a system and a computer program product for directing the data processing system to process a record to be inserted into the database is disclosed. The database includes a plurality of base tables. The method includes the steps of making a record copy matching the record, for each base table to be selected from the plurality of base tables: providing a base table candidate indication for a selected base table, the base table candidate indication indicating whether the selected base table is a candidate base table that may receive the record, the base table candidate indication being determined on an outcome of executing before triggers and an outcome of testing constraints in association with the record copy, the before triggers and the constraints being associated with the selected base table; and restoring the record copy so that the record copy matches the record before providing a next subsequent base table candidate indication for another base table to be selected.

    摘要翻译: 对于具有用于存储数据库的存储器的数据处理系统,公开了一种用于指导数据处理系统处理要插入数据库的记录的方法,系统和计算机程序产品。 数据库包括多个基表。 该方法包括以下步骤:对于从多个基表中选择的每个基表,制作与记录相匹配的记录拷贝:为所选择的基表提供基表候选指示,所述基表候选指示指示所选基数 表是可以接收记录的候选基表,基于表的候选指示是根据在触发之前执行的结果确定的,以及与记录副本相关联的测试约束的结果,前触发和约束与所选择的相关联 基地台 以及恢复所述记录副本,使得所述记录副本与为所选择的另一基表提供下一个后续基表候选指示之前匹配所述记录。

    DATA PROFILE COMPUTATION
    8.
    发明申请
    DATA PROFILE COMPUTATION 有权
    数据配置文件计算

    公开(公告)号:US20090006392A1

    公开(公告)日:2009-01-01

    申请号:US11769050

    申请日:2007-06-27

    IPC分类号: G06F7/06 G06F17/30

    CPC分类号: G06F17/30536

    摘要: Architecture that provides a data profile computation technique which employs key profile computation and data pattern profile computation. Key profile computation in a data table includes both exact keys as well as approximate keys, and is based on key strengths. A key strength of 100% is an exact key, and any other percentage in an approximate key. The key strength is estimated based on the number of table rows that have duplicated attribute values. Only column sets that exceed a threshold value are returned. Pattern profiling identifies a small set of regular expression patterns which best describe the patterns within a given set of attribute values. Pattern profiling includes three phases: a first phases for determining token regular expressions, a second phase for determining candidate regular expressions, and a third phase for identifying the best regular expressions of the candidates that match the attribute values.

    摘要翻译: 提供采用关键轮廓计算和数据模式轮廓计算的数据轮廓计算技术的架构。 数据表中的关键轮廓计算包括精密键和近似键,并且基于关键优点。 100%的关键优势是一个确切的关​​键,其中一个关键的任何其他百分比。 基于具有重复的属性值的表行的数量来估计关键强度。 只返回超过阈值的列集。 模式分析标识一组最佳描述一组给定属性值中的模式的正则表达式模式。 模式分析包括三个阶段:用于确定令牌正则表达式的第一阶段,用于确定候选正则表达式的第二阶段,以及用于识别与属性值匹配的候选的最佳正则表达式的第三阶段。

    SYSTEM AND COMPUTER PROGRAM FOR INSERTING RECORDS INTO A DATABASE
    9.
    发明申请
    SYSTEM AND COMPUTER PROGRAM FOR INSERTING RECORDS INTO A DATABASE 有权
    将记录插入数据库的系统和计算机程序

    公开(公告)号:US20080140689A1

    公开(公告)日:2008-06-12

    申请号:US12020462

    申请日:2008-01-25

    IPC分类号: G06F7/00

    摘要: For a data processing system having memory for storing a database, a method, a system and a computer program product for directing the data processing system to process a record to be inserted into the database is disclosed. The database includes a plurality of base tables. The method includes the steps of making a record copy matching the record, for each base table to be selected from the plurality of base tables: providing a base table candidate indication for a selected base table, the base table candidate indication indicating whether the selected base table is a candidate base table that may receive the record, the base table candidate indication being determined on an outcome of executing before triggers and an outcome of testing constraints in association with the record copy, the before triggers and the constraints being associated with the selected base table; and restoring the record copy so that the record copy matches the record before providing a next subsequent base table candidate indication for another base table to be selected.

    摘要翻译: 对于具有用于存储数据库的存储器的数据处理系统,公开了一种用于指导数据处理系统处理要插入数据库的记录的方法,系统和计算机程序产品。 数据库包括多个基表。 该方法包括以下步骤:对于从多个基表中选择的每个基表,制作与记录相匹配的记录拷贝:为所选择的基表提供基表候选指示,所述基表候选指示指示所选基数 表是可以接收记录的候选基表,基于表的候选指示是根据在触发之前执行的结果确定的,以及与记录副本相关联的测试约束的结果,前触发和约束与所选择的相关联 基地台 以及恢复所述记录副本,使得所述记录副本与为所选择的另一基表提供下一个后续基表候选指示之前匹配所述记录。

    Efficient computation of multiple group by queries
    10.
    发明申请
    Efficient computation of multiple group by queries 审中-公开
    通过查询高效计算多组

    公开(公告)号:US20060253422A1

    公开(公告)日:2006-11-09

    申请号:US11124516

    申请日:2005-05-06

    IPC分类号: G06F17/30

    CPC分类号: G06F16/24535

    摘要: Systems and methodologies for computation of multiple group by queries via an optimizer that examines the space of plans in a systematic and cost based manner. The optimizer includes a merging component to merge pairs of sub plans to facilitate a plan choice with a lowest cost. The merging component can take as input two sub plans (e.g., sub plan P1 with root node V1 and sub plan P2 with root node V2, wherein each sub plan is a sub-tree of a logical plan whose root node is directly pointed to a Relation “R”), to return a set of sub-plans as out put with a root node V1∪V2 that is the smallest relation from which both V1 and V2 can be computed.

    摘要翻译: 用于通过查询计算多组的系统和方法,该优化器以系统和成本为基础的方式检查计划的空间。 优化器包括合并组件以合并子计划对,以便以最低成本进行计划选择。 合并组件可以将根节点V <1>和子计划P <2> 的子计划(例如,子计划P&lt; 1&lt; 1&gt; 节点V 2,其中每个子计划是逻辑计划的子树,其根节点直接指向关系“R”),以返回一组子计划,如与 作为V SUB 1和V 2 2两者之间的最小关系的根节点V 1 2 V 2 2&lt; 1&lt; 1&lt; 计算。