Foreign-Key Detection
    81.
    发明申请
    Foreign-Key Detection 有权
    外键检测

    公开(公告)号:US20110208748A1

    公开(公告)日:2011-08-25

    申请号:US12709508

    申请日:2010-02-21

    CPC classification number: G06F17/30306

    Abstract: This patent application relates to foreign-key detection. One implementation obtains a set of data tables. This implementation automatically determines foreign-key relationships of columns from separate tables of the set.

    Abstract translation: 本专利申请涉及外键检测。 一个实现获得一组数据表。 此实现将自动确定集合的不同表中的列的外键关系。

    ERROR TOLERANT AUTOCOMPLETION
    83.
    发明申请
    ERROR TOLERANT AUTOCOMPLETION 审中-公开
    错误的自动化

    公开(公告)号:US20100325136A1

    公开(公告)日:2010-12-23

    申请号:US12490288

    申请日:2009-06-23

    CPC classification number: G06F17/276

    Abstract: Techniques for error-tolerant autocompletion are described. While displaying characters of an input string as they are inputted by a user, when a character is added to the input string by the user, matching strings may be selected from among a set of candidate strings by determining which of the candidate strings have a prefix whose characters match the characters of the input string within a given edit distance of the input string.

    Abstract translation: 描述了容错自动完成技术。 当用户输入输入字符串的字符时,当用户将字符添加到输入字符串时,可以通过确定哪个候选字符串具有前缀来从一组候选字符串中选择匹配字符串 其字符与输入字符串的给定编辑距离内的输入字符串的字符匹配。

    IDENTIFYING SYNONYMS OF ENTITIES USING A DOCUMENT COLLECTION
    84.
    发明申请
    IDENTIFYING SYNONYMS OF ENTITIES USING A DOCUMENT COLLECTION 有权
    使用文件收集识别实体的同义词

    公开(公告)号:US20100313258A1

    公开(公告)日:2010-12-09

    申请号:US12478120

    申请日:2009-06-04

    CPC classification number: G06F17/2795 G06F17/278

    Abstract: Identifying synonyms of entities using a collection of documents is disclosed herein. In some aspects, a document from a collection of documents may be analyzed to identify hit sequences that include one or more tokens (e.g., words, number, etc.). The hit sequences may then be used to generate discriminating token sets (DTS's) that are subsets of both the hit sequences and the entity names. The DTS's are matched with corresponding entity names, and then used to create DTS phrases by selecting adjacent text in the document that is proximate to the DTS. The DTS phrases may be analyzed to determine whether the corresponding DTS is synonyms of the entity name. In various aspects, the tokens of an associated entity name that are present in the DTS phrases are used to generate a score for the DTS. When the score at least reaches a threshold, the DTS may be designated as a synonym. A list of synonyms may be generated for each entity name.

    Abstract translation: 本文公开了使用文档集合识别实体的同义词。 在一些方面,可以分析来自文档集合的文档以识别包括一个或多个令牌(例如,单词,数字等)的命中序列。 然后可以使用命中序列来生成作为命中序列和实体名称的子集的识别令牌集(DTS's)。 DTS与相应的实体名称相匹配,然后用于通过选择靠近DTS的文档中的相邻文本来创建DTS短语。 可以分析DTS短语以确定对应的DTS是否是实体名称的同义词。 在各方面,使用存在于DTS短语中的关联实体名称的令牌来产生DTS的得分。 当分数至少达到阈值时,DTS可以被指定为同义词。 可以为每个实体名称生成同义词列表。

    IDENTIFYING SYNONYMS OF ENTITIES USING WEB SEARCH
    85.
    发明申请
    IDENTIFYING SYNONYMS OF ENTITIES USING WEB SEARCH 审中-公开
    使用WEB搜索识别实体的同步

    公开(公告)号:US20100293179A1

    公开(公告)日:2010-11-18

    申请号:US12465832

    申请日:2009-05-14

    CPC classification number: G06F16/951

    Abstract: Identifying synonyms of entities using web search results is disclosed herein. In some aspects, a candidate string of tokens of an entity name is selected as a search term. The search term is transmitted by a server to a search engine, which in turn, transmits search results back to the server after performing a search. The server analyzes the search results, generates a score based on the search results, and then determines a status (synonym or not a synonym) of the candidate string based on the score. In further aspects, additional candidate strings are designated as synonyms or not synonyms based on status of the searched candidate string by using relationships of a lattice formed from all possible candidate strings of the entity name.

    Abstract translation: 本文公开了使用网络搜索结果识别实体的同义词。 在某些方面,选择实体名称的令牌候选字符串作为搜索项。 搜索项由服务器发送到搜索引擎,搜索引擎又在执行搜索之后将搜索结果发送回服务器。 服务器分析搜索结果,根据搜索结果生成分数,然后根据分数确定候选字符串的状态(同义词或不是同义词)。 在另外的方面,通过使用由实体名称的所有可能候选字符串形成的格子的关系,基于搜索到的候选字符串的状态,将附加候选字符串指定为同义词或不是同义词。

    Systems and methods for estimating functional relationships in a database
    87.
    发明授权
    Systems and methods for estimating functional relationships in a database 有权
    用于估计数据库中的功能关系的系统和方法

    公开(公告)号:US07562067B2

    公开(公告)日:2009-07-14

    申请号:US11123901

    申请日:2005-05-06

    CPC classification number: G06F17/30536 Y10S707/99932

    Abstract: A system that facilitates estimating functional relationships associated with one or more columns in a database comprises a sampling component that receives a random sample of records within the database. An estimate generator component calculates an estimate of strength of functional relationships based at least in part upon the received samples. For example, the estimate generator component can calculate an estimate of strength of a column as a key column based at least in part upon the received samples.

    Abstract translation: 便于估计与数据库中的一个或多个列相关联的功能关系的系统包括接收数据库内的记录的随机抽样的采样组件。 估计生成器组件至少部分地基于所接收的样本来计算功能关系的强度的估计。 例如,估计生成器组件可以至少部分地基于所接收的样本来计算作为关键列的列的强度的估计。

    CONTINUOUS PHYSICAL DESIGN TUNING
    88.
    发明申请
    CONTINUOUS PHYSICAL DESIGN TUNING 审中-公开
    连续物理设计调谐

    公开(公告)号:US20080183764A1

    公开(公告)日:2008-07-31

    申请号:US11669807

    申请日:2007-01-31

    CPC classification number: G06F16/2453 G06F16/2272

    Abstract: Online physical design tuning is constantly monitoring database indexes and can effectively react to changes in a workload by modifying the physical design as needed. Algorithms can be utilized that take into account various criteria including storage constraints, update statements, and the cost of temporarily creating physical structures.

    Abstract translation: 在线物理设计调整是不断监视数据库索引,并可以根据需要修改物理设计,对工作负载的变化做出有效的反应。 可以利用考虑到各种标准的算法,包括存储约束,更新语句和临时创建物理结构的成本。

    Ranking database query results using probabilistic models from information retrieval
    89.
    发明授权
    Ranking database query results using probabilistic models from information retrieval 失效
    使用信息检索的概率模型对数据库查询结果进行排序

    公开(公告)号:US07383262B2

    公开(公告)日:2008-06-03

    申请号:US10879450

    申请日:2004-06-29

    CPC classification number: G06Q30/0603 G06Q50/16 Y10S707/99937

    Abstract: A system and methods rank results of database queries. An automated approach for ranking database query results is disclosed that leverages data and workload statistics and associations. Ranking functions are based upon the principles of probabilistic models from Information Retrieval that are adapted for structured data. The ranking functions are encoded into an intermediate knowledge representation layer. The system is generic, as the ranking functions can be further customized for different applications. Benefits of the disclosed system and methods include the use of adapted probabilistic information retrieval (PIR) techniques that leverage relational/structured data, such as columns, to provide natural groupings of data values. This permits the inference and use of pair-wise associations between data values across columns, which are usually not possible with text data.

    Abstract translation: 系统和方法对数据库查询的结果进行排序。 披露了一种用于排名数据库查询结果的自动化方法,它利用数据和工作量统计信息和关联。 排名函数基于适用于结构化数据的信息检索的概率模型的原理。 排序函数被编码为中间知识表示层。 该系统是通用的,因为排序功能可以针对不同的应用进一步定制。 所公开的系统和方法的优点包括使用适应的概率信息检索(PIR)技术来利用诸如列的关系/结构化数据来提供数据值的自然分组。 这允许推断和使用跨列之间的数据值之间的成对关联,这通常不可能与文本数据。

Patent Agency Ranking