TAGGING ENTITIES WITH DESCRIPTIVE PHRASES
    71.
    发明申请
    TAGGING ENTITIES WITH DESCRIPTIVE PHRASES 有权
    用描述性标签标签实体

    公开(公告)号:US20130132381A1

    公开(公告)日:2013-05-23

    申请号:US13298349

    申请日:2011-11-17

    CPC classification number: G06F17/30864 G06F17/30277

    Abstract: A plurality of description phrases associated with a first domain may be determined, based on an analysis of a first plurality of documents to determine co-occurrences of the description phrases with one or more name labels associated with the first domain. An entity associated with the first domain may be obtained. An analysis of a second plurality of documents may be initiated to identify co-occurrences of mentions of the obtained entity and one or more of the plurality of description phrases, and contexts associated with each of the co-occurrences of the mentions and description phrases, in each one of the second plurality of documents. A description tag association between the obtained entity and one of the description phrases may be determined, based on an analysis of the identified contexts.

    Abstract translation: 可以基于第一多个文档的分析来确定与第一域相关联的多个描述短语,以确定描述短语与与第一域相关联的一个或多个名称标签的共同出现。 可以获得与第一域相关联的实体。 可以启动对第二多个文档的分析,以识别获得的实体的提及和多个描述短语中的一个或多个以及与提及和描述短语的共同出现中的每一个相关联的上下文, 在第二多个文档的每一个中。 可以基于对所识别的上下文的分析来确定获得的实体与描述短语之一之间的描述标签关联。

    Constrained physical design tuning
    72.
    发明授权
    Constrained physical design tuning 有权
    约束物理设计调谐

    公开(公告)号:US08140548B2

    公开(公告)日:2012-03-20

    申请号:US12191303

    申请日:2008-08-13

    CPC classification number: G06F17/30312

    Abstract: Described is a constraint language and related technology by which complex constraints may be used in selecting configurations for use in physical database design tuning. The complex constraint (or constraints) is processed, e.g., in a search framework, to determine and output at least one configuration that meets the constraint, e.g., a best configuration found before a stopping condition is met. The search framework processes a current configuration into candidate configurations, including by searching for candidate configurations from a current configuration based upon a complex constraint, iteratively evaluating a search space until a stopping condition is satisfied, using transformation rules to generate new candidate configurations, and selecting a best candidate configuration. Transformation rules and pruning rules are applied to efficiently perform the search. Constraints may be specified as assertions that need to be satisfied, or as soft assertions that come close to satisfying the constraint.

    Abstract translation: 描述了一种约束语言和相关技术,通过该约束语言和相关技术,复杂约束可用于选择用于物理数据库设计调优的配置。 复杂约束(或约束)例如在搜索框架中被处理,以确定和输出满足约束的至少一个配置,例如在满足停止条件之前找到的最佳配置。 搜索框架将当前配置处理成候选配置,包括通过基于复杂约束从当前配置中搜索候选配置,迭代地评估搜索空间直到停止条件满足,使用转换规则来生成新的候选配置,以及选择 最佳候选人配置。 应用变换规则和修剪规则来有效地执行搜索。 约束可以被指定为需要满足的断言,或者是接近满足约束的软断言。

    Taxonomy Editor
    73.
    发明申请
    Taxonomy Editor 有权
    分类编辑器

    公开(公告)号:US20110214080A1

    公开(公告)日:2011-09-01

    申请号:US12713190

    申请日:2010-02-26

    CPC classification number: G06F17/30734

    Abstract: This patent application relates to taxonomy editing. One implementation involves a taxonomy editor configured to generate a visual representation of a taxonomy associated with a set of scientific papers. The taxonomy editor includes a properties module configured to identify properties relating to an individual node of the taxonomy and a statistics module configured to determine trends relating to the individual node. The taxonomy editor further includes a similarity module configured to evaluate keyword similarity relative to individual scientific papers associated with the individual node. The taxonomy editor also includes a suggestion module configured to utilize the properties, the trends and the keyword similarity to identify potential modifications to the taxonomy. The taxonomy editor is further configured to present at least some of the potential modifications, the properties, the trends, and the keyword similarity concurrently with the visual representation of the taxonomy.

    Abstract translation: 该专利申请涉及分类编辑。 一个实现涉及分类编辑器,其被配置为生成与一组科学论文相关联的分类法的视觉表示。 分类编辑器包括被配置为识别与分类法的单个节点相关的属性的属性模块,以及被配置为确定与各个节点相关的趋势的统计模块。 分类编辑器还包括相似度模块,其被配置为评估与单个节点相关联的各个科学论文的关键字相似度。 分类编辑器还包括配置为利用属性,趋势和关键字相似性的建议模块来识别对分类法的潜在修改。 分类编辑器还被配置为与分类法的视觉表示同时呈现至少一些潜在的修改,属性,趋势和关键词相似性。

    Foreign-Key Detection
    74.
    发明申请
    Foreign-Key Detection 有权
    外键检测

    公开(公告)号:US20110208748A1

    公开(公告)日:2011-08-25

    申请号:US12709508

    申请日:2010-02-21

    CPC classification number: G06F17/30306

    Abstract: This patent application relates to foreign-key detection. One implementation obtains a set of data tables. This implementation automatically determines foreign-key relationships of columns from separate tables of the set.

    Abstract translation: 本专利申请涉及外键检测。 一个实现获得一组数据表。 此实现将自动确定集合的不同表中的列的外键关系。

    ERROR TOLERANT AUTOCOMPLETION
    76.
    发明申请
    ERROR TOLERANT AUTOCOMPLETION 审中-公开
    错误的自动化

    公开(公告)号:US20100325136A1

    公开(公告)日:2010-12-23

    申请号:US12490288

    申请日:2009-06-23

    CPC classification number: G06F17/276

    Abstract: Techniques for error-tolerant autocompletion are described. While displaying characters of an input string as they are inputted by a user, when a character is added to the input string by the user, matching strings may be selected from among a set of candidate strings by determining which of the candidate strings have a prefix whose characters match the characters of the input string within a given edit distance of the input string.

    Abstract translation: 描述了容错自动完成技术。 当用户输入输入字符串的字符时,当用户将字符添加到输入字符串时,可以通过确定哪个候选字符串具有前缀来从一组候选字符串中选择匹配字符串 其字符与输入字符串的给定编辑距离内的输入字符串的字符匹配。

    IDENTIFYING SYNONYMS OF ENTITIES USING A DOCUMENT COLLECTION
    77.
    发明申请
    IDENTIFYING SYNONYMS OF ENTITIES USING A DOCUMENT COLLECTION 有权
    使用文件收集识别实体的同义词

    公开(公告)号:US20100313258A1

    公开(公告)日:2010-12-09

    申请号:US12478120

    申请日:2009-06-04

    CPC classification number: G06F17/2795 G06F17/278

    Abstract: Identifying synonyms of entities using a collection of documents is disclosed herein. In some aspects, a document from a collection of documents may be analyzed to identify hit sequences that include one or more tokens (e.g., words, number, etc.). The hit sequences may then be used to generate discriminating token sets (DTS's) that are subsets of both the hit sequences and the entity names. The DTS's are matched with corresponding entity names, and then used to create DTS phrases by selecting adjacent text in the document that is proximate to the DTS. The DTS phrases may be analyzed to determine whether the corresponding DTS is synonyms of the entity name. In various aspects, the tokens of an associated entity name that are present in the DTS phrases are used to generate a score for the DTS. When the score at least reaches a threshold, the DTS may be designated as a synonym. A list of synonyms may be generated for each entity name.

    Abstract translation: 本文公开了使用文档集合识别实体的同义词。 在一些方面,可以分析来自文档集合的文档以识别包括一个或多个令牌(例如,单词,数字等)的命中序列。 然后可以使用命中序列来生成作为命中序列和实体名称的子集的识别令牌集(DTS's)。 DTS与相应的实体名称相匹配,然后用于通过选择靠近DTS的文档中的相邻文本来创建DTS短语。 可以分析DTS短语以确定对应的DTS是否是实体名称的同义词。 在各方面,使用存在于DTS短语中的关联实体名称的令牌来产生DTS的得分。 当分数至少达到阈值时,DTS可以被指定为同义词。 可以为每个实体名称生成同义词列表。

    IDENTIFYING SYNONYMS OF ENTITIES USING WEB SEARCH
    78.
    发明申请
    IDENTIFYING SYNONYMS OF ENTITIES USING WEB SEARCH 审中-公开
    使用WEB搜索识别实体的同步

    公开(公告)号:US20100293179A1

    公开(公告)日:2010-11-18

    申请号:US12465832

    申请日:2009-05-14

    CPC classification number: G06F16/951

    Abstract: Identifying synonyms of entities using web search results is disclosed herein. In some aspects, a candidate string of tokens of an entity name is selected as a search term. The search term is transmitted by a server to a search engine, which in turn, transmits search results back to the server after performing a search. The server analyzes the search results, generates a score based on the search results, and then determines a status (synonym or not a synonym) of the candidate string based on the score. In further aspects, additional candidate strings are designated as synonyms or not synonyms based on status of the searched candidate string by using relationships of a lattice formed from all possible candidate strings of the entity name.

    Abstract translation: 本文公开了使用网络搜索结果识别实体的同义词。 在某些方面,选择实体名称的令牌候选字符串作为搜索项。 搜索项由服务器发送到搜索引擎,搜索引擎又在执行搜索之后将搜索结果发送回服务器。 服务器分析搜索结果,根据搜索结果生成分数,然后根据分数确定候选字符串的状态(同义词或不是同义词)。 在另外的方面,通过使用由实体名称的所有可能候选字符串形成的格子的关系,基于搜索到的候选字符串的状态,将附加候选字符串指定为同义词或不是同义词。

    Systems and methods for estimating functional relationships in a database
    80.
    发明授权
    Systems and methods for estimating functional relationships in a database 有权
    用于估计数据库中的功能关系的系统和方法

    公开(公告)号:US07562067B2

    公开(公告)日:2009-07-14

    申请号:US11123901

    申请日:2005-05-06

    CPC classification number: G06F17/30536 Y10S707/99932

    Abstract: A system that facilitates estimating functional relationships associated with one or more columns in a database comprises a sampling component that receives a random sample of records within the database. An estimate generator component calculates an estimate of strength of functional relationships based at least in part upon the received samples. For example, the estimate generator component can calculate an estimate of strength of a column as a key column based at least in part upon the received samples.

    Abstract translation: 便于估计与数据库中的一个或多个列相关联的功能关系的系统包括接收数据库内的记录的随机抽样的采样组件。 估计生成器组件至少部分地基于所接收的样本来计算功能关系的强度的估计。 例如,估计生成器组件可以至少部分地基于所接收的样本来计算作为关键列的列的强度的估计。

Patent Agency Ranking