Database constructing apparatus, database search apparatus, database apparatus, method of constructing database, and method of searching database
    1.
    发明申请
    Database constructing apparatus, database search apparatus, database apparatus, method of constructing database, and method of searching database 审中-公开
    数据库构建装置,数据库搜索装置,数据库装置,数据库构建方法,数据库搜索方法

    公开(公告)号:US20070168363A1

    公开(公告)日:2007-07-19

    申请号:US10587770

    申请日:2005-09-27

    IPC分类号: G06F7/00

    摘要: A database apparatus has an element appearance information storage portion in which element appearance information is stored using element name IDs as keys, an ancestral path appearance information storage portion in which element appearance information is stored using ancestral path name IDs of the elements as keys, an attribute appearance information storage portion in which attribute appearance information is stored using attribute name IDs as keys, and a text appearance information storage portion in which appearance information about text character strings of element entities and the values of attributes possessed by the elements is stored using the partial character strings as keys.

    摘要翻译: 数据库装置具有元素外观信息存储部,其中使用元素名称ID作为键存储元素外观信息,祖先路径外观信息存储部,其中使用元素的祖先路径名称ID存储元素外观信息作为键, 属性外观信息存储部分,其中使用属性名称ID作为键存储属性外观信息;以及文本外观信息存储部分,其中使用所述元素实体存储关于元素实体的文本字符串和属性所具有的属性的值的外观信息 部分字符串作为键。

    Document retrieval system
    2.
    发明授权
    Document retrieval system 失效
    文件检索系统

    公开(公告)号:US6154737A

    公开(公告)日:2000-11-28

    申请号:US865181

    申请日:1997-05-29

    IPC分类号: G06F17/30

    摘要: A document retrieval system for searching a document coinciding with a retrieval request the user inputs and further ranking the document in accordance with the degree of coincidence between the document and the retrieval request. In the document retrieval system, a word frequency calculating section finds out the number of documents where a word appears, a frequency of occurrence of the word in a document and obtains a weighting parameter for the word, and a frequency score calculating section obtains a frequency score on the basis of the output of the word frequency calculating section. In addition, a word cooccurrence relation checking section checks word cooccurrence relations of the retrieval request and the document, and a cooccurrence score calculating section calculates a cooccurrence score from the degree of coincidence therebetween. A document score calculating section calculates a document score on the basis of the frequency score and the cooccurrence score. The documents are ranked in order of document score and displayed to the user.

    摘要翻译: 一种用于搜索与用户输入的检索请求一致的文档的文档检索系统,并且根据文档和检索请求之间的一致程度对文档进行排序。 在文档检索系统中,词频计算部分查找单词出现的文档数量,文档中单词出现的频率,并获得该单词的加权参数,并且频率分数计算部分获得频率 基于字频率计算部分的输出得分。 此外,单词并发关系检查部分检查检索请求和文档​​的单词并发关系,并发共享计算部分从它们之间的一致度计算出共同分数。 文档分数计算部分根据频率得分和共同分数来计算文档分数。 这些文件按文件分数的顺序排列并显示给用户。

    DATABASE MANAGEMENT SERVER APPARATUS, DATABASE MANAGEMENT SYSTEM, DATABASE MANAGEMENT METHOD AND DATABASE MANAGEMENT PROGRAM
    3.
    发明申请
    DATABASE MANAGEMENT SERVER APPARATUS, DATABASE MANAGEMENT SYSTEM, DATABASE MANAGEMENT METHOD AND DATABASE MANAGEMENT PROGRAM 审中-公开
    数据库管理服务器设备,数据库管理系统,数据库管理方法和数据库管理程序

    公开(公告)号:US20100145914A1

    公开(公告)日:2010-06-10

    申请号:US12472680

    申请日:2009-05-27

    IPC分类号: G06F17/00 G06F12/00

    CPC分类号: G06F16/2329 G06F16/219

    摘要: Provided is a database management server apparatus that can maintain the consistency of updates and prevent blocking other update requests in an update process.A server apparatus 3 of a database management system 1 has a function of nondestructively updating databases in response to an update request from a client apparatus 2 to manage generation-management databases. A main storage unit 4 stores entities of a plurality of databases for each version of the databases, and a version creating unit 5 creates a new version of the databases in response to an update request from a client apparatus. A request accepting unit 11 accepts an update request for a next version regardless of whether the new version is being created. An acceptance management unit 13 starts a period for accepting the update request for the next version in response to the update request and ends the period for accepting after a predetermined time. A version creating unit 5 creates the next version based on the update request accepted in the period for accepting.

    摘要翻译: 提供了一种数据库管理服务器装置,其可以维护更新的一致性并防止在更新过程中阻止其他更新请求。 数据库管理系统1的服务器装置3具有响应于来自客户端装置2的更新请求非破坏性地更新数据库以管理生成管理数据库的功能。 主存储单元4存储针对每个版本的数据库的多个数据库的实体,并且版本创建单元5响应于来自客户端装置的更新请求而创建新版本的数据库。 无论是否正在创建新版本,请求接受单元11接受下一个版本的更新请求。 接受管理单元13响应于更新请求开始接受下一个版本的更新请求的时段,并且在预定时间之后结束用于接受的时段。 版本创建单元5基于在接受的时段中接受的更新请求来创建下一个版本。

    Keyword extracting system and text retrieval system using the same
    4.
    发明授权
    Keyword extracting system and text retrieval system using the same 失效
    关键词提取系统和文本检索系统使用相同

    公开(公告)号:US06212517B1

    公开(公告)日:2001-04-03

    申请号:US09106748

    申请日:1998-06-30

    IPC分类号: G06F1730

    摘要: A system for providing keywords to facilitate a search in a text retrieval system. For each of texts constituting a text base, the system creates a word ID of each of words used in the text and a word occurrence count of a corresponding word. The word occurrence count indicates a number of occurrences of a word in each text. For each of words used in any of the texts constituting the text base, the system creates a total word occurrence count and a containing text count indicative of the number of texts containing the word. For each of words contained in the selected texts, a degree of importance is calculated by using the word occurrence count, the total word occurrence count and the containing text count. The words contained in the selected texts are sorted in order of the degree of importance. At least a part of the sorted words are displayed as related keywords.

    摘要翻译: 一种用于提供关键字以便于在文本检索系统中进行搜索的系统。 对于构成文本基础的每个文本,系统创建文本中使用的每个单词的单词ID和相应单词的单词出现计数。 发生次数表示每个文本中出现的单词数。 对于构成文本基础的任何文本中使用的每个单词,系统创建总词出现次数和包含指示包含单词的文本数量的文本计数。 对于所选文本中包含的每个单词,通过使用单词出现次数,总词出现次数和包含的文本计数来计算重要程度。 所选文本中包含的词根据重要程度的顺序排列。 至少一部分排序的字词显示为相关关键字。

    Document searching apparatus
    5.
    发明授权
    Document searching apparatus 失效
    文件检索装置

    公开(公告)号:US06298344B1

    公开(公告)日:2001-10-02

    申请号:US09277197

    申请日:1999-03-26

    IPC分类号: G06F1730

    摘要: A searching apparatus includes an index generation portion for generating an index to provide data of the number of documents including the key word and the number of appearances of the key word. Matching degrees between the key word and documents are calculated from the number of documents including the key word and the number of appearances of the key word. A portion of documents are arranged in order of the matching degree in a buffer which are outputted as the searching result. Lower rank documents regarding the matching degree are searched by comparing the lowest matching degree of the neighbour higher ranked document arranged in the buffer. At first time searching, data of the latest edition of the documents stored in a memory is detected and stored and is used to provide second time searching operation to eliminate inconsistency in the searching result between the editions at first and second time searching operations. The index is generated every field of each document. The matching degree of combined field is calculated by logical operation between the two fields. Moreover, an index of combined field may be generated and one of field of the combined field may be omitted. The matching degree of the other field is also obtained by another logical operation.

    摘要翻译: 搜索装置包括索引生成部分,用于生成索引以提供包括关键字的文档数量和关键字的出现次数的数据。 关键词和文档之间的匹配度由包括关键词和关键词的出现次数的文档数量计算。 文档的一部分按照作为搜索结果输出的缓冲器中的匹配度的顺序排列。 通过比较布置在缓冲器中的相邻较高排序的文档的最低匹配度来搜索关于匹配度的下级文档。 在第一次搜索时,检测并存储存储在存储器中的最新版本的文档的数据,并且用于提供第二次搜索操作,以消除第一次和第二次搜索操作之间的版本之间的搜索结果的不一致性。 每个文档的每个字段都生成索引。 组合场的匹配度由两场之间的逻辑运算来计算。 此外,可以生成组合字段的索引,并且可以省略组合字段的字段之一。 另一个字段的匹配度也可以通过另一个逻辑运算获得。

    Full-text search apparatus utilizing two-stage index file to achieve
high speed and reliability of searching a text which is a continuous
sequence of characters
    6.
    发明授权
    Full-text search apparatus utilizing two-stage index file to achieve high speed and reliability of searching a text which is a continuous sequence of characters 失效
    全文检索装置利用两阶段索引文件来实现搜索文本的高速度和可靠性,这是连续的字符序列

    公开(公告)号:US5706496A

    公开(公告)日:1998-01-06

    申请号:US601656

    申请日:1996-02-14

    IPC分类号: G06F17/30

    摘要: A new type of text search apparatus, capable of finding all occurrence positions of a search string that is an arbitrary character string, within a text which is written as a continous sequence of characters, utilizes for text position reference purposes in an index file, words which each occur (at least once within the text) as the maximum length word, referred to as an extension word, among a set of arbitrarily predefined dictionary words extending from a specific character position. Each such occurrence of a word as an extension word defines one of a set of text position elements, with that set covering all of the character positions of the text. The index file also includes a table which relates each of the extension words to the respective positions at which each of the partial character strings of the word occur within the word. Each occurrence of an arbitrary search string within the text can thereby be expressed as either a partial character string within a single text position element, or as a sequence of partial character strings within a set of sequentially occurring text position elements, so that all such occurrences can be found by utilizing the index file.

    摘要翻译: 一种新型的文本搜索装置,能够在作为连续的字符序列写入的文本内发现作为任意字符串的搜索字符串的所有出现位置,用于索引文件中的文本位置参考目的,词语 它们在从特定字符位置延伸的一组任意预定义的字典单元中发生(至少在文本内部)作为最大长度字,称为扩展字。 作为扩展字的单词的每个这样的出现定义了一组文本位置元素中的一个,该集合覆盖文本的所有字符位置。 索引文件还包括一个表,其将每个扩展词与单词中出现单词的每个部分字符串的各个位置相关联。 因此,文本内的任意搜索字符串的每次出现由此可以表示为单个文本位置元素内的部分字符串,或者表示为在一组顺序发生的文本位置元素内的部分字符串的序列,使得所有这些事件 可以通过使用索引文件找到。

    Document storing and managing system
    7.
    发明授权
    Document storing and managing system 失效
    文件存储和管理系统

    公开(公告)号:US5819295A

    公开(公告)日:1998-10-06

    申请号:US721077

    申请日:1996-09-26

    摘要: A document storing and managing system for storing plural electronic documents in each of folders according to classifications and managing the stored electronic documents in a unit of the folder has a folder managing means for managing attributes of the electronic documents included in each of the folders, a document version managing means for managing information as to version of the electronic documents included in each of the folder, and a folder version managing means for managing a correspondence relation between a version of the folder and a version of each of the electronic documents included in the folder. The document storing and managing system of this invention may set and manage a version of a folder while keeping adjustability with a version of each document.

    摘要翻译: 一种文件存储和管理系统,用于根据分类将文件夹中的多个电子文档存储在文件夹的每个文件夹中,并以文件夹为单位管理所存储的电子文档,具有用于管理包含在每个文件夹中的电子文档的属性的文件夹管理装置, 文件版本管理装置,用于管理包括在每个文件夹中的电子文档的版本的信息;以及文件夹版本管理装置,用于管理文件夹的版本与包含在该文件夹中的每个电子文档的版本之间的对应关系 夹。 本发明的文件存储和管理系统可以利用每个文档的版本保持可调整性来设置和管理文件夹的版本。

    Electronic dictionary system
    8.
    发明授权
    Electronic dictionary system 失效
    电子词典系统

    公开(公告)号:US5404299A

    公开(公告)日:1995-04-04

    申请号:US53290

    申请日:1993-03-28

    CPC分类号: G06F17/30955 G06F17/2795

    摘要: A concept dictionary management device includes a fundamental concept dictionary data holding portion for holding fundamental concept network connection information which represents fundamental concept network connections among words stored in a concept dictionary, a first supplemental concept dictionary data holding portion for holding first supplemental concept network connection information to be used for adding words to and deleting words from the concept network connections represented by the fundamental concept network connection information, a second supplemental concept dictionary data holding portion for holding second supplemental concept network connection information to be used only for personal use, to add one or more words to and deleting one or more words from the concept network connections obtained as a result of an addition of a deletion of the words connection by using the first supplemental concept network connection information, a concept dictionary retrieval portion for retrieving a concept network connection including an input word from the fundamental concept dictionary data holding portion, from the first supplemental concept dictionary data holding portion and from the second supplemental concept dictionary data holding portion, and an operation control portion for receiving concept network connection information representing the concept network connection retrieved by the concept dictionary retrieval means and for extracting a word from the received network connection information and outputting data indicating the extracted word to the concept dictionary retrieval portion as data indicating an input word.

    摘要翻译: 概念字典管理装置包括:基本概念词典数据保持部,用于保持基本概念网络连接信息,该概念网络连接信息表示存储在概念词典中的单词之间的基本概念网络连接;第一补充概念词典数据保持部分,用于保存第一补充概念网络连接信息 用于向由基本概念网络连接信息表示的概念网络连接中添加单词和从其中删除单词;第二补充概念字典数据保存部分,用于保存仅供个人使用的第二补充概念网络连接信息,以添加 通过使用第一补充概念网络连接信息作为添加单词连接的删除而获得的概念网络连接中的一个或多个单词和删除一个或多个单词,概念词典检索口 用于从第一补充概念词典数据保持部分和第二补充概念词典数据保持部分检索包括来自基本概念词典数据保持部分的输入字的概念网络连接,以及用于接收概念网络连接的操作控制部分 表示由概念词典检索装置检索的概念网络连接的信息,并且用于从接收的网络连接信息中提取一个单词,并将表示提取的单词的数据输出到概念词典检索部分,作为指示输入单词的数据。

    Vector index preparing method, similar vector searching method, and apparatuses for the methods

    公开(公告)号:US07007019B2

    公开(公告)日:2006-02-28

    申请号:US09913960

    申请日:2000-12-21

    申请人: Yuji Kanno

    发明人: Yuji Kanno

    IPC分类号: G06F17/30

    摘要: In the present invention, a similar vector is searched from a several hundreds dimensional vector database at a high speed, by a single vector index, and in accordance with either measure of an inner product or a distance by designating a similarity search range and maximum obtained pieces number, vector index preparation is performed by decomposing each vector into a plurality of partial vectors and characterizing the vector by a norm division, belonging region and declination division to prepare an index, and similarity search is performed by obtaining a partial query vector and partial search range from a query vector and search range, performing similarity search in each partial space to accumulate a difference from the search range and to obtain an upper limit value, and obtaining a correct measure from a higher upper limit value to obtain a final similarity search result.

    Constructing method of finite-state machine performing transitions
according to a partial type of success function and a failure function
    10.
    发明授权
    Constructing method of finite-state machine performing transitions according to a partial type of success function and a failure function 失效
    有限状态机的构造方法根据部分类型的成功函数和故障功能执行转换

    公开(公告)号:US5495409A

    公开(公告)日:1996-02-27

    申请号:US331260

    申请日:1994-10-28

    申请人: Yuji Kanno

    发明人: Yuji Kanno

    摘要: A constructing method of a finite state machine with failure transitions FFM is disclosed. The machine FFM is constructed from a nondeterministic finite-state machine and a string of external inputs. States in the machine FFM is formed of a state set q included in the nondeterministic finite-state machine and a set p defined as a subset of the state set q, and the number of states is finite. Also, an external input c takes the machine FFM from a current state s to a next state g(s,c) and an output .mu.(s) is output from the next state g(s,c) in cases where a value g(s,c) of a success function g is defined, and an external input c takes the machine FFM from the current state s to a state g(f(f...f(s)...)) determined by repeatedly calculating a value f(s) of a failure function f until a value g(f(f...f(s)...)) defined is found out in cases where the value g(s,c) of the success function g is not defined. Because all of transitions from the current state s for all external inputs c are not defined by the success function g, a storage capacity for storing the machine FFM is considerably reduced.

    摘要翻译: 公开了一种具有故障转移FFM的有限状态机的构造方法。 机器FFM由非确定性有限状态机和一串外部输入构成。 机器FFM中的状态由包含在非确定性有限状态机中的状态集合q和定义为状态集合q的子集的集合p形成,并且状态数量p,q> 有限。 此外,外部输入c将机器FFM从当前状态s接收到下一状态g(s,c),并且在值g(s,c)的情况下从下一状态g(s,c)输出输出mu 定义成功函数g的(s,c),并且外部输入c将机器FFM从当前状态s引导到由重复确定的状态g(f(f ... f(s)...)) 在成功的值g(s,c)的情况下,计算故障函数f的值f(s),直到定义的值g(f(f ... f(s)...) 功能g未定义。 因为对于所有外部输入c的从当前状态s的所有转换不是由成功函数g定义的,所以存储机器FFM的存储容量大大降低。