Abstract:
A method and apparatus that allow a computer to search a hierarchically structured document. A list is created in which each predicate node of the document data is given, based on the search formula, either a true flag indicating that the conditions of the formula's predicate are satisfied or a false flag indicating that they are not. The list is then scanned to find the data designated by the search formula in the document data.
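The flag-list idea can be illustrated with a minimal sketch. It assumes an XML document, a search formula roughly equivalent to `/catalog/book[price > 10]/title`, and hypothetical helper names (`build_flag_list`, `scan`); the abstract does not specify the document format or the shape of the list, so this is only one possible reading.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document; "book" plays the role of the predicate node.
DOC = """<catalog>
  <book><title>A</title><price>5</price></book>
  <book><title>B</title><price>15</price></book>
  <book><title>C</title><price>25</price></book>
</catalog>"""


def build_flag_list(root, node_tag, predicate):
    """Pair every predicate node with a True/False flag for the predicate."""
    return [(node, predicate(node)) for node in root.iter(node_tag)]


def scan(flag_list, target_tag):
    """Scan the list and collect target data only under nodes flagged True."""
    results = []
    for node, flag in flag_list:
        if flag:
            results.extend(child.text for child in node.findall(target_tag))
    return results


root = ET.fromstring(DOC)
flags = build_flag_list(
    root, "book", lambda book: float(book.findtext("price", "0")) > 10
)
titles = scan(flags, "title")
```

Separating flag creation from the scan means the predicate is evaluated once per node, and the scan itself only reads flags.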
Abstract:
A search device creates stack frames in a stack, the number of frames being one more than the number of search condition character strings contained in an out-of-search-condition character string. It sequentially inputs character strings in a text into automaton data and determines whether each hits the search condition character string or the out-of-search-condition character string, pushing correspondence onto the stack or changing correspondence into non-correspondence accordingly, and then determines whether the text is to be searched.
Abstract:
A computer-readable storage medium storing a dataset sorting program is provided to sort records in a dataset into a plurality of destination groups according to a given key item specification. An item value extractor creates an item value list for every record. Then a frequent tree builder builds a frequent tree from the item value lists by finding patterns of item values that appear more often than a threshold specified by a given growth rate parameter. Each item value pattern is a leading part of an item value list with a variable length. A destination group mapper associates each node of the frequent tree with one of the plurality of destination groups. A record sorter traces the frequent tree according to the item value list of each given record, and upon reaching a particular node, puts the record into the destination group associated with that node.
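The pipeline above (item value lists, frequent tree, node-to-group mapping, record sorting) can be sketched as follows. This is a simplified illustration: it assumes a fixed count threshold in place of the growth rate parameter, a round-robin node-to-group mapping, and hypothetical function names, none of which come from the abstract.

```python
from collections import Counter
from typing import Dict, List, Sequence, Tuple

Prefix = Tuple[str, ...]


def build_frequent_tree(item_lists: Sequence[Sequence[str]],
                        threshold: int) -> List[Prefix]:
    """Return tree nodes as prefixes that occur more than `threshold` times.

    A prefix is at least as frequent as any of its extensions, so the kept
    prefixes, together with the empty root prefix, form a tree.
    """
    counts: Counter = Counter()
    for items in item_lists:
        for length in range(1, len(items) + 1):
            counts[tuple(items[:length])] += 1
    frequent = sorted(p for p, c in counts.items() if c > threshold)
    return [()] + frequent  # () is the root node


def map_nodes_to_groups(nodes: List[Prefix], group_count: int) -> Dict[Prefix, int]:
    """Assumed policy: assign nodes to destination groups round-robin."""
    return {node: i % group_count for i, node in enumerate(nodes)}


def sort_record(items: Sequence[str], mapping: Dict[Prefix, int]) -> int:
    """Trace the tree along the record's item values; use the deepest node reached."""
    node: Prefix = ()
    for length in range(1, len(items) + 1):
        candidate = tuple(items[:length])
        if candidate not in mapping:
            break
        node = candidate
    return mapping[node]


item_lists = [
    ("JP", "Tokyo"), ("JP", "Tokyo"), ("JP", "Osaka"),
    ("US", "NY"), ("JP", "Tokyo"),
]
nodes = build_frequent_tree(item_lists, threshold=1)
mapping = map_nodes_to_groups(nodes, group_count=3)
groups = [sort_record(items, mapping) for items in item_lists]
```

Records whose values are too rare to earn their own node fall back to the nearest frequent ancestor, which keeps rare keys from creating many tiny groups.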
Abstract:
Tag registration information, keyword registration information, and state management information are generated based on a search condition. According to the state management information, processing switches between a tag search, which detects tags registered in the tag registration information in the document data of a structured document, and a keyword search, which detects keywords registered in the keyword registration information.
Abstract:
One or more extraction conditions for designating data to be extracted can be input in a program. When one or more extraction conditions are input, data extraction is carried out for each of the extraction conditions, and the extracted data is output to an output destination in accordance with the extraction condition that the data satisfies.
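A minimal sketch of this per-condition routing, assuming each extraction condition is a predicate paired with a named output destination. The name `route_records` and the in-memory lists standing in for output destinations are hypothetical.

```python
from typing import Callable, Dict, Iterable, List, Tuple

Record = Dict[str, int]
# Each condition is (destination name, predicate over a record).
Condition = Tuple[str, Callable[[Record], bool]]


def route_records(records: Iterable[Record],
                  conditions: List[Condition]) -> Dict[str, List[Record]]:
    """Evaluate every condition on every record in a single pass.

    A record is appended to the destination of each condition it satisfies,
    so one record may reach several destinations or none.
    """
    outputs: Dict[str, List[Record]] = {name: [] for name, _ in conditions}
    for record in records:
        for name, predicate in conditions:
            if predicate(record):
                outputs[name].append(record)
    return outputs


records = [
    {"id": 1, "amount": 50},
    {"id": 2, "amount": 500},
    {"id": 3, "amount": 5000},
]
conditions = [
    ("small", lambda r: r["amount"] < 100),
    ("large", lambda r: r["amount"] >= 500),
]
routed = route_records(records, conditions)
```

Reading the input once and fanning out to several destinations avoids one full pass over the data per condition.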
Abstract:
A data sort method, apparatus, and program to receive a character string of sort key items specified as keys of a data sort process, generate an automaton corresponding to a record whose final transition state corresponds to the character string, determine the order specified by the character string by scanning the automaton, and determine the order of the records corresponding to the character string.
Abstract:
A playback apparatus includes a VOBU number retriever for obtaining the number of VOBUs included in a title, a playback time retriever for obtaining the playback time of the title, and a playback controller for carrying out special playback according to the number of VOBUs and the playback time. The playback controller obtains, as a unit playback time, the average playback time for one VOBU from the title's playback time and number of VOBUs, and refers to an address map of VOBUs according to the unit playback time to execute a high-speed search and a time search.
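The arithmetic behind the unit playback time can be sketched as below. The address map is modeled as a plain list of hypothetical sector addresses, one per VOBU, and the function names and the `jump_seconds` parameter are assumptions made for illustration only.

```python
from typing import List


def unit_playback_time(total_seconds: float, vobu_count: int) -> float:
    """Average playback time of one VOBU: title playback time / number of VOBUs."""
    if vobu_count <= 0:
        raise ValueError("title must contain at least one VOBU")
    return total_seconds / vobu_count


def time_search(address_map: List[int], total_seconds: float,
                target_seconds: float) -> int:
    """Estimate which VOBU holds the target time and return its address."""
    unit = unit_playback_time(total_seconds, len(address_map))
    index = int(target_seconds / unit)
    index = max(0, min(index, len(address_map) - 1))  # clamp to the title
    return address_map[index]


def high_speed_addresses(address_map: List[int], total_seconds: float,
                         jump_seconds: float) -> List[int]:
    """Addresses visited when skipping roughly `jump_seconds` per step."""
    unit = unit_playback_time(total_seconds, len(address_map))
    step = max(1, round(jump_seconds / unit))
    return address_map[::step]


# Hypothetical address map: 10 VOBUs in a 5-second title (0.5 s per VOBU).
address_map = [0, 120, 250, 400, 560, 700, 860, 1000, 1150, 1300]
addr = time_search(address_map, total_seconds=5.0, target_seconds=2.2)
skipped = high_speed_addresses(address_map, total_seconds=5.0, jump_seconds=1.0)
```

Because the unit time is an average, the computed index is an estimate of where the target time falls rather than an exact position.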
Abstract:
A full text search system is disclosed that uses a character string collation method to search a large quantity of data with a plurality of search processing apparatuses. The system comprises a search integration unit which divides search-target character string data into a group of character string records, allocates the divided records to one or more search processing apparatuses, transmits given character string search conditions to each search processing apparatus, and receives and integrates the search results. The system further comprises an update temporary storage unit which temporarily stores new character string records for updating the search-target character string data, and an update record search instruction unit which instructs one of the search processing apparatuses, determined in advance, to search the new character string records stored in the update temporary storage unit as part of the search-target character string data.
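The division of labor described above can be sketched in a single process. This is only an illustration under stated assumptions: the class names, the round-robin allocation, and substring matching as the collation method are all hypothetical, and real search processing apparatuses would be separate machines.

```python
from typing import Dict, List


class SearchProcessor:
    """Stands in for one search processing apparatus."""

    def __init__(self) -> None:
        self.records: Dict[int, str] = {}

    def search(self, term: str) -> List[int]:
        return sorted(rid for rid, text in self.records.items() if term in text)


class SearchIntegrator:
    """Allocates records, broadcasts the search condition, merges results."""

    def __init__(self, processor_count: int, update_processor: int = 0) -> None:
        self.processors = [SearchProcessor() for _ in range(processor_count)]
        self.update_processor = update_processor  # chosen in advance
        self.pending: Dict[int, str] = {}         # update temporary storage

    def load(self, records: Dict[int, str]) -> None:
        # Assumed policy: allocate records round-robin across processors.
        for i, (rid, text) in enumerate(sorted(records.items())):
            self.processors[i % len(self.processors)].records[rid] = text

    def add_update(self, rid: int, text: str) -> None:
        self.pending[rid] = text

    def search(self, term: str) -> List[int]:
        # Hand pending updates to the predetermined processor so they are
        # searched as part of the search-target data.
        self.processors[self.update_processor].records.update(self.pending)
        self.pending.clear()
        hits: List[int] = []
        for processor in self.processors:
            hits.extend(processor.search(term))
        return sorted(hits)


integrator = SearchIntegrator(processor_count=2)
integrator.load({1: "full text search", 2: "string collation", 3: "text update"})
before = integrator.search("text")
integrator.add_update(4, "new text record")
after = integrator.search("text")
```

Sending updates to one predetermined processor lets new records become searchable without redistributing the existing allocation.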