摘要:
The matching apparatus 100 stores, with respect to a branching pattern included in event patterns 140a, a reception time of an event that serves as the branching point in a source node that indicates the transition destination of the event, connects the source node to the branch node by a transition with an interval condition, connects the branch nodes to the link node that indicates the linking point at which these branch nodes are linked together, and thereby generates NFAs 140b. Then, the matching apparatus 100 compares event streams 140c with the NFAs 140b to determine whether the event streams 140c includes the event patterns 140a.
摘要:
A rule learning method in machine learning includes distributing features to a given number of buckets based on a weight of the features which are correlated with a training example; specifying a feature with a maximum gain value as a rule based on a weight of the training example from each of the buckets; calculating a confidence value of the specified rule based on the weight of the training example; storing the specified rule and the confidence value in a rule data storage unit; updating the weights of the training examples based on the specified rule, the confidence value of the specified rule, data of the training example, and the weight of the training example; and repeating the distributing, the specifying, the calculating, the storing, and the updating, when the rule and the confidence value are to be further generated.
摘要:
Related keywords having certain relations to a target keyword are extracted from a unit storing keywords according to relations among each other. Expressions corresponding to the target keyword and related keywords are read from a unit storing expressions correspondingly with each stored keyword, each expression having been extracted from a content containing the corresponding keyword. A commonness between the expressions corresponding to each related keyword and the target keyword is calculated. An association between each related keyword and the target keyword is calculated using a distance for each related keyword from the target keyword and the commonness. A related keyword having a certain association is extracted as a same-region keyword in a same region as the target keyword. A drawing including the target keyword and related keywords arranged according to the associations is output with the same-region keyword displayed distinguishably from other related keywords not included in the same region.
摘要:
A computer-readable recording medium stores therein an information retrieving program that causes a computer to execute acquiring a document to be searched and having a hierarchical structure; generating a path schema related to the acquired document; receiving input of a retrieval keyword, a retrieval condition for the retrieval keyword, and a retrieval equation specifying a retrieval range for the retrieval keyword; generating a single automaton that includes a hierarchy retrieval automaton that retrieves a hierarchy of the generated path schema and a hit keyword retrieval automaton that retrieves a hit keyword satisfying the retrieval condition, the single automaton making state transition between a hit hierarchical node where the hit keyword in the hierarchy retrieval automaton is present and a set of nodes representing the hit keyword; retrieving, from the document and using the single automaton, the hit keyword within the retrieval range; and outputting a retrieval result.
摘要:
A search supporting device includes an accepting unit for accepting an input word and a URL, a log obtaining unit for obtaining from a search log storing unit in which a log including a search word having been used for every URL of viewed data for a search of the data is stored, the log including a URL having a particular portion in common with the accepted URL and a search word having a particular portion in common with the accepted input word, and an outputting unit for outputting the search word included in the obtained log.
摘要:
A method and apparatus for allowing a computer to search a hierarchical structure document by creating a list in which a true flag indicating that conditions of a predicate of a search formula are satisfied or a false flag indicating that the conditions of the predicate of the search formula are not satisfied is set to a predicate node of the document data based on the search formula, and scanning the list to search for data designated by the search formula from the document data.
摘要:
A computer-readable recording medium stores therein an information retrieving program that causes a computer to execute acquiring a document to be searched and having a hierarchical structure; generating a path schema related to the acquired document; receiving input of a retrieval keyword, a retrieval condition for the retrieval keyword, and a retrieval equation specifying a retrieval range for the retrieval keyword; generating a single automaton that includes a hierarchy retrieval automaton that retrieves a hierarchy of the generated path schema and a hit keyword retrieval automaton that retrieves a hit keyword satisfying the retrieval condition, the single automaton making state transition between a hit hierarchical node where the hit keyword in the hierarchy retrieval automaton is present and a set of nodes representing the hit keyword; retrieving, from the document and using the single automaton, the hit keyword within the retrieval range; and outputting a retrieval result.
摘要:
A named entity extraction apparatus includes an extraction result acquisition unit for acquiring a named entity extraction result obtained as a result of a named entity extraction process; and a lexicon information creation unit for creating lexicon information which is utilized as clues in extracting named entities from text data, on the basis of the named entity extraction result acquired by said extraction result acquisition unit.
摘要:
A feature-pattern output apparatus, which has a database in which data formed of a plurality of items is classified as a plurality of classes, and outputs a combination of items forming a feature of each of the classes as a feature pattern of the class, includes a similar-data extracting unit that extracts, when input data is received, similar data that is similar to the input data for each of the classes from the database; a similar-pattern-set calculating unit that calculates a similar pattern set for each of the classes from the similar data extracted; and a feature-pattern calculating unit that calculates a feature pattern for each of the classes from the similar pattern set calculated.
摘要:
An apparatus includes a unit dividing a region into blocks having a predetermined bit length and generating addition data having the predetermined bit length corresponding to coordinate data, a unit storing storage data having the predetermined bit length, a unit generating composite data having a bit length twice the length of the predetermined bit length by connecting the addition data to the storage data, and storing the composite data, and a unit repeating an operation of designating the bit length of the composite data as the predetermined bit length, instructing to generate the addition data, and instructing to generate and store the composite data. The number of different bits between a bit string of a location corresponding to the input coordinate data and a bit string of a reference location tends to increase as a distance between the location corresponding to the coordinate data and the reference location increases.