摘要:
An information processing program causes a computer to execute a process including executing training of a trained model based on training data defining relations between vectors corresponding to target compounds and vectors respectively corresponding to plural subcompounds included in synthetic pathways for manufacture of the target compounds and calculating vectors of plural subcompounds corresponding to a target compound to be analyzed by inputting a vector of the target compound to be analyzed into the trained model in a case where the target compound to be analyzed has been received.
摘要:
An information processing apparatus is configured to: obtain pieces of segmented genome data being genome information of a specific individual; generate pieces of segmented codon data obtained by encoding each of the pieces of segmented genome data in a codon unit based on a table in which a codon is associated with a code; identify, based on reference codon data obtained by encoding reference genome data to be a reference in the codon unit and each of the pieces of segmented codon data, a type and a position of an appearance of gene mutation different from the code in the reference codon data among the codes in the pieces of segmented codon data; and generate a gene mutation inverted index in which the gene mutation and the type and position of the appearance of the gene mutation are associated with each other.
摘要:
An evaluation device generates a new piece of base sequence data through shifting. The evaluation device specifies a partial base sequence including a base in which a genetic mutation is caused, from among a plurality of partial base sequences generated by dividing a plurality of bases included in the newly generated base sequence data from a reference position on the new piece of base sequence data according to a predetermined rule. The evaluation device performs evaluation according to an appearance state in which an arrangement of the specified partial base sequence and a partial base sequence that has a predetermined positional relationship with the specified partial base sequence from among the plurality of partial base sequences appears in the plurality of partial base sequences generated by dividing a plurality of bases included in predetermined base sequence data from the reference position on the predetermined base sequence data according to the predetermined rule.
摘要:
An information processing apparatus (100) determines, by referring to a storage unit that stores therein contour data of a plurality of objects, whether a plurality of pieces of contour data associated with a contour of a subject included in a captured image. The information processing apparatus (100) acquires, when a determination result is affirmative, by referring to the storage unit, a plurality of pieces of region data associated with the plurality of pieces of corresponding contour data associated with the contour of the subject and specifies, based on the plurality of pieces of acquired region data, an object associated with the subject from among the plurality of objects.
摘要:
A file generating device (100) extracts, when a captured image captured by an image capturing device is acquired, based on the acquired captured image, a shape of an object included in the captured image and generates, based on the extracted shape, text information that includes a drawing indication of the shape. The information processing apparatus (200) refers to a storage unit that stores therein, regarding each of a plurality of objects, identification information on the objects in association with the text information that includes a drawing indication of the objects and acquires identification information on an object, from among the plurality of objects, that is associated with the text information in which similarity to the generated text information satisfies a criterion.
摘要:
An encoding apparatus includes an encoding unit configured to acquire text data, specify a first dynamic dictionary among a plurality of dynamic dictionaries based on attribute information of a first word included in the text data, register the first word in association with a first dynamic code in the first dynamic dictionary, and encode the first word into the first dynamic code.
摘要:
In (A), classification codes "#1", "#10", and "#1032" of a search word "curry-and-rice" are acquired for tiers. In (B), for a character string for comparison "curry-and-rice, a food prepared by putting curry on rice", number of appearances of the major classification code "#1" (food) is four, number of appearances of the intermediate classification code "#10" (rice) is two, number of appearances of the minor classification code "#1032" (curry-and-rice) is one, number of appearances of an intermediate classification code "#11" (spice) is one, and number of appearances of a minor classification code "#1154" (curry) is one. In (C), the number of appearances acquired in (A) is converted into a vector. The number of appearances acquired in (B) is converted into a vector. A degree of similarity is calculated by acquiring the inner product of these vectors. In (C), the degree of similarity is seven. Automatic selection or refusal of a range for the fuzzy retrieval enables the accuracy of fuzzy retrieval to be improved.
摘要:
An information processing apparatus performs processing including: classifying vectors of first sentences in a file into each similar vector; generating an inverted index associating a vector of each first sentence with a position of the first sentence on the file; identifying a feature sentence from a second sentences included in a second sentences; specifying similar vectors being vectors similar to a vector of the feature sentence based on the inverted index; specifying, for each similar vector, first transition data indicating transition of vectors at positions before and after the similar vector based on the inverted index; and specifying, from pieces of first transition data obtained by performing the specifying of the first transition data on the similar vectors, transition data similar to second transition data indicating transition of vectors of sentences before and after the feature sentence, to output the transition data as a response of the search query.
摘要:
An information processing program for causing a computer to perform processing including: dividing a sequence indicating a rational formula of a compound, into a character string of a minimum unit of the sequence and a branch symbol indicating a branched portion of the compound; generating a first coded sequence by using a group dictionary indicating a relationship between the sequence and the compression code, the generating including assigning a compression code to the character string of the minimum unit, and assigning the compression code according to a type of the branched portion to the branch symbol; and generating a second coded sequence by using a primary structure dictionary indicating a relationship between a group primary structure of the sequence and the compression code, the generating of the second coded sequence including encoding the compression code in the first coded sequence in units of the group primary structure.
摘要:
An information processing device identifies a vector corresponding to any word included in text included in a search condition. The information processing device refers to a storage unit that stores presence information indicating whether or not a word corresponding to each of a plurality of vectors is included in each of a plurality of text files, and identifies a text file including the any word among the plurality of text files on the basis of presence information associated with a vector in which similarity to the identified vector is equal to or higher than a standard among the plurality of vectors.