摘要:
The present invention includes: an extraction unit that extracts a specified quantity of documents, as targets to be classified by a user, from document information; a classification code accepting unit that accepts a classification code which is an identifier used when categorizing the documents, and is assigned by the user to each of the extracted documents; a database that records keywords selected from the extracted documents on the basis of the classification code; a score calculation unit that calculates a score which evaluates linkage strength between documents included in the document information, and the classification code on the basis of the keywords; and a judgment unit that judges whether the number of times of the calculation of the score has reached a specified number of times or not; wherein when the judgment unit determines that the number of times of the calculation of the score has not reached the specified number of times, the score calculation unit recalculates the score on the basis of a result of further extraction, by the extraction unit, of a specified quantity of documents, as targets to be classified by the user, from the document information according to the score.
摘要:
It is possible to analyze digitized document information gathered to be provided as evidence in a legal action and to classify the document information to be easily accessible in the legal action. A document classification system includes a keyword database, a related term database, a first classification unit which extracts a document including a keyword recorded in the keyword database from document information and attaches a specific classification mark to the extracted document based on keyword-corresponding information, and a second classification unit which extracts a document including a related term recorded in the related term database from document information, to which the specific classification mark is not attached in the first classification unit, calculates a score based on an evaluated value of the related term included in the extracted document and the number of related terms, and attaches a predetermined classification mark to a document, for which the score exceeds a given value, among the documents including the related term based on the score and the related term-corresponding information.
摘要:
It is possible to analyze digitized document information gathered to be provided as evidence in a legal action and to classify the document information to be easily accessible in the legal action. A document classification system includes a keyword database, a related term database, a first classification unit which extracts a document including a keyword recorded in the keyword database from document information and attaches a specific classification mark to the extracted document based on keyword-corresponding information, and a second classification unit which extracts a document including a related term recorded in the related term database from document information, to which the specific classification mark is not attached in the first classification unit, calculates a score based on an evaluated value of the related term included in the extracted document and the number of related terms, and attaches a predetermined classification mark to a document, for which the score exceeds a given value, among the documents including the related term based on the score and the related term-corresponding information.
摘要:
Embodiments of the inventive concept reduce the burden of creating litigant sources of evidence or other evidentiary materials in connection with litigation in a court of law. Designation of at least one document file included in digital document information is accepted and designation of a language into which the designated document file is translated is accepted. The document file, the designation of which is accepted, is translated into the language the designation of which is accepted. A common document file representing the same content as that of the designated document file is extracted from digital document information recorded in a recording unit. Translation-related information representing that the extracted common document file is translated by invoking a translated content of the translated document file is generated, and, based on the translation-related information, a litigant-related document file is output.
摘要:
The present invention relates to data analysis for evaluating a plurality of pieces of object data; and the evaluation corresponds to the relation between each piece of object data and a specified case. An index that enables ranking of the plurality of pieces of object data is generated by the evaluation and the index changes based on an input entered by a user. A pattern is extracted that characterizes the reference data from the reference data according to the classification information assigned by the input. The index is determined by evaluating the relation between the object data and the specified case based on the extracted pattern and set to the object data. The plurality of pieces of object data are ranked according to the index and reported the user.
摘要:
Disclosed is a forensic system capable of enhancing the accuracy and efficiency of classification work of whether to submit document information as evidence in a lawsuit by highlighting a portion including a specific keyword in a unit of a sentence. The forensic system includes: a database that registers a keyword for determining by a user whether a plurality of pieces of document information included in the digital information is related to a lawsuit; a retrieving unit that retrieves the keyword registered in the database from the document information; a sentence extracting unit that extracts a sentence including the retrieved keyword from the document information; a score calculating unit that calculates a score indicating a degree of relevance to the lawsuit using a feature value extracted from the sentence extracted by the sentence extracting unit; and a highlighting unit that changes a degree of highlighting of the sentence according to the score.
摘要:
It is possible to analyze digitized document information gathered to be provided as evidence in a legal action and to classify the document information to be easily accessible in the legal action. A document classification system includes a keyword database, a related term database, a first classification unit which extracts a document including a keyword recorded in the keyword database from document information and attaches a specific classification mark to the extracted document based on keyword-corresponding information, and a second classification unit which extracts a document including a related term recorded in the related term database from document information, to which the specific classification mark is not attached in the first classification unit, calculates a score based on an evaluated value of the related term included in the extracted document and the number of related terms, and attaches a predetermined classification mark to a document, for which the score exceeds a given value, among the documents including the related term based on the score and the related term-corresponding information.
摘要:
A forensic system includes a result information receiving unit that receives result information which is a determination result of connection between a lawsuit and a document group including a predetermined number of documents, which is extracted from document data included in digital information, by a user, an element selection unit that calculates evaluation values of elements which commonly appear in the document group in each result information item from the characteristics of the elements and selects the elements on the basis of the evaluation values, a score calculation unit that calculates a score of each document in the document data from the selected elements included in each document of the document data and the evaluation values of the selected elements, and a recall ratio calculation unit that calculates a recall ratio related to the determination of the connection to the lawsuit on the basis of the score.
摘要:
A digital information analysis system includes a target selection unit that selects target digital information, a combination storage unit that stores each of a plurality of word combinations related to a predetermined specific item, a search unit that searches whether the plurality of word combinations stored in the combination storage unit are included in the target digital information selected by the target selection unit, a relation determination unit that determines the relation of the target digital information to the predetermined specific item on the basis of a morphological analysis result when the plurality of word combinations stored in the combination storage unit are included in the target digital information, and a determination result setting unit that associates the determination result of the relation determination unit with the target digital information.
摘要:
Embodiments of the inventive concept can extract digital document information related with a specific individual to achieve a work load reduction associated with evidentiary material preparation for litigation. A specific individual is selected from at least one individual included in user information. Only digital document information which was accessed by the specific individual is extracted based on access history information regarding the selected specific individual. Additional information indicating whether or not document files in the extracted digital document information are each related with the litigation is set, and a document file related with the litigation is outputted based on the additional information.