摘要:
In the method and system for knowledge extraction of this invention, knowledge extraction is realized through acquiring an initial sentence group including one or more sentences, and then comparing the length of the initial sentence group with an expected length to determine the initial sentence group to be expanded according to the comparison result. Since the sentence groups are formed by consecutive sentences, it may be guaranteed that the sentence groups themselves have good coherence in logic, so that the final sentence groups obtained through expanding the initial sentence groups have good coherence in logic correspondingly. Thus, this invention may override the drawback of lacking logical coherence in extracted knowledge information in the prior art.
摘要:
The present invention provides a semantic information acquisition method and system, and corresponding keyword expansion and search methods and systems, comprising: searching for, then classifying an article; then, performing word segmentation according to the classified article to obtain the words in said category, and setting said category and words to serve as the semantic information of the keyword; also, a method and system using the semantic information acquisition method to expand a keyword, and a method and system using keyword expansion to perform a search. The described semantic information acquisition method effectively avoids the technical problems in the prior art of only being able to obtain semantic information of English vocabulary; and it also being impossible to classify semantic information based on category information. The invention is particularly suitable for searching using a keyword, searching a large number of texts, and organizing large amounts of related data and information.
摘要:
A method and system for key knowledge point recommendation are provided, the method comprising calculating knowledge point relationship strengths of knowledge points in a set of knowledge points; calculating weights for knowledge points according to the knowledge point relationship strengths of knowledge points in the set of knowledge points, and storing the knowledge points and weights correspondingly; determining key knowledge points according to the weights of the knowledge points and recommending the key knowledge points to a user. With this solution, knowledge point relationship strengths are obtained through calculating knowledge point relationship strengths of knowledge points in a set of knowledge points; and recommendation is given to the user for learning knowledge according to knowledge point relationship strengths, so as to help the user to learn key knowledge points selectively in a more objective and effective manner, and avoid problems of information recommendation based on fuzzy logical information recommendation technology.
摘要:
A method and system for obtaining a knowledge point implicit relationship are provided; first, establishing a knowledge point explicit relationship map according to knowledge point explicit relationship strengths; second, computing according to said knowledge point explicit relationship map a simple path set of two knowledge points; then, computing the implicit relationship strength values corresponding to each simple path in said simple path set; further, comparing the relationship strength values of the simple paths and setting as the significant implicit relationship strength value the simple path relationship strength having the largest value also greater than a preset threshold value. The described solution effectively avoids the problems of only using the relationship strengths between knowledge points and the ratio of relationship strengths to obtain the implicit relationship of knowledge points, the manner of searching for an implicit relationship being insufficiently accurate, and not performing normalization processing on the relationship strengths.
摘要:
A logic process apparatus for composite graphs in a fixed layout document is provided in this invention, comprising: a composite graph block extraction unit, for extracting composite graph blocks from the fixed layout document; a document parsing unit, for parsing the fixed layout document to obtain text primitives contained therein; a legend primitive extraction unit, for extracting legend primitives from the text primitives; a correlation detection unit, for detecting correlations between the composite graph blocks and the legend primitives; a correlation storage unit, for storing the detected correlations. A logic process method for composite graphs in a fixed layout document is also provided.
摘要:
The present invention provides a method and system of measuring knowledge point relationship strength, the method comprising calculating explicit relationship strength for all knowledge points and generating a knowledge point relationship strength matrix M; constructing a weighted and directed graph G according to the knowledge point relationship strength matrix of all knowledge points; calculating knowledge point implicit relationship strength values according to the weighted and directed graph and generating a knowledge point implicit relationship strength matrix I; traversing the knowledge point implicit relationship strength matrix I and updating the knowledge point relationship strength matrix M. The above technical solution may effectively avoid the problem of lack of an absolute measurable value for the determination of relationship strength, incorrect measurement of relationship strength, or unable to discover some stronger relationship strength in the prior art.
摘要:
Provided is a table recognizing method, comprising: parsing and analyzing metadata information in an original fixed-layout document, and extracting basic elements on a page of the document; segmenting the basic elements, extracting segmented text lines on the page, and acquiring fragments; constructing an undirected graph with respect to each of the fragments; extracting an image on the page, detecting intersection points of horizontal lines and vertical lines, detecting an external bounding box of the intersection points, and taking whether the segmented text lines fall within the external bounding box as local relationship features; training a learning model according to the local relationship features, local features of the fragments, and neighborhood relationship features among the fragments, acquiring model parameters, and establishing a table recognizing model; and invoking the table recognizing model to perform table recognizing for the document, and acquiring a recognizing result.
摘要:
A list recognizing method and system, which comprises: parsing and analyzing metadata information within an original fixed-layout document, and extracting basic elements within a page; segmenting the basic elements, extracting segmented text lines within the page to obtain fragments; building an undirected graph with respect to the fragments; detecting indent features of a bullet according to features of the basic elements; training a learning model according to the indent features, local features of the fragments and neighborhood relation features among the fragments, obtaining model parameters, and establishing a list recognizing model; and invoking the list recognizing model to perform list recognizing on the required document, so as to get recognition result. This machine learning method may recognize not only a list, but also the contextual relationship between the first line and its subsequent lines of a list, and realize analyzing and understanding a layout of the list of the fixed-layout document ultimately. The accuracy of list recognizing on a fixed-layout document can be improved even if the bullets of the first line of the list are various.
摘要:
The present invention provides a method and an apparatus for detecting a traffic monitoring video. The method comprises: determining a background reference model; determining a target area image in the traffic monitoring video according to the background reference model; updating the background reference model by using the target area image; summating all target points in detection area of each frame of image in the traffic monitoring video according to the updated background reference model to obtain a total area of all the target points; segmenting the frame with the biggest total area to obtain a target area at the best position; and extracting vehicle information from the target area at the best position. By using the present invention, the accuracy of a detection result in a complex environment may be improved.
摘要:
The present invention provides a method and an apparatus for detecting traffic video information. The method includes: acquiring a traffic video stream; determining color features of each frame of image in the traffic video stream; calculating the inter-frame distance between adjacent frames according to the color features; calculating the boundary of an image clustered frames' group according to the inter-frame distance by adopting an image clustering evaluation standard in RGB space and an image clustering evaluation standard in YUV space respectively; and determining a final boundary of the image clustered frames' group according to the boundaries of the image clustered frames' group in RGB space and YUV space. By using the present invention, the stability of detection results in different environments may be improved.