摘要:
The present invention is directed to a method and apparatus for establishing documents relationship based on user's operation upon search result. When a user uses search engine to search for documents with a query in repository, the search result may be a list of ranked documents, and these documents may contain a lot of relationship in term of the specific query. If the user clicks some search result further, and if the click and open operation meet certain conditions, for example exceed a period of time, the clicked document could be deemed as related to the search query. Furthermore it could be inferred that there is a strong relationship between different documents clicked by the user. The present invention records the relationship between documents and presents it to the user when necessary.
摘要:
The present invention is directed to a method and apparatus for establishing documents relationship based on user's operation upon search result. When a user uses search engine to search for documents with a query in repository, the search result may be a list of ranked documents, and these documents may contain a lot of relationship in term of the specific query. If the user clicks some search result further, and if the click and open operation meet certain conditions, for example exceed a period of time, the clicked document could be deemed as related to the search query. Furthermore it could be inferred that there is a strong relationship between different documents clicked by the user. The present invention records the relationship between documents and presents it to the user when necessary.
摘要:
A method and apparatus for preprocessing a plurality of documents for search and presenting search result and a system for searching documents that comprises these apparatuses. The search result, for example, includes at least one candidate document. The candidate document is assigned a tree structure representing its content. The tree structure includes at least one node. The method may include presenting at least a portion of the tree structure corresponded to the candidate document in the search result.
摘要:
A method, system and program storage device are provided for extending an inverted index, which comprises first and second inverted index subfiles to increase the speed of establishing and updating inverted index files. The method includes performing ordered keyword indexing operations of generating an inverted index from data sources, in which a frequency of occurrence of keywords in each of the data sources is calculated, and writing each keyword, the data sources, and the frequency of occurrence of each keyword in the corresponding data sources to the inverted index. If a number of data sources involved in the indexing operations reaches a first threshold, then writing contents of the inverted index as a smallest grid into the first inverted index subfile. If a number of smallest grids in the first inverted index subfile reaches a second threshold, then merging the smallest grids into a merged grid and writing the merged grid into the second inverted index subfile. If the number of merged grids in the second inverted index subfile reaches a third threshold, then further merging the merged grids into a larger merged grid, and writing the larger merged grid back into the first inverted index subfile.
摘要:
The present invention provides a search ranking method suitable for a file system, including receiving a query, calculating final relevance scores of individual file items with respect to the query at least partially in accordance with energy scores of individual nodes on a current file system energy tree, and outputting a list of search results based on the final relevance scores. The file system energy tree is updated in response to an operation on the file system performed by a user, wherein the file system energy tree has a tree structure corresponding to that of the file system, and the individual nodes thereof respectively corresponds to the individual file items in the file system
摘要:
The present invention provides a search ranking method suitable for a file system, comprising: receiving a query; calculating final relevance scores of individual file items with respect to the query at least partially in accordance with energy scores of individual nodes on a current file system energy tree, and outputting a list of search results based on the final relevance scores; and updating the file system energy tree in response to an operation on the file system performed by a user, wherein the file system energy tree has a tree structure corresponding to that of the file system, and the individual nodes thereof respectively corresponds to the individual file items in the file system. The present invention also provides a corresponding file system search engine and computer program product. With the present invention, files and file folders that the user is interested in are usually arranged in relatively higher positions of the list of search results in file system search. Moreover, with the increase in the user's clicks on the file, the list of search results can be dynamically adapted to changes in the user's interest or preference.
摘要:
A computer-implemented method and system for checking a chemical name. The method tokenizes the chemical name to obtain corresponding tokens; checks the chemical name according to the chemical association between chemical compositions represented by the tokens; and if the chemical name does not pass the check, replaces at least part of tokens of the chemical name that does not pass the check, and repeats the checking step. The system and method can not only help users to find and correct errors in spelling a chemical name but also check the entire chemical name at the level of chemical associations. Hence, not only chemical names that are incorrectly spelled but also ones that do not conform to chemical rules can be found, and significant help is provided to users for correcting chemical names.
摘要:
A tagging method and apparatus, including computer program products, based on a structured data set are provided, the tagging method comprising: creating classification models for respective nodes in the structured data set of an event; acquiring public opinions on the event; and tagging the opinions to corresponding nodes of the structured data set using the created classification models. The tagging method and apparatus of the present disclosure are able to provide well-ordered, focused public opinions for each event to users, and to exhibit the evolution of the public opinions along with time.
摘要:
Tagging methods and apparatus, including computer program products, based on a structured data set. Classification models are created for respective nodes in the structured data set of an event. Public opinions on the event are acquired. The opinions are tagged to corresponding nodes of the structured data set using the created classification models. The tagging methods and apparatus provide well-ordered, focused public opinions for each event to users, and exhibit the evolution of the public opinions along with time.
摘要:
Techniques for processing geographical location data in a document comprise: obtaining geographical location data in the document; grading the geographical location data according to a predetermined condition to determine an associated relationship between the geographical location data; marking on an electronic map the associated relationship between the geographical location data; and presenting the marked electronic map.