摘要:
A method, software and apparatus are provided which enable a user to obtain products and services in a manner which counteracts conflicting actions modifying raw ranking data based on merchants interests. For this purpose, an information a user's web browser is provided with modules that modify search results obtained from one or more sources including directly from merchants. When the user enters a set of key words in an entry field in an on-screen form for a web server to obtain a list of items (products and/or services) of interest to the user, the search terms are modified in the users or his agents browser to reflect the interest of the user, his management or his parents. The user or his agent receives ranking information taking the highest ranked products and obtains the product information for the most highly ranked products. The browser then reranks the products using a ranking algorithm preferred by the user that offsets information sources biases in the ranking of the product information. The algorithm adds weighting factors reflecting the modified search terms. The weighting factors are combined with the mentioned ranking mechanisms to the documents to increase the probability that certain items come to the top when the search results are presented to the shopper.
摘要:
A random document is stripped of the relevant search terms to generate a non-relevant document. The relevant search terms are formed into grammatically correct but not necessarily technically correct sentences. The grammatically correct sentences are placed at the beginning of the random document in one pass through the system in the middle of a document in a second pass through the system and at the end of the document in a third pass through the system. A relevancy vector chart is computed using the references documents and a known relevancy algorithm relating to position of search terms. The results obtained from search engines are compared to the relevancy vector chart to determine the relative relevancy of the returned search results from the search engines.
摘要:
A document search and retrieval system and program product therefor. Search requests are provided to the system through a user interface. A document decomposer decomposes documents into individual document components. Document components and corresponding searchable indices for each are stored in a Component Library. A search unit searches stored document components responsive to search queries. A results validator compares document hitlists with a document type identified in a search query to select valid hitlists entries for a final hitlist. A document view assembly module collects identified document components and assembles them into a document for view at the user interface.
摘要:
A search engine system is provided in which inputted data in various configurations is first passed through transitional translation layers which convert the data in its multiple configurations to a common or kernel computer form and language. The data is operated on and stored in the kernel form and language. The operated on data is then outputted through transitional translation layers and provided in the desired format the same or different from the input form and language.
摘要:
Techniques are presented that include defining one or more personalized categories, wherein each personalized category comprises one or more terms, identifying one or more of said personalized categories that are associated with a user query, identifying one or more terms that are associated with said user query and with said personalized category, and adding said identified one or more terms to said user query.
摘要:
Techniques are presented for automatically selecting information sources that are most relevant to user queries. Results of searches returned by information sources for queries are analyzed and the information sources are ranked based on this analysis. The information sources that have high rankings for a query are subsequently used to search for relevant results. This process can be adaptive, as the returned results of old queries can be analyzed at a later date to update the ranking of the information sources, automatic searches can be performed to update the ranking of the information sources, new queries can be used for analysis and stored, new information sources added, and old information sources deleted. A linguistic library is used to store personal categories for one or more users and general categories. Each category is associated with keywords and ranked lists of information sources. The library also contains general categories, taxonomies, and dictionaries.
摘要:
Techniques are presented for automatically selecting information sources that are most relevant to user queries. Results of searches returned by information sources for queries are analyzed and the information sources are ranked based on this analysis. The information sources that have high rankings for a query are subsequently used to search for relevant results. This process can be adaptive, as the returned results of old queries can be analyzed at a later date to update the ranking of the information sources, automatic searches can be performed to update the ranking of the information sources, new queries can be used for analysis and stored, new information sources added, and old information sources deleted. A linguistic library is used to store personal categories for one or more users and general categories. Each category is associated with keywords and ranked lists of information sources. The library also contains general categories, taxonomies, and dictionaries.
摘要:
Techniques are presented that include sending a query to one or more information sources associated with a category comprising one or more terms, wherein said one or more information sources are identified by a ranked list of information sources, and wherein said ranked list is generated by ranking a plurality of information sources based on scores for results returned by said plurality of information sources in response to a ranking query generated from said category, and obtaining results from said one or more information sources.
摘要:
Search time is reduced with a search engine that includes a bi-directional inverted index facility which can be accessed with a keyword search in one of a number of languages and provide a listing of documents contained in all of those languages. The keywords in all supported languages are preferably stored in an inverted index lookup table cross referenced to documents in those language containing the keywords. Keywords with the same meaning in different languages are accessible together when that keyword in one of the languages is queried. The search engine containing the table can identify pertinent documents either in a selected language, a second language or in all supported languages, as determined by the user. Information about each document can include not only the identity of the document but also information used in ranking the documents such as the number of times that a keyword appears in that document, and the keywords proximity to other keywords. The use of the inverted index table therefore reduces search time by eliminating the need for translation of keywords, their identification in documents and accumulating of ranking information at search runtime and avoids inaccuracies which may result from full text translations of documents.
摘要:
Euclidean analysis is used to define queries in terms of a multi-axis query space where each of the keywords T1, T2, . . . Ti, . . . Tn is assigned an axis in that space. Sets of test queries St for each one from one of a plurality of server sources, are plotted in the query space. Clusters of the search terms are identified based on the proximity of the plotted query vectors to one another. Predominant servers are identified for each of the clusters. When a search query Ss is received, the location of its vector is determined and the servers accessed by the search query Ss are those that are predominant in the cluster which its vector may fall or is in closest proximity to.