Abstract:
When a message having at least one attachment is obtained for indexing, it is indexed as N+1 separate documents, where N is the number of attached documents. If the message is part of a message thread, then information regarding the last message in the thread is retrieved, and search index attachment meta data for the last message is extracted. A unique identifier is computed for the newly obtained attachments, and used to search for matches in the attachments for the last message in the thread. If there is a match, then the newly obtained attachment is not indexed, but the unique identifier of the previously indexed matching attachment is added to a body index document for the new message. A unique identifier associated with the new message is also added to a list of parent identifiers associated with the attachment.
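The attachment deduplication described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the use of SHA-256 as the unique identifier, the two in-memory dictionaries standing in for the search index, and all class and function names are assumptions made for the example.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class AttachmentDoc:
    attachment_id: str                      # unique identifier (assumed: content hash)
    parent_ids: list = field(default_factory=list)


@dataclass
class BodyDoc:
    message_id: str
    attachment_ids: list = field(default_factory=list)


def attachment_id(data: bytes) -> str:
    """Compute a unique identifier for an attachment (assumption: SHA-256)."""
    return hashlib.sha256(data).hexdigest()


def index_message(bodies, attachments, message_id, payloads, last_in_thread=None):
    """Index one message and return how many index documents were newly written.

    bodies / attachments are plain dicts standing in for the search index.
    last_in_thread is the BodyDoc of the last message in the thread, if any.
    """
    body = BodyDoc(message_id)
    # Attachment metadata of the last message in the thread.
    known = set(last_in_thread.attachment_ids) if last_in_thread else set()
    written = 1  # the body document itself
    for data in payloads:
        aid = attachment_id(data)
        if aid in known:
            doc = attachments[aid]          # match: reuse the indexed attachment
        else:
            doc = AttachmentDoc(aid)        # no match: index as a new document
            attachments[aid] = doc
            written += 1
        doc.parent_ids.append(message_id)   # record the new message as a parent
        body.attachment_ids.append(aid)
    bodies[message_id] = body
    return written
```

A first message with two attachments produces three documents (N+1); a reply that re-sends one of those attachments and adds one new file produces only two.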
Abstract:
A method and system for sharing full text index entries across application boundaries in which documents are obtained by a shared, platform level indexing service, and a determination is made as to whether the received documents are duplicates with regard to previously indexed documents. If a document is determined to be a duplicate, the index representation of the previously indexed copy of the document is modified to indicate that the document is also associated with another application or context. If a document is not a duplicate of a previously indexed document, the document is indexed to support future searches and/or other processing. The index representation of a document includes application category identifiers associating one or more applications or contexts with the document. When a document is indexed, one or more category identifiers are generated and stored in association with that document. The category identifiers for an indexed document may, for example, represent an application that received, stored, or otherwise processed that document. The application category identifiers enable category specific searching by applications sharing a common search index. A software category filter may be provided to process search results from the shared search index, so that only documents associated with certain categories are returned. Accordingly, one or more search categories may be determined for a given search query, based on an application generating the search query, or some other context information, and then used to filter the search results provided from the shared search index.
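As a rough sketch of the shared index with category identifiers, the example below detects duplicates by content hash and filters search results by category. The hash-based duplicate test, the whitespace tokenizer, and the `SharedIndex` class and its method names are all assumptions for illustration; the abstract does not specify them.

```python
import hashlib
from collections import defaultdict


class SharedIndex:
    """Toy platform-level index shared by several applications."""

    def __init__(self):
        self.categories = {}                # document digest -> set of category ids
        self.postings = defaultdict(set)    # term -> set of document digests

    def add(self, text: str, category: str) -> bool:
        """Return True if the text was newly indexed, False if it was a duplicate."""
        digest = hashlib.sha256(text.encode("utf-8")).hexdigest()
        if digest in self.categories:
            # Duplicate: only extend the existing entry's category identifiers.
            self.categories[digest].add(category)
            return False
        self.categories[digest] = {category}
        for term in set(text.lower().split()):
            self.postings[term].add(digest)
        return True

    def search(self, term: str, allowed: set) -> list:
        """Category filter: keep only hits tagged with an allowed category."""
        hits = self.postings.get(term.lower(), set())
        return sorted(d for d in hits if self.categories[d] & allowed)
```

Here the `allowed` set would be derived from the application issuing the query, so a mail client searching the shared index sees only documents tagged with its own category.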
Abstract:
A system and a method for providing user interest based search index optimization. The system includes a server node configured to transmit electronic mail, and a client node having an electronic receptacle and a dynamic interest profile member (DIP). The client node is interconnected to the server node via a network. The client node is configured to receive the transmitted electronic mail in the electronic receptacle. The DIP is configured to assign a DIP ranking to each piece of received electronic mail based on at least one of (i) the identity of the sender and (ii) the keywords listed in the contents of the electronic mail. The DIP ranking of the electronic mail is compared to a DIP threshold; the electronic mail is added to a full text index located in the client node when the DIP ranking of the electronic mail exceeds the DIP threshold.
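The ranking and threshold test might look like the sketch below. The additive scoring scheme, the weight tables, and the `InterestProfile` name are assumptions chosen for illustration; the abstract states only that the ranking depends on the sender and/or keywords and is compared to a threshold.

```python
from dataclasses import dataclass, field


@dataclass
class InterestProfile:
    """Hypothetical interest profile with per-sender and per-keyword weights."""
    sender_weights: dict = field(default_factory=dict)
    keyword_weights: dict = field(default_factory=dict)
    threshold: float = 1.0

    def rank(self, sender: str, body: str) -> float:
        # Assumed scoring: sender weight plus the weights of keywords present.
        score = self.sender_weights.get(sender, 0.0)
        words = set(body.lower().split())
        score += sum(w for k, w in self.keyword_weights.items() if k in words)
        return score

    def should_index(self, sender: str, body: str) -> bool:
        # Mail is added to the full text index only if it exceeds the threshold.
        return self.rank(sender, body) > self.threshold
```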
Abstract:
A method and system for sharing search index entries across multiple computer systems organized in a peer to peer network, in which unique content is indexed only once, even though the content may be physically duplicated in multiple computer systems in the peer to peer network. Files are obtained by a shared indexing service, and a determination is made as to whether the received files are duplicates with regard to previously indexed files. If a file is determined to be a duplicate, the index representation of the previously indexed copy of the file is modified to indicate that the file is also associated with another computer system in the peer to peer network. If a file is not a duplicate of a previously indexed file, the file is indexed to support future searches. The index representation of a file includes category identifiers associating one or more computer systems with the file. When a file is indexed, one or more category identifiers are generated and stored in association with that file. The category identifiers for an indexed file may represent host computer systems on which copies of the file are stored. The category identifiers enable location specific searching by computer systems in a peer to peer network sharing a common search index. A software category filter may be provided to process search results from the shared search index, so that only files associated with certain categories are returned.
Abstract:
A system for full text indexing optimization that operates based on identification of idle and active content in a content source, and by prioritizing indexing of idle content over active content. Active and idle content items are automatically identified, and idle content items are given a higher priority for indexing, while active content items are given a lower priority. Active content items are generally those that are considered relatively more likely to be located by the user without using the full text indexing function, while idle content items are those content items that are relatively more likely to be located through use of the full text indexing function. The specific content item attributes that are used to determine whether a given content item is active or idle may depend on the type of content source for which the full text index is being provided. Additionally, the determination of which content items are active and which are idle may be based on predetermined, static criteria, and/or on use patterns dynamically determined by monitoring operations performed on content items by a user.
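One way to picture the idle-first prioritization is a priority queue keyed on an idle/active flag. The sketch assumes a single static criterion (time since last access, with a hypothetical 30-day cutoff) and a dictionary-based item representation; neither comes from the abstract.

```python
import heapq
import itertools

IDLE, ACTIVE = 0, 1  # lower value is indexed first


def classify(item, now, idle_after_seconds=30 * 24 * 3600):
    """Assumed static criterion: untouched for idle_after_seconds means idle."""
    return IDLE if now - item["last_access"] >= idle_after_seconds else ACTIVE


def indexing_order(items, now):
    """Return item names in indexing order: idle items first, then active ones."""
    counter = itertools.count()  # tie-breaker that preserves input order
    heap = [(classify(it, now), next(counter), it["name"]) for it in items]
    heapq.heapify(heap)
    return [heapq.heappop(heap)[2] for _ in range(len(heap))]
```

A dynamic variant could replace `classify` with a function fed by monitored user operations, leaving the queueing logic unchanged.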
Abstract:
A system for securing application information in a shared, system-wide search service. Each application can register a security filtering module that is to be used at search time to filter data associated with that application. When a user performs a search, initial, unfiltered search results are obtained based on the contents of the shared search index. The unfiltered search results are organized by application, and previously registered filter modules are called to perform user specific, per-application filtering on the initial results. The filter modules cause data to which the user issuing the search request does not have access to be removed from the search results, on a per application basis. Those of the initial search results that are determined in this way to not be accessible to the user issuing the search request are removed, resulting in a set of filtered search results that are presented to the user. The filtered search results thus contain indications only of data that is accessible to the user. In this way, the system-wide search service filters search results to remove indications of data which match the search criteria provided by the user, but to which the user does not have access, based on a conveniently extensible, per-application search result filtering process.
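The register-then-filter flow can be illustrated with a small sketch. The `SearchService` class, the `(app, item)` result tuples, and the callable filter signature are hypothetical. Dropping results from applications that never registered a filter is also an assumption (a fail-closed choice), not something the abstract states.

```python
from collections import defaultdict


class SearchService:
    """Toy system-wide search service with per-application security filters."""

    def __init__(self):
        self._filters = {}  # application name -> filter callable

    def register_filter(self, app, filter_fn):
        """filter_fn(user, items) returns the items that user may see."""
        self._filters[app] = filter_fn

    def filter_results(self, user, raw_results):
        """raw_results is a list of (app, item) pairs from the shared index."""
        # Organize the unfiltered results by application.
        by_app = defaultdict(list)
        for app, item in raw_results:
            by_app[app].append(item)
        visible = []
        for app, items in by_app.items():
            filter_fn = self._filters.get(app)
            if filter_fn is None:
                continue  # assumption: no registered filter means nothing is shown
            visible.extend((app, item) for item in filter_fn(user, items))
        return visible
```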
Abstract:
A system for social bookmarking of a resource provided through a Web application that doesn't follow the representational state transfer architectural style (REST). When processing a request for a social bookmarking operation, the system searches for a URL and potentially other resource information within the document object model of a currently loaded Web page. If such resource information is found, it is used to process the requested social bookmarking operation. Otherwise, if such resource information is not found in the document object model, the system uses the URL stored in the location bar of the Web browser to process the requested social bookmarking operation. The system includes semantic information in the DOM of a Web page that is used to identify subsets of content within the Web page that can be socially bookmarked. The semantic information may be stored in what is generally referred to as “microformatting”.
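The DOM lookup with location-bar fallback could be sketched as below. The abstract does not say which markup carries the resource information; this example assumes an anchor with `rel="bookmark"` and works on raw HTML with Python's standard `html.parser` rather than a live browser DOM. All names are illustrative.

```python
from html.parser import HTMLParser


class BookmarkFinder(HTMLParser):
    """Find the first <a rel="bookmark" href="..."> in a page (assumed marker)."""

    def __init__(self):
        super().__init__()
        self.url = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        rels = (attrs.get("rel") or "").split()
        if self.url is None and tag == "a" and "bookmark" in rels and attrs.get("href"):
            self.url = attrs["href"]


def resolve_bookmark_url(page_html, location_bar_url):
    """Prefer resource information in the page; otherwise use the location bar URL."""
    finder = BookmarkFinder()
    finder.feed(page_html)
    return finder.url or location_bar_url
```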
Abstract:
A document handling apparatus is disclosed which is particularly well adapted for counting documents of value, such as banknotes. The document handling apparatus comprises an input module for receiving a stack of documents and feeding them one by one into the apparatus, a detector box which houses a series of detectors for detecting characteristics of the documents, a transport system for conveying documents within the apparatus, a doubles detector for sensing the passage of each document therethrough and a stacker module for stacking the output documents and presenting them to the user. The apparatus is based on a metal framework in a plastics housing.
Abstract:
A computer-implemented method of bookmarking syndicated feeds. The method can include receiving a request to bookmark a first network resource and determining whether at least one syndicated feed is associated with the first network resource. When at least one syndicated feed is associated with the first network resource, the syndicated feed can be identified to a user and the user can be prompted to indicate whether to bookmark the syndicated feed. Responsive to receiving an indication from the user to bookmark the syndicated feed, a bookmark can be generated for the syndicated feed and the syndicated feed bookmark can be output.
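A minimal sketch of the feed-bookmarking flow follows. The abstract does not specify how associated feeds are detected; this example assumes the common autodiscovery convention of `<link rel="alternate">` elements with an RSS or Atom media type, and models the user prompt as a hypothetical `confirm` callback.

```python
from html.parser import HTMLParser

FEED_TYPES = {"application/rss+xml", "application/atom+xml"}


class FeedLinkParser(HTMLParser):
    """Collect href values of <link rel="alternate"> elements that point to feeds."""

    def __init__(self):
        super().__init__()
        self.feeds = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        rels = (attrs.get("rel") or "").lower().split()
        if (tag == "link" and "alternate" in rels
                and (attrs.get("type") or "").lower() in FEED_TYPES
                and attrs.get("href")):
            self.feeds.append(attrs["href"])


def find_feeds(page_html):
    """Determine which syndicated feeds, if any, are associated with the page."""
    parser = FeedLinkParser()
    parser.feed(page_html)
    return parser.feeds


def bookmark_feeds(page_html, confirm):
    """Create a bookmark for each detected feed that the user confirms."""
    return [{"type": "feed", "url": url} for url in find_feeds(page_html) if confirm(url)]
```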