摘要:
A system for full text indexing optimization that operates based on identification of idle and active content in a content source, and by prioritizing indexing of idle content over active content. Active and idle content items are automatically identified, and idle content items are given a higher priority for indexing, while active content items are given a lower priority. Active content items are generally those that are considered relatively more likely to be located by the user without using the full text indexing function, while idle content items are those content items that are relatively more likely to be located through use of the full text indexing function. The specific content item attributes that are used to determine whether a given content item is active or idle may depend on the type content source for which the full text index is being provided. Additionally, the determination of which content items are active and which are idle may be based on predetermined, static criteria, and/or dynamically determined use patterns determined by monitoring operations performed on content items by a user.
摘要:
When a message having at least one attachment is obtained for indexing, it is indexed as N+1 separate documents, where N is the number of attached documents. If the message is part of a message thread, then information regarding the last message in the thread is retrieved, and search index attachment meta data for the last message is extracted. A unique identifier is computed for the newly obtained attachments, and used to search for matches in the attachments for the last message in the thread. If there is a match, then the newly obtained attachment is not indexed, but the unique identifier of the previously indexed matching attachment is added to a body index document for the new message. A unique identifier associated with the new message is also added to a list of parent identifiers associated with the attachment.
摘要:
A method and system for sharing full text index entries across application boundaries in which documents are obtained by a shared, platform level indexing service, and a determination is made as to whether the received documents are duplicates with regard to previously indexed documents. If a document is determined to be a duplicate, the index representation of the previously indexed copy of the document is modified to indicate that the document is also associated with another application or context. If a document is not a duplicate of a previously indexed document, the document is indexed to support future searches and/or other processing. The index representation of a document includes application category identifiers associating one or more applications or contexts with the document. When a document is indexed, one or more category identifiers are generated and stored in association with that document. The category identifiers for an indexed document may, for example, represent an application that received, stored, or otherwise processed that document. The application category identifiers enable category specific searching by applications sharing a common search index. A software category filter may be provided to process search results from the shared search index, so that only documents associated with certain categories are returned. Accordingly, one or more search categories may be determined for a given search query, based on an application generating the search query, or some other context information, and then used to filter the search results provided from the shared search index.
摘要:
A system and a method for providing user interest based search index optimization. The system includes a server node configured to transmit electonic mail, and a client node having an electronic recepticle and a dynamic interest profile member (DIP). The client node is interconnected to the server node via a network. The client node is configured to receive in the electronic receptacle the transmitted electronic mail. The DIP is configured to assign a DIP ranking to each piece of received electronic mail predicated upon at least one of, (i) the identity of the sender, and (ii) the keywords listed in the contents of the electronic mail. The DIP ranking of the electronic mail is compared to a DIP threshold; the electronic mail is added to a full text index located in the client node when the DIP ranking of the electronic mail exceeds the DIP threshold.
摘要:
A method and system for sharing search index entries across multiple computer systems organized in a peer to peer network, in which unique content is indexed only once, even though the content may be physically duplicated in multiple computer systems in the peer to peer network. When files are obtained by a shared indexing service, and a determination is made as to whether the received files are duplicates with regard to previously indexed files. If a file is determined to be a duplicate, the index representation of the previously indexed copy of the file is modified to indicate that the file is also associated with another computer system in the peer to peer network. If a file is not a duplicate of a previously indexed file, the file is indexed to support future searches. The index representation of a file includes category identifiers associating one or more computer systems with the file. When a file is indexed, one or more category identifiers are generated and stored in association with that file. The category identifiers for an indexed file may represent host computer systems on which copies of the file are stored. The category identifiers enable location specific searching by computer systems in a peer to peer network sharing a common search index. A software category filter may be provided to process search results from the shared search index, so that only files associated with certain categories are returned.
摘要:
A network device such as a server monitors user behavior and provides subscriptions to documents based at least in part on that behavior. Subscription triggers include requesting at least a predetermined percentage of the entire document, requesting at least a predetermined number of pages in a document, and requesting a document at least a predetermined number of times. If documents are organized in a root and branch structure, the subscription may be triggered based on the extent to which documents are requested in the structure, e.g., triggered upon request for a branch document related to a root document. Subscriptions may also be limited to a portion of a document, such as a branch document associated with a root document.
摘要:
A system and method allows an email user to solicit an email thread from another user (or from a server) in situations where the first requesting user may have been added to an email thread after the email thread was started, or in situations where the user may have an incomplete thread (for example, where a user is being added to a thread where an attachment was removed). The complete thread is returned to the first requesting user, after permission has been sought and granted, and after data mining and intelligent aggregation of a plurality of relevant threads in to one interleaved result is performed and returned to the first requesting user.
摘要:
A system for adding new recipients to an e-mail thread. The system includes a server device configured to administer an electronic mail (E-MAIL) application. At least one client device being operably associated via a network with the server device. The at least one client device configured to add new E-MAIL recipients to the E-MAIL thread of the distributed E-MAIL message via a user interface (UI) having a menu selection for adding recipients to the thread. The UI being operably associated with the client device and the server device. The client device is further configured to add the identity of the new E-MAIL recipient to a persistent store table located on the client device.
摘要:
A system and method allows an email user to solicit an email thread from another user (or from a server) in situations where the first requesting user may have been added to an email thread after the email thread was started, or in situations where the user may have an incomplete thread (for example, where a user is being added to a thread where an attachment was removed). The complete thread is returned to the first requesting user, after permission has been sought and granted, and after data mining and intelligent aggregation of a plurality of relevant threads in to one interleaved result is performed and returned to the first requesting user.
摘要:
A system that enables a user to conveniently reduce the size of one or more mailboxes stored on the client computer system. The user can indicate that “unimportant” email messages are to be removed from one or more mailboxes. Unimportant email messages are automatically removed before important ones. To calculate the importance of a message for a given user, message attributes such as users associated with and/or terms contained in the message are compared against a personal interest profile associated with the local user. Additionally, dates associated with the message, such as when the message was added to the mailbox, and/or when the message was last read, may be used to further determine message importance. The size of a message may also be considered when determining whether a message should be removed.