摘要:
A system for securing application information in a shared, system-wide search service. Each application can register a security filtering module that is to be used at search time to filter data associated with that application. When a user performs a search, initial, unfiltered search results are obtained based the contents of the shared search index. The unfiltered search results are organized by application, and previously registered filter modules are called to perform user specific, per-application filtering on the initial results. The filter modules cause data to which the user issuing the search request does not have access to be removed from the search results, on a per application basis. Those of the initial search results that are determined in this way to not be accessible to the user issuing the search request are removed, resulting in a set of filtered search results that are presented to the user. The filtered search results thus contain indications only of data that is accessible to the user. In this way, the system-wide search service filters search results to remove indications of data which match the search criteria provided by the user, but to which the user does not have access, based on a conveniently extensible, per-application search result filtering process.
摘要:
A system for securing application information in a shared, system-wide search service. Each application can register a security filtering module that is to be used at search time to filter data associated with that application. When a user performs a search, initial, unfiltered search results are obtained based the contents of the shared search index. The unfiltered search results are organized by application, and previously registered filter modules are called to perform user specific, per-application filtering on the initial results. The filter modules cause data to which the user issuing the search request does not have access to be removed from the search results, on a per application basis. Those of the initial search results that are determined in this way to not be accessible to the user issuing the search request are removed, resulting in a set of filtered search results that are presented to the user. The filtered search results thus contain indications only of data that is accessible to the user. In this way, the system-wide search service filters search results to remove indications of data which match the search criteria provided by the user, but to which the user does not have access, based on a conveniently extensible, per-application search result filtering process.
摘要:
Embodiments of the present invention address deficiencies of the art in respect to crawling content and provide a novel and non-obvious method, system and computer program product for seed lists for hierarchically structured content sources. In one embodiment, a method for crawling seed lists for hierarchically structured content sources can be provided. The method can include specifying a depth of crawling for hierarchically structured content, crawling only seed lists at the specified depth among other seed lists in a hierarchy of seed lists mimicking the hierarchically structured content, and returning indexed data for the crawled seed lists. Optionally, an administrator user interface can be provided for specifying the depth of crawling for the hierarchically structured content.
摘要:
Embodiments of the present invention address deficiencies of the art in respect to crawling content and provide a novel and non-obvious method, system and computer program product for seed lists for hierarchically structured content sources. In one embodiment, a method for crawling seed lists for hierarchically structured content sources can be provided. The method can include specifying a depth of crawling for hierarchically structured content, crawling only seed lists at the specified depth among other seed lists in a hierarchy of seed lists mimicking the hierarchically structured content, and returning indexed data for the crawled seed lists. Optionally, an administrator user interface can be provided for specifying the depth of crawling for the hierarchically structured content.
摘要:
Embodiments of the present invention address deficiencies of the art in respect to crawling content and provide a method, system and computer program product for metadata processing for seed lists for structured content sources. In one embodiment, a method for processing metadata for a seed list can include extracting metadata from a seed list for application content, storing the metadata in a repository, associating the metadata with fields of the application content, crawling the fields of the application content by reference to the metadata, and indexing the fields. In an aspect of the embodiment, the method further can include annotating the application to produce metadata for the fields of the application content. In yet another aspect of the embodiment, the method can include mapping the metadata to a document schema generic to a plurality of heterogeneous application content.
摘要:
Device, system, and method of creating virtual social networks based on web-extracted features. For example, a method for creating virtual social networks based on web-extracted data includes: accessing through a global communication network a first content item and a second content item, wherein each one of the first and second content items is selected from the group consisting of: an image, a video, text, and metadata; extracting data corresponding to a first feature from the first content item; extracting data corresponding to a second feature from the second content item; and based on a common attribute of the first and second features, clustering into a cluster a first identifier of a first user associated with the first content item and a second identifier of a second user associated with the second content item.
摘要:
Folder recommendation in a file operation is accomplished by: receiving (325) information relating to the file operation; providing history data (321) of one or more previous operations, the data including details of information relating to a previous operation and a folder on which the previous operation was performed; and selecting a folder (324) for recommendation including matching the file operation to one or more folders based on the history data. In one embodiment, the operation is an operation of saving a file, and the information relating to the operation is file information. In another embodiment, the operation is an operation of retrieving a file, and the information relating to the operation is destination information.
摘要:
A method and system are provided for searching across independent applications. A first seedlist (202) is provided from a first application (201), the first seedlist (202) including a list of data items (211) for crawling by a search engine (220). The data items (211) are owned by the first application (201). A second seedlist (206) is also provided from a second independent application (205), the second seedlist (206) including annotation data (214) for crawling by a search engine (220). The annotation data (214) relates to the data items (211) of the first application (201). A search index (204) provides searchable data from the two or more applications (201, 205), the searchable data relating to the same set of data items (211).
摘要:
A system-level text correction service (330) is provided. The service (330) includes monitoring (410) the input of text (340) on a computer system, identifying an incorrect input word in a first application. Correction proposal words are retrieved from a system-level dictionary (320), wherein the system-level dictionary (320) stores a reference word with an identifier of at least one application type in which the reference word is used. The correction proposal words are ranked according to the similarity of the proposal word to the input word and by comparing the first application to the application types of the reference word.
摘要:
A method, system, and computer program product for measuring web site satisfaction of information needs are provided. The method includes: selecting a page for analysis; generating a page profile in the form of a list of keywords representing the page; generating a page traffic profile in the form of lists of keywords representing information needs of users, wherein the page traffic profile is generated from keywords used by users to visit the page; determining the success of users' visits to the page; and analyzing whether a page satisfies users' information needs by applying a distance measure between the keywords of the page profile and the keywords of the page traffic profile and combining the distance measure result with a success rate of the keywords.