Abstract:
Systems and methods (e.g., utilities) for use in providing automated, lightweight collection of online, open source data which may be content-based to reduce website source bias. In one aspect, a utility is disclosed for use in extracting content of interest from at least one website or other online data source (e.g., where the extracted content can be used in a subsequent search query). In other aspects, utilities are disclosed that are operable to perform various types of analyses on such extracted content and present graphical representations of such analyses on a display of a client device.
Abstract:
A method and system for web mining and clustering are described. The method includes receiving input data and dividing it into a plurality of primitive datasets. One or more combinations of the primitive datasets may then be created, and a model may be generated for each primitive dataset and for each combination. A cost associated with each of these models may then be computed. The sum of the costs of the models for the individual primitive datasets may be compared with the cost of the model for each combination. Finally, based on this comparison, the primitive datasets may be partitioned into one or more clusters such that each primitive dataset either belongs to a cluster or remains a stand-alone primitive dataset.
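The cost comparison in this abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption, not the patented method: the model is a single mean, the cost is a fixed per-model penalty (`MODEL_PENALTY`) plus squared error, and combinations are explored greedily, one pair at a time.

```python
from itertools import combinations

MODEL_PENALTY = 1.0  # assumed fixed cost of describing one model


def cost(data):
    """Cost of a one-parameter (mean) model: penalty plus squared error."""
    mean = sum(data) / len(data)
    return MODEL_PENALTY + sum((x - mean) ** 2 for x in data)


def cluster(primitives):
    """Greedily merge groups while a combined model is cheaper than separate models.

    `primitives` maps a dataset name to its list of numeric values.
    """
    clusters = [[name] for name in primitives]

    def pooled(group):
        # All values belonging to the primitive datasets in this group.
        return [x for name in group for x in primitives[name]]

    while True:
        best = None
        for a, b in combinations(clusters, 2):
            # Sum of the separate costs minus the cost of the combined model.
            saving = cost(pooled(a)) + cost(pooled(b)) - cost(pooled(a + b))
            if saving > 0 and (best is None or saving > best[0]):
                best = (saving, a, b)
        if best is None:
            return clusters  # no remaining merge reduces the total cost
        _, a, b = best
        clusters = [c for c in clusters if c is not a and c is not b] + [a + b]
```

With this cost, two datasets are merged only when sharing one model saves more than the extra error it introduces, so dissimilar datasets stay stand-alone.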
Abstract:
One website mining embodiment characterizes first-time users of a website by collecting user session data of users visiting the website and identifying first-time visitors, determining features of the first-time visitors from the user session data, determining rules from those features, monitoring the actions of the first-time visitors on the website, updating the rules based on the monitored actions, and recommending web content to the first-time visitor using the rules.
Abstract:
Systems, apparatus, and methods for referring physicians based on hierarchical disease profile matching are disclosed. An example system includes a data store to include a plurality of disease profiles, each disease profile associated with a patient condition, a user interface to accept a user request for a referral of a patient to a physician, and a referral processor to compare a profile associated with the patient including a patient symptom to the plurality of disease profiles to generate one or more physician recommendations for referral, the referral processor to refine the one or more physician recommendations based on one or more characteristics associated with each of the one or more physician recommendations, the referral processor to provide the refined one or more physician recommendations to a user for review and selection via the user interface.
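A rough sketch of the match-then-refine flow follows. It is deliberately flat and does not model the hierarchical aspect of the profiles. The profile data, the physician records, the Jaccard similarity measure, and the refinement criteria (accepting new patients, distance) are all hypothetical choices made for illustration.

```python
# Hypothetical disease profiles, each tied to a condition and a specialty.
DISEASE_PROFILES = {
    "migraine": {"symptoms": {"headache", "nausea", "light sensitivity"},
                 "specialty": "neurology"},
    "gastritis": {"symptoms": {"nausea", "stomach pain"},
                  "specialty": "gastroenterology"},
}

# Hypothetical physician records with characteristics used for refinement.
PHYSICIANS = [
    {"name": "Dr. A", "specialty": "neurology", "accepting": True, "distance_km": 12},
    {"name": "Dr. B", "specialty": "neurology", "accepting": False, "distance_km": 3},
    {"name": "Dr. C", "specialty": "gastroenterology", "accepting": True, "distance_km": 5},
]


def jaccard(a, b):
    """Set overlap in [0, 1]; an assumed similarity measure."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def recommend(symptoms, top_profiles=1):
    """Match patient symptoms to profiles, then refine the physician list."""
    ranked = sorted(
        DISEASE_PROFILES.values(),
        key=lambda profile: jaccard(symptoms, profile["symptoms"]),
        reverse=True,
    )
    specialties = {profile["specialty"] for profile in ranked[:top_profiles]}
    candidates = [d for d in PHYSICIANS if d["specialty"] in specialties]
    # Refinement: keep physicians accepting patients, nearest first.
    refined = [d for d in candidates if d["accepting"]]
    return [d["name"] for d in sorted(refined, key=lambda d: d["distance_km"])]
```

The returned list stands in for the refined recommendations that the abstract describes presenting to the user for review and selection.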
Abstract:
A system and method for conducting a computerized search, including: receiving a query from a user; classifying the query; augmenting the query based on the classification; issuing the query to a search engine; and conducting a search based on the augmented query. Alternatively, a system and method for conducting a computerized search, including: receiving a search query; analyzing a knowledge base; modifying the search query based on the analysis of the knowledge base; issuing the modified search query to a search engine; and conducting a search via the search engine based on the modified search query to generate search results.
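The classify-then-augment steps might look like the sketch below. The keyword-based classifier, the category names, and the augmentation terms are assumptions for illustration; the abstract does not specify how classification or augmentation is performed, and issuing the query to a search engine is left out.

```python
# Hypothetical category keywords and augmentation terms.
CATEGORY_KEYWORDS = {
    "medical": {"symptom", "treatment", "dosage"},
    "travel": {"flight", "hotel", "visa"},
}
AUGMENT_TERMS = {
    "medical": ["clinical", "guideline"],
    "travel": ["booking", "itinerary"],
}


def classify(query):
    """Return the category with the most keyword hits, or None if no hits."""
    tokens = set(query.lower().split())
    best, best_hits = None, 0
    for category, keywords in CATEGORY_KEYWORDS.items():
        hits = len(tokens & keywords)
        if hits > best_hits:
            best, best_hits = category, hits
    return best


def augment(query):
    """Append category-specific terms that the query does not already contain."""
    category = classify(query)
    if category is None:
        return query  # unclassified queries pass through unchanged
    present = set(query.lower().split())
    extra = [term for term in AUGMENT_TERMS[category] if term not in present]
    return " ".join([query] + extra)
```

For example, `augment("cheap flight to Lisbon")` returns `"cheap flight to Lisbon booking itinerary"`, which would then be the string issued to the search engine.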
Abstract:
A method for intent mining is provided. The method includes performing a preliminary search of a constrained source using one or more seed phrases to generate multiple preliminary search results representing different ways of expressing a desired intent. The method also includes identifying each of the multiple preliminary search results that expresses the desired intent to generate multiple intent results. The method also includes producing multiple action search strings around one or more action verbs in each of the multiple intent results. The method further includes applying each of the multiple action search strings to one or more non-constrained sources to generate multiple action search results.
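The four steps of this method can be sketched over in-memory text collections. The substring-based `search`, the `ACTION_VERBS` list, the one-word context window, and the caller-supplied `expresses_intent` predicate are all assumptions standing in for components the abstract leaves unspecified.

```python
import re

ACTION_VERBS = {"buy", "purchase", "order"}  # assumed verb list


def search(corpus, phrase):
    """Return the documents that contain the phrase, case-insensitively."""
    return [doc for doc in corpus if phrase.lower() in doc.lower()]


def action_strings(text, window=1):
    """Build a search string from each action verb plus `window` words per side."""
    words = re.findall(r"[a-z']+", text.lower())
    return [
        " ".join(words[max(0, i - window): i + window + 1])
        for i, word in enumerate(words)
        if word in ACTION_VERBS
    ]


def mine_intent(seeds, constrained, open_corpus, expresses_intent):
    """Seed search, intent filtering, action strings, then open-source search."""
    preliminary = [doc for seed in seeds for doc in search(constrained, seed)]
    intent_results = [doc for doc in preliminary if expresses_intent(doc)]
    strings = sorted({s for doc in intent_results for s in action_strings(doc)})
    return {s: search(open_corpus, s) for s in strings}
```

The result maps each action search string to the documents it matched in the non-constrained collection.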
Abstract:
Mining of websites that in one embodiment includes obtaining web usage data of user sessions of a website, wherein the website has a hierarchical structure with granular levels and a mapping from each webpage of the website into the hierarchical structure, mapping the user sessions to the hierarchical structure of the website to produce hierarchical user sessions, applying an edit distance metric to determine similarity among the hierarchical user sessions, and clustering similar hierarchical user sessions into groups.
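A minimal sketch of this pipeline follows. The page-to-hierarchy mapping (`PAGE_TO_PATH`), the choice of Levenshtein distance as the edit distance metric, and the simple threshold-based grouping are assumptions for illustration; the abstract does not name a specific metric or clustering algorithm.

```python
# Hypothetical mapping from each page to its place in the site hierarchy.
PAGE_TO_PATH = {
    "/p/101": ("shop", "shoes", "running"),
    "/p/102": ("shop", "shoes", "hiking"),
    "/p/201": ("shop", "bags", "travel"),
    "/help/1": ("support", "faq", "returns"),
}


def to_hierarchical(session, level=2):
    """Map each visited page to its hierarchy path, truncated to `level`."""
    return [PAGE_TO_PATH[page][:level] for page in session]


def edit_distance(a, b):
    """Levenshtein distance between two sequences (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        cur = [i]
        for j, y in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (x != y)))
        prev = cur
    return prev[-1]


def cluster_sessions(sessions, max_distance=1, level=2):
    """Assign each session to the first cluster whose first member is close enough."""
    clusters = []  # each cluster is a list of (session index, hierarchical session)
    for idx, session in enumerate(sessions):
        hier = to_hierarchical(session, level)
        for members in clusters:
            if edit_distance(hier, members[0][1]) <= max_distance:
                members.append((idx, hier))
                break
        else:
            clusters.append([(idx, hier)])
    return [[idx for idx, _ in members] for members in clusters]
```

Raising `level` compares sessions at a finer granularity, so sessions that look identical at the category level can separate once individual products are distinguished.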