摘要:
Methods for updating an information retrieval system are disclosed. In one embodiment, search terms affiliated with mappings or associations that represent a connection of relevancy between a query and an asset are pushed as content updates to a client system (e.g., as new updates or utilized to replace older data). The search terms are inserted (e.g., inserted as metadata) into corresponding content (the content associated with the asset). In this manner, content-searching data can be updated (e.g., remotely updated) as frequently as desired, even periodically, or selectively as new manually and/or automatically derived data becomes available. In another embodiment, the update data is already built into the content when it is delivered to a client machine. Other disclosed embodiments pertain to methods for generating a data mining classification model that is a blended representation of associations (e.g., query-asset associations) having different characteristics and/or different originating sources.
摘要:
The subject invention leverages data logging of responses to diagnostic reports to provide data that can be mined for diagnostic report quality information. Instances of the subject invention provide an initial diagnostic report assessment means to facilitate review by an entity. The entity's responses to the sorted diagnostic reports are logged unobtrusively to create diagnostic report quality data. This data is then analyzed by an analysis means that can then adjust the assessment means to improve its performance. In this manner, the performance of the assessment means is increased while reducing the workload of the entity reviewing the diagnostic reports. Other instances of the subject invention facilitate to increase the performance of a diagnostic report generating means as well. Instances of the subject invention can also employ machine learning techniques to facilitate in analyzing the quality data and/or in assessing the diagnostic reports.
摘要:
The subject invention leverages data logging of responses to diagnostic reports to provide data that can be mined for diagnostic report quality information. Instances of the subject invention provide an initial diagnostic report assessment means to facilitate review by an entity. The entity's responses to the sorted diagnostic reports are logged unobtrusively to create diagnostic report quality data. This data is then analyzed by an analysis means that can then adjust the assessment means to improve its performance. In this manner, the performance of the assessment means is increased while reducing the workload of the entity reviewing the diagnostic reports. Other instances of the subject invention facilitate to increase the performance of a diagnostic report generating means as well. Instances of the subject invention can also employ machine learning techniques to facilitate in analyzing the quality data and/or in assessing the diagnostic reports.
摘要:
A system for dynamically updating user accessible features of a software application on a client computer has a user interface, a local usage data file, and a data mining engine. The user interface is adapted to receive operator inputs. The local usage data file is adapted to store usage information corresponding to the operator inputs. The data mining engine is adapted to process the stored usage information and to generate local adjustments to a user interface of the software application based on the operator inputs. In one embodiment, a server is adapted to receive usage data from a plurality of application instances on a plurality of client computers and to generate global adjustments based on the received usage data. In one embodiment, the system has a merge feature adapted to blend and resolve conflicts between local and global adjustments to generate an interface adjustment for the user interface.
摘要:
Continuous attributes are used as input attributes in decision tree creation. Buckets are created by dividing the range of values for the continuous attribute into sub-ranges of equal extent. These buckets form initial partitions. Mergers of adjacent partitions are considered to determine score gains from such mergers, and the most useful mergers occur. The resulting partitions are used as the discretization of the continuous attribute for use as an input attribute.