Abstract:
A method and system of resource allocation for execution of a job are provided. The method includes receiving feedback (134) regarding the execution of previously submitted jobs on one or more resource nodes (101-104), and estimating the resources required for execution of a submitted job based on the feedback (134) and the parameters of the job. The job is allocated to one resource node, or to a plurality of resource nodes in parallel, having the estimated resources. The feedback may be implicit feedback indicating the success or failure of the execution of a job. The one or more resource nodes (101-104) allocated for execution of a job may have less than a user requested resource allocation for the job.
Abstract:
A method for generating a storage policy includes: receiving a storage system target function; and generating, by a machine learning entity, the storage policy in response to: (a) a set of file-related storage operation requests, (b) a state of the storage system before responding to the set of file-related storage operation requests, and (c) the storage system target function. A method for evaluating a storage policy includes: simulating an application of the storage policy by the storage system during a first period, in response to a set of file-related storage operation requests that was provided to the storage system during the first period, to provide a simulation result; wherein the first period starts before the simulating.
Abstract:
A method and system are provided for detection of authors across different types of information sources, such as across documents on the Web. The method includes obtaining a compression signature (303) for a document, and determining the similarity (304) between compression signatures of two or more documents. If the similarity is greater than a threshold measure (305), the two or more documents are considered to be by the same author. Scored pairs of documents are clustered (308) to provide a group of documents by the same author. The group of documents by the same author can be used for user profiling, noise reduction, contribution sizing, detecting fraudulent contributions, obtaining other search results by the same author, or matching a document with undisclosed authorship to a document of known authorship.
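The abstract does not say which compressor or similarity measure is used. As a rough sketch only, the thresholding step could be illustrated with zlib and the normalized compression distance (NCD), both of which are assumptions swapped in here for illustration; the function names and the threshold value are hypothetical.

```python
import zlib


def compressed_size(data: bytes) -> int:
    """Size in bytes of the zlib-compressed input (stand-in for a compression signature)."""
    return len(zlib.compress(data, 9))


def ncd(a: bytes, b: bytes) -> float:
    """Normalized compression distance: near 0 for similar inputs, near 1 for unrelated ones."""
    size_a, size_b = compressed_size(a), compressed_size(b)
    size_ab = compressed_size(a + b)
    return (size_ab - min(size_a, size_b)) / max(size_a, size_b)


def similarity(a: bytes, b: bytes) -> float:
    """Similarity score derived from NCD (assumed measure, not taken from the abstract)."""
    return 1.0 - ncd(a, b)


def same_author(a: bytes, b: bytes, threshold: float = 0.5) -> bool:
    """Treat two documents as same-author when similarity exceeds the threshold."""
    return similarity(a, b) > threshold
```

Pairs scored this way could then be fed to any clustering step; the choice of threshold would need tuning on real data.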
Abstract:
A computer-implemented calendar verification system including a computer configured to operate a calendar application, and a consistency checker operable by the computer, where the calendar application is operative to record information regarding a scheduled event, and where the consistency checker is operative to determine the consistency of a sender time-to-event, received in a verification message for the event, with a recipient time-to-event calculated by the consistency checker using the scheduled event information.
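A minimal sketch of the consistency check, assuming the sender's time-to-event arrives as a duration and that a small tolerance absorbs message delay. The function names and the five-minute tolerance are hypothetical and do not come from the abstract.

```python
from datetime import datetime, timedelta, timezone


def recipient_time_to_event(event_start: datetime, now: datetime) -> timedelta:
    """Time remaining until the event according to the recipient's calendar entry."""
    return event_start - now


def is_consistent(
    sender_time_to_event: timedelta,
    event_start: datetime,
    now: datetime,
    tolerance: timedelta = timedelta(minutes=5),  # assumed allowance for delivery delay
) -> bool:
    """Compare the sender's time-to-event with the one computed locally."""
    local = recipient_time_to_event(event_start, now)
    return abs(sender_time_to_event - local) <= tolerance


# Example: the recipient's calendar holds the event at 15:00 UTC and it is now 12:00 UTC.
start = datetime(2024, 1, 1, 15, 0, tzinfo=timezone.utc)
now = datetime(2024, 1, 1, 12, 0, tzinfo=timezone.utc)
matches = is_consistent(timedelta(hours=3), start, now)
off_by_an_hour = is_consistent(timedelta(hours=4), start, now)
```

Comparing durations rather than wall-clock times means the check does not depend on either party's time zone, so a one-hour mismatch such as a time-zone conversion error shows up as an inconsistency.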
Abstract:
A novel and useful method for enabling system logs to be effectively and efficiently monitored by ranking the system log messages by their estimated value to administrators and generating a log view that displays the most important messages. The ranking process uses a dataset of system logs from many computer systems to score messages. For better scoring, unsupervised clustering is used to identify sets of systems that behave similarly. The expected distribution of messages in a given system is estimated using the resulting clusters, and log messages are scored using this estimation.
Abstract:
The present disclosure relates to web searching, and more particularly to using a user's information, such as the user's geographic location, and social connections, such as the geographic location of one or more friends of the user, in web searching, e.g., in filtering and/or ranking search results, identifying relevant queries, and/or identifying content, such as news items, for presentation on a web page presented to the user.
Abstract:
A system and method including a simulator operating in conjunction with a search engine, for improving document and site findability. Users input their content (pages or sites), and the simulator analyzes the site in terms of structure and content. It then gives the user a ranked list of suggestions about how the user might improve the site's findability. The user can then apply some or all of these suggestions, or any other changes, by virtually modifying the site, and immediately receive feedback both on how the pages look and on the degree of findability improvement. The interactive process allows users to simulate modifications to their site structure and content in order to improve its findability. When the user completes the modifications and is satisfied with the new findability of the site, the user can then replace the current site in the repository with the modified one.
Abstract:
Systems and methods for generating a storage policy for a storage system are provided. The method comprises receiving a target function applicable to a storage system having one or more data storage mediums, wherein the target function represents values for storage parameters associated with productivity or loss tolerance in the storage system; implementing one or more simulation rules according to the received target function; generating one or more storage operation requests to access data on said one or more data storage mediums based on said one or more simulation rules; submitting said one or more storage operation requests to the storage system for processing; analyzing simulation results obtained for the storage system, in response to the storage system processing said one or more storage operation requests; and generating one or more storage policies, by a machine learning entity, in response to analyzing the simulation results.
Abstract:
A novel and useful mechanism enabling a standard learning algorithm to generate rules for complex event processing (CEP) systems. The method creates rules that infer previously defined output events by creating input event feature vectors for each targeted output event. In addition, a method for automatically generating CEP system rules to infer output events which are anomalies (i.e. statistical outliers) of input event sequences is disclosed. Input feature vectors consisting of multiple input events and parameters for each targeted output event are then input into a standard learning algorithm to generate CEP system rules.
Abstract:
A method and system are provided for maintaining profiles of information channels available on the Web, wherein the information channels are accessed via pull-only protocols. The method includes monitoring one or more channels by a channel pull action at a monitoring rate, wherein the monitoring rate is determined for the one or more channels based on the number of update events in a previous time period. The method may optionally include filtering the update events in the time period by a novelty measure, wherein the filtering disregards events that do not include significant novel information. The monitoring rate is adapted based on reinforcement learning applying iterative learning rules over time.
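The abstract does not give the learning rule itself. As an illustrative sketch only, one simple iterative rule moves the monitoring rate a fraction of the way toward the number of novel update events seen in the previous period. The update formula, the novelty threshold, the rate bounds, and all names below are assumptions.

```python
from typing import Iterable


def count_novel_events(novelty_scores: Iterable[float], threshold: float = 0.3) -> int:
    """Count update events whose novelty score reaches the (assumed) threshold."""
    return sum(1 for score in novelty_scores if score >= threshold)


def update_rate(
    rate: float,
    novel_events: int,
    learning_rate: float = 0.5,
    min_rate: float = 0.1,
    max_rate: float = 60.0,
) -> float:
    """Move the pull rate toward the observed number of novel events per period.

    rate and novel_events are both expressed per time period; the result is
    clamped so a quiet channel is still polled occasionally.
    """
    new_rate = rate + learning_rate * (novel_events - rate)
    return max(min_rate, min(max_rate, new_rate))


# Example: a channel polled 4 times per period produced five updates, three of them novel.
scores = [0.9, 0.05, 0.4, 0.1, 0.7]
novel = count_novel_events(scores)
next_rate = update_rate(4.0, novel)
```

With these settings a busy channel is polled more often over successive periods and a quiet one decays toward the minimum rate, which is the qualitative behavior the abstract describes.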