摘要:
A method and system of classifying documents is provided. The method includes receiving a stream of documents from at least one user wherein each document includes a topic of information relating to a customer support issue or sentiment. The method includes classifying each of the received documents using a plurality of trained classifiers, the classification based on a voting by the trained classifiers, each document labeled according to a similar topic. A drift of the topic of one or more of the classifications is determined wherein the drift is related to the received documents that include information relating to an unclassified customer support issue or sentiment. If the determined drift exceeds a predetermined threshold range, rebuilding the plurality of classifiers to include a second set of classifiers trained to recognize the unclassified customer support issue or sentiment.
摘要:
Methods and systems for constructing a taxonomy based on hierarchical clustering are provided. The taxonomy is generated by first constructing a hierarchy of clusters using a clustering algorithm. A first level of the hierarchy of clusters is generated by providing a plurality of content files to a clustering algorithm. Subsequent levels of the hierarchy are generated by providing the clusters of the preceding levels to the clustering algorithm. Labels that characterize each cluster within the hierarchy are assigned to corresponding clusters. Labels and clusters are combined to form the taxonomy.
摘要:
Disclosed herein is a framework for pushing service notifications to users via selected channels that are chosen based on user history. In accordance with one aspect, the framework determines if a notification is to be sent to a particular user. The framework may further determine if the notification requires an action. If the notification requires no action, the framework may choose, from multiple channels, a channel with lowest receive time duration. The framework may then send the notification via the chosen channel.
摘要:
Disclosed herein is a framework for generating and providing self-distinguishable identifiers as to users. In accordance with one aspect, an entry is retrieved from an object, wherein the entry includes one or more fields. The one or more fields may be concatenated to create a concatenated string. The framework may then determine if the concatenated string is unique from other concatenated strings in a listing of the object. If the concatenated string is determined to be not unique, a unique sequence identifier may be added to the concatenated string.
摘要:
Methods and systems for use in labeling documents within a cluster are provided. One example method includes assembling a set of documents including a first plurality of previously clustered documents and a second plurality of documents. Each of the first plurality of previously clustered documents has at least one label identifying a topic to which content of the document relates. The method includes partitioning documents from the set of documents into multiple clusters, determining if a dominant topic exists within one of the multiple clusters, determining a metric value for one of the multiple clusters based on the number of documents within the one of the multiple clusters having a label identifying the determined dominant topic, and labeling at least documents from the second plurality of documents within the one of the multiple clusters with the label identifying the dominant topic when the metric value exceeds a predetermined threshold.
摘要:
Described herein is a technology to facilitate designing or arranging a layout of a page. In one implementation, a tile-based page having one or more tiles forming a page layout is provided. The tile-based page includes a focus tile. In response to a user event to indicate a tile change in the page layout, a tile adjacent in the layout is determined. The tile adjacent is the minimum area to be shifted to effect the tile change. The tile is absorbed into an empty area and the changed layout is analyzed to determine if the tile-based page is a convex page.
摘要:
Among other disclosed subject matter, a computer-implemented method includes receiving a plurality of electronic documents associated with a domain at a server. Each of the plurality of electronic documents includes meta-data and textual content. The method includes identifying one or more text strings in the textual content that are to be processed differently than an identical or similar text string in other electronic documents, and associating, with the electronic document, data indicating that each of the identified text strings is to be processed differently than an identical or similar text string in other electronic documents. The method also includes performing an analysis of the electronic documents to identify one or more subsets of the electronic documents that include related subject matter. A plurality of degrees of relatedness can be associated with text strings associated with data indicating that each of the text strings is to be processed differently.
摘要:
A technique for providing information about defects in a mask pattern is described. In this technique, defects in the mask pattern may be determined based on differences between a calculated pattern produced at an image plane in the photolithographic process, when the mask pattern, illuminated by an associated source pattern, is at an object plane in the photolithographic process, and a target pattern that excludes the defects. Then the defect information may be provided to the user, such as a spatial map of the determined defects, where the spatial map is associated with at least the portion of the mask pattern.
摘要:
A method of forming a midsole includes the steps of placing a first mold member and a second mold member in contact with one another, injecting a first material into the first mold member to form first and second portions of a midsole preform, positioning the first and second portions in a third mold member of a second mold assembly; placing a second material in the third mold member; placing the third mold member and a fourth mold member in contact with one another, subjecting the second material and the first and second portions to heat to form a midsole preform, placing the midsole preform in a fifth mold member, placing the fifth mold member and a sixth mold member in contact with one another; and subjecting the midsole preform to heat and pressure to form a midsole.
摘要:
An extraction process for a non-aqueous hydroformylation product composition to separate an aldehyde product and to recover a hydroformylation catalyst. The process involves mixing a non-aqueous hydroformylation product composition containing a mixture of formyl-substituted fatty acid triglyceride esters derived from hydroformylating a seed oil, a transition metal-organophosphine ligand wherein the organophosphine is ionically-charged, optionally free ionically-charged organophosphine ligand, and a polar organic solubilizing agent with water and an extraction solvent having low water solubility to recover an organic phase containing the mixture of formyl-substituted fatty acid triglycerides and the low solubility extraction solvent and an aqueous phase containing the transition metal-organophosphine ligand, optional free ligand, the organic solubilizing agent, and water. Optionally, the low solubility extraction solvent can be prepared in situ in the hydroformylation step.