Abstract:
Some examples include generating a personal language model based on linguistic characteristics of one or more files stored at one or more locations in a file system. Further, some implementations include predicting and presenting a non-Latin character string based at least in part on the personal language model, such as in response to receiving a Latin character string via an input method editor interface.
Abstract:
Scale-invariant features are extracted from an image. The features are projected to a lower dimensional random projection matrix by multiplying the features by a matrix of random entries. The matrix of random projections is quantized to produce a matrix of quantization indices, which form a query vector for searching a database of images to retrieve metadata related to the image.
Abstract:
A substrate capable of producing 3D pattern having colored dazzle light effect and a method of producing pattern and fabricating finished product are provided, in which plastic foaming sheet is selected as base material and reflection film having its surface electroplated is overlaid thereon. Then, embossing process is conducted to form concave-convex veins, and finally printing is conducted to form a print-layer thereon, so that the surface of the substrate possesses a 3D pattern having colored dazzle light effect. This substrate can be utilized in products as container, tray, eyeglass frame or watchstrap.
Abstract:
Method and system of processing cross-domain cookies in order to allow a first website to access a cookie of a second website are provided. In one aspect, a method includes: providing a flash cookie of a first website in a user's local computer; reading an ordinary cookie of a second website that is stored in the user's local computer; and writing the ordinary cookie of the second website into the flash cookie of the first website. Based on this method, it is achievable to access and store cookies across domains in the user's local computer. Accordingly, the method enables e-commerce websites to have a more comprehensive collection of user information to provide more reliable references for the e-commerce websites to analyze user information.
Abstract:
The present invention relates to a corpus for use in training a language model. The corpus includes a plurality of characters and a plurality of morphological tags associated with a plurality of sequences of characters. The plurality of morphological tags indicate a morphological type of an associated sequence of characters and a combination of parts forming a morphological subtype.
Abstract:
A substrate having animated flashing figures, applications thereof, and a manufacturing method of the same are revealed. A reflective film with an electroplated layer on a surface is covered over the base material. Then at least two figures are printed on the reflective film to form a printed layer. The figures are cut into strips and then the strips are arranged alternatively. Finally, a surface grating layer is arranged over the printed layer. Through the combination of the reflective film, the printed layer formed by staggered strips of the figures, and the surface grating layer, figures on a surface of the substrate show an animated flashing effect. The substrate can be applied to various products such as containers, trays, eyeglass frames, watch bands, etc.
Abstract:
A distributional similarity between a word of a search query and a term of a candidate word sequences is used to determine an error model probability that describes the probability of the search query given the candidate word sequence. The error model probability is used to determine a probability of the candidate word sequence given the search query. The probability of the candidate word sequence given the search query is used to select a candidate word sequence as a corrected word sequence for the search query. Distributional similarity is also used to build features that are applied in maximum entropy model to compute the probability of the candidate word sequence given the search query.
Abstract:
An ensemble of random feature clusters is built from training data using a clustering algorithm where some randomness has been introduced. For each clustered feature space, a classifier, such as a Naïve Bayesian Classifier, is trained, realizing a classifier ensemble. The final classification decision is made by the resulting classifier ensemble.
Abstract:
A method of post-processing character data from an optical character recognition (OCR) engine and apparatus to perform the method. This exemplary method includes segmenting the character data into a set of initial words. The set of initial words is word level processed to determine at least one candidate word corresponding to each initial word. The set of initial words is segmented into a set of sentences. Each sentence in the set of sentences includes a plurality of initial words and candidate words corresponding to the initial words. A sentence is selected from the set of sentences. The selected sentence is word disambiguity processed to determine a plurality of final words. A final word is selected from the at least one candidate word corresponding to a matching initial word. The plurality of final words is then assembled as post-processed OCR data.
Abstract:
Disclosed is a method for displaying an advertisement. The method displays a present advertisement, determines whether the present advertisement has been displayed completely, and adds an identifier of the present advertisement to a priority advertisement list if the present advertisement has not been displayed completely. The method sends the priority advertisement list to the advertisement engine when requesting the advertisement engine for displaying a next advertisement. Using the priority advertisement list, the advertisement engine may give priority to the present advertisement in next advertisement assignment. Using an optimized advertisement display strategy, the disclosed method may increase coverage rates of advertisement contents to audiences, thereby improving advertisement effectiveness for advertisers and increasing cash flow return for website owners.