摘要:
Tools and techniques are described for providing multi-lingual word hyphenation using inductive machine learning on training data. Methods provided by these techniques may receive training data that includes hyphenated words, and may inductively generate hyphenation patterns that represent substrings of these words. The hyphenation patterns may include the substrings and hyphenation codes associated with characters occurring in the substrings. The methods may receive induction parameters applicable to generating the hyphenation patterns, and may store the hyphenation patterns into a language-specific lexicon file. These methods may also receive requests to hyphenate input words that occur in a human language, and may evaluate how to process the request based on the language. The methods may search for hyphenation patterns occurring in the input words, with the hyphenation patterns being stored in the lexicon file. Finally, the methods may respond to the request, indicating whether the hyphenation patterns occurred in the input words.
摘要:
Tools and techniques are described for providing multi-lingual word hyphenation using inductive machine learning on training data. Methods provided by these techniques may receive training data that includes hyphenated words, and may inductively generate hyphenation patterns that represent substrings of these words. The hyphenation patterns may include the substrings and hyphenation codes associated with characters occurring in the substrings. The methods may receive induction parameters applicable to generating the hyphenation patterns, and may store the hyphenation patterns into a language-specific lexicon file. These methods may also receive requests to hyphenate input words that occur in a human language, and may evaluate how to process the request based on the language. The methods may search for hyphenation patterns occurring in the input words, with the hyphenation patterns being stored in the lexicon file. Finally, the methods may respond to the request, indicating whether the hyphenation patterns occurred in the input words.
摘要:
Architecture that employs adaptive learning algorithms to adapt a data correction tool to user-specific behavior during runtime. The architecture includes a framework for training and measuring adaptive learning algorithms, adapting the current text correction tool codebase, and one or more different adaptive learning algorithms. This enables a text correction system to adapt the behavior of the text correction system to an individual user based on the user's interaction with the data correction system. This also facilitates the testing and improvements in an adaptive learning algorithm at the vendor before shipping in a product to the end-user. This reduces the risk of shipping a feature the precise behavior of which is different for each user.
摘要:
Architecture that employs adaptive learning algorithms to adapt a data correction tool to user-specific behavior during runtime. The architecture includes a framework for training and measuring adaptive learning algorithms, adapting the current text correction tool codebase, and one or more different adaptive learning algorithms. This enables a text correction system to adapt the behavior of the text correction system to an individual user based on the user's interaction with the data correction system. This also facilitates the testing and improvements in an adaptive learning algorithm at the vendor before shipping in a product to the end-user. This reduces the risk of shipping a feature the precise behavior of which is different for each user.
摘要:
Computer-readable media, computer systems, and computing methods are provided for recommending websites that are relevant to a current website to which a user has navigated. A search engine is used to track a set of websites the user has visited immediately prior to the current website, while predictive model(s) are used to generate a sequence of websites that include the current website and the tracked websites. The sequence is compared against strings of websites within a browser-history log to identify matching strings, where the matching strings include the sequence and a respective candidate website. A probability of relevance is computed from a frequency that each of the matching strings has been visited within a predefined time frame. The probability of relevance for each of the matching strings is ranked against one another to distill the highest-ranked matching strings, which are parsed to extract and present the candidate websites included therein.
摘要:
Auto body roof comprising at least one steel part and a skin part made of an aluminum alloy that is attached to the steel part before painting. The aluminum part comprises a sheet of the following composition: Si: 0.7-1.3, Fe
摘要翻译:汽车车身屋顶包括至少一个钢部件和由铝合金制成的表皮部件,其在涂漆之前附接到钢部件。 铝部分包括以下组成的片:Si:0.7-1.3,Fe <0.5,Cu:0.5-1.1,Mn:0.4-1.0,Mg:0.6-1.2,Zn <0.7,Cr <0.25,Zr + Ti <0.20,其他元素各<0.05,总计<0.15,其余为铝。 固化处理后的铝部分在室温下淬火和时效硬化三周,其屈服强度R <0.2>小于170MPa,优选为160MPa。
摘要:
The present invention relates to a method of manipulating a software application and processing data stored in a data source. The method includes receiving a natural language input and analyze the natural language input to identify semantic information contained therein. Portions of the natural language input are associated with command objects and entity objects of a schema based on the semantic information and the natural language input. The method also includes rendering data from the data source in a table of columns and rows based on the schema and the associated portions of the natural language input.
摘要:
Various technologies and techniques are disclosed for aggregating and using data collected from multiple computers to modify a later behavior of those computers. In one implementation, a data aggregation system is described. A data collector is operable to collect behavior data over a network from one or more applications used by the computers, and to save the behavior data to a data store. A data installer is operable to access the behavior data in the data store and convert the behavior data into a format that will modify a future operation of at least one of the applications that is used on at least one of the computers. A method for creating and distributing a custom dictionary from data collected from multiple computers is described. A method for identifying related documents from data collected from multiple computers is also described.
摘要:
A method for processing the surface of a strip, sheet or a shaped part made of an aluminum alloy which involves the preparation of a surface with the aid of an atmospheric pressure plasma and by a chemical conversion treatment using at least the elements Si, Ti, Zr, Ce, Co, Mn, Mo and V, for producing a conversion coating on the strip, sheet or part. The process is more rapid and less costly than previous conversion treatments and is applied, in particular, for strips and sheets which are used for a car body and assembled by welding or gluing.
摘要:
A framework for generating a semantic interpretation of natural language input includes an interpreter, a first set of types, and a second set of types. The interpreter is adapted to mediate between a client application and one or more analysis engines to produce interpretations of the natural language input that are valid for the client application. The first set of types is adapted to define interactions between the interpreter and the one or more analysis engines. The second set of types is adapted to define interactions between the interpreter and the client application.