MULTI-LINGUAL WORD HYPHENATION USING INDUCTIVE MACHINE LEARNING ON TRAINING DATA
    1.
    发明申请
    MULTI-LINGUAL WORD HYPHENATION USING INDUCTIVE MACHINE LEARNING ON TRAINING DATA 有权
    使用感应机器学习培训数据的多语言词汇

    公开(公告)号:US20090182550A1

    公开(公告)日:2009-07-16

    申请号:US12015489

    申请日:2008-01-16

    IPC分类号: G06F17/28

    CPC分类号: G06F17/26

    摘要: Tools and techniques are described for providing multi-lingual word hyphenation using inductive machine learning on training data. Methods provided by these techniques may receive training data that includes hyphenated words, and may inductively generate hyphenation patterns that represent substrings of these words. The hyphenation patterns may include the substrings and hyphenation codes associated with characters occurring in the substrings. The methods may receive induction parameters applicable to generating the hyphenation patterns, and may store the hyphenation patterns into a language-specific lexicon file. These methods may also receive requests to hyphenate input words that occur in a human language, and may evaluate how to process the request based on the language. The methods may search for hyphenation patterns occurring in the input words, with the hyphenation patterns being stored in the lexicon file. Finally, the methods may respond to the request, indicating whether the hyphenation patterns occurred in the input words.

    摘要翻译: 描述了使用感应机器学习训练数据来提供多语言单词连字的工具和技术。 通过这些技术提供的方法可以接收包括连字字的训练数据,并且可以感应地生成表示这些单词的子串的连字符模式。 连字符模式可以包括与在子字符串中出现的字符相关联的子串和连字符代码。 这些方法可以接收适用于生成连字符模式的归纳参数,并且可以将连字符模式存储到语言特定的词典文件中。 这些方法也可以接收对以人类语言进行连字的输入单词的请求,并且可以评估如何基于该语言来处理该请求。 这些方法可以搜索在输入单词中出现的连字符模式,连字模式存储在词典文件中。 最后,这些方法可以响应请求,指示输入单词中是否发生连字符模式。

    Multi-lingual word hyphenation using inductive machine learning on training data
    2.
    发明授权
    Multi-lingual word hyphenation using inductive machine learning on training data 有权
    使用感应机器学习训练数据的多语言单词连字

    公开(公告)号:US08996994B2

    公开(公告)日:2015-03-31

    申请号:US12015489

    申请日:2008-01-16

    IPC分类号: G06F17/20 G06F17/26

    CPC分类号: G06F17/26

    摘要: Tools and techniques are described for providing multi-lingual word hyphenation using inductive machine learning on training data. Methods provided by these techniques may receive training data that includes hyphenated words, and may inductively generate hyphenation patterns that represent substrings of these words. The hyphenation patterns may include the substrings and hyphenation codes associated with characters occurring in the substrings. The methods may receive induction parameters applicable to generating the hyphenation patterns, and may store the hyphenation patterns into a language-specific lexicon file. These methods may also receive requests to hyphenate input words that occur in a human language, and may evaluate how to process the request based on the language. The methods may search for hyphenation patterns occurring in the input words, with the hyphenation patterns being stored in the lexicon file. Finally, the methods may respond to the request, indicating whether the hyphenation patterns occurred in the input words.

    摘要翻译: 描述了使用感应机器学习训练数据来提供多语言单词连字的工具和技术。 通过这些技术提供的方法可以接收包括连字字的训练数据,并且可以感应地生成表示这些单词的子串的连字符模式。 连字符模式可以包括与在子字符串中出现的字符相关联的子串和连字符代码。 该方法可以接收适用于生成连字符模式的归纳参数,并且可以将连字符模式存储到语言特定的词典文件中。 这些方法也可以接收对以人类语言进行连字的输入单词的请求,并且可以评估如何基于该语言来处理该请求。 这些方法可以搜索在输入单词中出现的连字符模式,连字模式存储在词典文件中。 最后,这些方法可以响应请求,指示输入单词中是否发生连字符模式。

    Adaptive learning framework for data correction
    3.
    发明授权
    Adaptive learning framework for data correction 有权
    用于数据校正的自适应学习框架

    公开(公告)号:US08090669B2

    公开(公告)日:2012-01-03

    申请号:US12115551

    申请日:2008-05-06

    IPC分类号: G06F15/18

    CPC分类号: G06N99/005

    摘要: Architecture that employs adaptive learning algorithms to adapt a data correction tool to user-specific behavior during runtime. The architecture includes a framework for training and measuring adaptive learning algorithms, adapting the current text correction tool codebase, and one or more different adaptive learning algorithms. This enables a text correction system to adapt the behavior of the text correction system to an individual user based on the user's interaction with the data correction system. This also facilitates the testing and improvements in an adaptive learning algorithm at the vendor before shipping in a product to the end-user. This reduces the risk of shipping a feature the precise behavior of which is different for each user.

    摘要翻译: 采用自适应学习算法在运行时将数据校正工具适应用户特定行为的架构。 该架构包括用于训练和测量自适应学习算法的框架,适应当前文本校正工具代码库以及一种或多种不同的自适应学习算法。 这使得文本校正系统可以基于用户与数据校正系统的交互来将文本校正系统的行为适应于单个用户。 这也有助于在产品运送到最终用户之前在供应商处进行自适应学习算法的测试和改进。 这降低了运送功能的风险,每个用户的精确行为不同。

    ADAPTIVE LEARNING FRAMEWORK FOR DATA CORRECTION
    4.
    发明申请
    ADAPTIVE LEARNING FRAMEWORK FOR DATA CORRECTION 有权
    用于数据校正的自适应学习框架

    公开(公告)号:US20090281972A1

    公开(公告)日:2009-11-12

    申请号:US12115551

    申请日:2008-05-06

    IPC分类号: G06F15/18

    CPC分类号: G06N99/005

    摘要: Architecture that employs adaptive learning algorithms to adapt a data correction tool to user-specific behavior during runtime. The architecture includes a framework for training and measuring adaptive learning algorithms, adapting the current text correction tool codebase, and one or more different adaptive learning algorithms. This enables a text correction system to adapt the behavior of the text correction system to an individual user based on the user's interaction with the data correction system. This also facilitates the testing and improvements in an adaptive learning algorithm at the vendor before shipping in a product to the end-user. This reduces the risk of shipping a feature the precise behavior of which is different for each user.

    摘要翻译: 采用自适应学习算法在运行时将数据校正工具适应用户特定行为的架构。 该架构包括用于训练和测量自适应学习算法的框架,适应当前文本校正工具代码库以及一种或多种不同的自适应学习算法。 这使得文本校正系统可以基于用户与数据校正系统的交互来将文本校正系统的行为适应于单个用户。 这也有助于在产品运送到最终用户之前在供应商处进行自适应学习算法的测试和改进。 这降低了运送功能的风险,每个用户的精确行为不同。

    Predictively suggesting websites
    5.
    发明授权
    Predictively suggesting websites 有权
    预测网站

    公开(公告)号:US08600968B2

    公开(公告)日:2013-12-03

    申请号:US13089996

    申请日:2011-04-19

    IPC分类号: G06F17/30 G06F7/00

    CPC分类号: G06F17/30876

    摘要: Computer-readable media, computer systems, and computing methods are provided for recommending websites that are relevant to a current website to which a user has navigated. A search engine is used to track a set of websites the user has visited immediately prior to the current website, while predictive model(s) are used to generate a sequence of websites that include the current website and the tracked websites. The sequence is compared against strings of websites within a browser-history log to identify matching strings, where the matching strings include the sequence and a respective candidate website. A probability of relevance is computed from a frequency that each of the matching strings has been visited within a predefined time frame. The probability of relevance for each of the matching strings is ranked against one another to distill the highest-ranked matching strings, which are parsed to extract and present the candidate websites included therein.

    摘要翻译: 提供计算机可读介质,计算机系统和计算方法来推荐与用户已经浏览的当前网站相关的网站。 搜索引擎用于跟踪用户在当前网站之前立即访问的一组网站,而使用预测模型来生成包括当前网站和跟踪网站的网站序列。 该序列与浏览器历史日志中的网站字符串进行比较,以识别匹配的字符串,其中匹配的字符串包括序列和相应的候选网站。 从在预定时间帧内已经访问了每个匹配串的频率计算相关概率。 每个匹配字符串的相关概率相互排序,以便蒸馏最高排名的匹配字符串,其被解析以提取和呈现其中包括的候选网站。

    TECHNIQUES FOR DATA AGGREGATION, ANALYSIS, AND DISTRIBUTION
    8.
    发明申请
    TECHNIQUES FOR DATA AGGREGATION, ANALYSIS, AND DISTRIBUTION 审中-公开
    数据聚合,分析和分配的技术

    公开(公告)号:US20100185631A1

    公开(公告)日:2010-07-22

    申请号:US12355806

    申请日:2009-01-19

    IPC分类号: G06F17/30

    摘要: Various technologies and techniques are disclosed for aggregating and using data collected from multiple computers to modify a later behavior of those computers. In one implementation, a data aggregation system is described. A data collector is operable to collect behavior data over a network from one or more applications used by the computers, and to save the behavior data to a data store. A data installer is operable to access the behavior data in the data store and convert the behavior data into a format that will modify a future operation of at least one of the applications that is used on at least one of the computers. A method for creating and distributing a custom dictionary from data collected from multiple computers is described. A method for identifying related documents from data collected from multiple computers is also described.

    摘要翻译: 公开了各种技术和技术,用于聚合和使用从多台计算机收集的数据来修改这些计算机的后续行为。 在一个实现中,描述了数据聚合系统。 数据收集器可操作以通过网络从计算机使用的一个或多个应用收集行为数据,并将行为数据保存到数据存储。 数据安装器可操作以访问数据存储中的行为数据,并将行为数据转换成将修改在至少一个计算机上使用的至少一个应用的将来操作的格式。 描述了从多台计算机收集的数据创建和分发自定义词典的方法。 还描述了从从多台计算机收集的数据中识别相关文档的方法。

    Method for processing surfaces of aluminium alloy sheets and strips
    9.
    发明申请
    Method for processing surfaces of aluminium alloy sheets and strips 审中-公开
    铝合金板材和带材表面处理方法

    公开(公告)号:US20070026254A1

    公开(公告)日:2007-02-01

    申请号:US10558749

    申请日:2004-06-09

    IPC分类号: B32B15/10 B32B15/01 C08F2/46

    CPC分类号: C23C22/78 Y10T428/12743

    摘要: A method for processing the surface of a strip, sheet or a shaped part made of an aluminum alloy which involves the preparation of a surface with the aid of an atmospheric pressure plasma and by a chemical conversion treatment using at least the elements Si, Ti, Zr, Ce, Co, Mn, Mo and V, for producing a conversion coating on the strip, sheet or part. The process is more rapid and less costly than previous conversion treatments and is applied, in particular, for strips and sheets which are used for a car body and assembled by welding or gluing.

    摘要翻译: 一种用于处理由铝合金制成的带材,片材或成形部件的表面的方法,其涉及借助于大气压等离子体制备表面,并且通过至少使用元素Si,Ti, Zr,Ce,Co,Mn,Mo和V,用于在带材,片材或部分上制备转化涂层。 该方法比以前的转换处理更快速且更便宜,并且特别适用于用于车身并通过焊接或胶合组装的条带和片材。