-
公开(公告)号:US08521516B2
公开(公告)日:2013-08-27
申请号:US12411224
申请日:2009-03-25
IPC分类号: G06F17/21
CPC分类号: G06F17/27
摘要: Systems, methods, and apparatuses including computer program products are provided for training machine learning systems. In some implementations, a method is provided. The method includes receiving a collection of phrases, normalizing a plurality of phrases of the collection of phrases, the normalizing being based at least in part on lexicographic normalizing rules, and generating a normalized phrase table including a plurality of key-value pairs, each key value pair includes a key corresponding to a normalized phrase and a value corresponding to one or more un-normalized phrases associated with the normalized key, each un-normalized phrase having one or more parameters.
摘要翻译: 提供包括计算机程序产品在内的系统,方法和设备用于训练机器学习系统。 在一些实现中,提供了一种方法。 该方法包括接收短语集合,归一化短语集合中的多个短语,归一化至少部分地基于词典标准化规则,以及生成包括多个键值对的标准化短语表,每个键 值对包括对应于归一化短语的键和对应于与归一化键相关联的一个或多个非标准化短语的值,每个非正规化短语具有一个或多个参数。
-
公开(公告)号:US20130151235A1
公开(公告)日:2013-06-13
申请号:US12411224
申请日:2009-03-25
IPC分类号: G06F17/27
CPC分类号: G06F17/27
摘要: Systems, methods, and apparatuses including computer program products are provided for training machine learning systems. In some implementations, a method is provided. The method includes receiving a collection of phrases, normalizing a plurality of phrases of the collection of phrases, the normalizing being based at least in part on lexicographic normalizing rules, and generating a normalized phrase table including a plurality of key-value pairs, each key value pair includes a key corresponding to a normalized phrase and a value corresponding to one or more un-normalized phrases associated with the normalized key, each un-normalized phrase having one or more parameters.
摘要翻译: 提供包括计算机程序产品在内的系统,方法和设备用于训练机器学习系统。 在一些实现中,提供了一种方法。 该方法包括接收短语集合,归一化短语集合中的多个短语,归一化至少部分地基于词典标准化规则,以及生成包括多个键值对的标准化短语表,每个键 值对包括对应于归一化短语的键和对应于与归一化键相关联的一个或多个非标准化短语的值,每个非正规化短语具有一个或多个参数。
-
公开(公告)号:US08027938B1
公开(公告)日:2011-09-27
申请号:US12055967
申请日:2008-03-26
申请人: Peng Xu , Ioannis Tsochandaridis
发明人: Peng Xu , Ioannis Tsochandaridis
IPC分类号: G06F15/18
CPC分类号: G06N99/005
摘要: Systems, methods, and apparatuses including computer program products for machine learning are provided. A method is provided that includes distributing a parameterized model to each worker of a hierarchy of workers, the parameterized model including a plurality of feature functions and corresponding model parameters, processing a portion of training data at each worker of the plurality of workers according to the parameterized model to calculate updates to model parameters, for each worker at a lowest level of the hierarchy of workers, sending the calculated updates to a next higher level worker, for each other worker in the hierarchy of workers, combining updates of the respective worker with updates received from one or more lower level workers, collecting all updates from the workers at a master to generate real updates to the model parameters, and generating an updated model using the real updates to the model parameters.
摘要翻译: 提供了包括用于机器学习的计算机程序产品的系统,方法和装置。 提供了一种方法,其包括将参数化模型分发给工作人员层级的每个工人,所述参数化模型包括多个特征功能和对应的模型参数,根据所述参数化模型处理所述多个工人的每个工人的一部分训练数据 参数化模型来计算模型参数的更新,对于工人层次结构的最低级别的每个工作者,将计算的更新发送给下一个更高级别的工作者,对于工人层次结构中的每个其他工作者,将相应工作者的更新与 从一个或多个较低级别的工作人员收到的更新,收集主人员的所有更新,以生成对模型参数的真实更新,以及使用模型参数的真实更新生成更新的模型。
-
-