Multistage learner for efficiently boosting large datasets
    2.
    发明授权
    Multistage learner for efficiently boosting large datasets 有权
    用于有效提升大型数据集的多级学习者

    公开(公告)号:US09418343B2

    公开(公告)日:2016-08-16

    申请号:US14142977

    申请日:2013-12-30

    Applicant: Google Inc.

    CPC classification number: G06N99/005

    Abstract: Implementations of the disclosed subject matter provide methods and systems for using a multistage learner for efficiently boosting large datasets in a machine learning system. A method may include obtaining a first plurality of examples for a machine learning system and selecting a first point in time. Next, a second point in time occurring subsequent to the first point in time may be selected. The machine learning system may be trained using m of the first plurality of examples. Each of the m examples may include a feature initially occurring after the second point in time. In addition, the machine learning system may be trained using n of the first plurality of examples, and each of the n examples may include a feature initially occurring after the first point in time.

    Abstract translation: 所公开的主题的实现提供了使用多级学习者有效地提升机器学习系统中的大型数据集的方法和系统。 方法可以包括获得机器学习系统的第一多个示例并选择第一时间点。 接下来,可以选择在第一时间点之后发生的第二时间点。 可以使用第一多个示例的m来训练机器学习系统。 m个示例中的每一个可以包括最初在第二时间点之后发生的特征。 此外,可以使用第一多个示例中的n来训练机器学习系统,并且n个示例中的每个示例可以包括最初在第一时间点之后发生的特征。

    Multistage Learner for Efficiently Boosting Large Datasets
    3.
    发明申请
    Multistage Learner for Efficiently Boosting Large Datasets 有权
    多级学习者有效提升大型数据集

    公开(公告)号:US20150186795A1

    公开(公告)日:2015-07-02

    申请号:US14142977

    申请日:2013-12-30

    Applicant: Google Inc.

    CPC classification number: G06N99/005

    Abstract: Implementations of the disclosed subject matter provide methods and systems for using a multistage learner for efficiently boosting large datasets in a machine learning system. A method may include obtaining a first plurality of examples for a machine learning system and selecting a first point in time. Next, a second point in time occurring subsequent to the first point in time may be selected. The machine learning system may be trained using m of the first plurality of examples. Each of the m examples may include a feature initially occurring after the second point in time. In addition, the machine learning system may be trained using n of the first plurality of examples, and each of the n examples may include a feature initially occurring after the first point in time.

    Abstract translation: 所公开的主题的实现提供了使用多级学习者有效地提升机器学习系统中的大型数据集的方法和系统。 方法可以包括获得机器学习系统的第一多个示例并选择第一时间点。 接下来,可以选择在第一时间点之后发生的第二时间点。 可以使用第一多个示例的m来训练机器学习系统。 m个示例中的每一个可以包括最初在第二时间点之后发生的特征。 此外,可以使用第一多个示例中的n来训练机器学习系统,并且n个示例中的每个示例可以包括最初在第一时间点之后发生的特征。

    Efficient locking of large data collections
    4.
    发明授权
    Efficient locking of large data collections 有权
    高效锁定大型数据收集

    公开(公告)号:US09569481B1

    公开(公告)日:2017-02-14

    申请号:US14101611

    申请日:2013-12-10

    Applicant: Google Inc.

    CPC classification number: G06F17/30371

    Abstract: The present disclosure provides systems and techniques for efficient locking of datasets in a database when updates to a dataset may be delayed. A method may include accumulating a plurality of updates to a first set of one or more values associated with one or more features. The first set of one or more values may be stored within a first database column. Next, it may be determined that a first database column update aggregation rule is satisfied. A lock assigned to at least a portion of at least a first database column may be acquired. Accordingly, one or more values in the first set within the first database column may be updated based on the plurality of updates. In an implementation, the first set of one or more values may be associated with the first lock.

    Abstract translation: 本公开提供了用于在数据库的更新可能被延迟时有效地将数据集锁定在数据库中的系统和技术。 方法可以包括将多个更新累积到与一个或多个特征相关联的一个或多个值的第一组中。 第一组一个或多个值可以存储在第一数据库列中。 接下来,可以确定满足第一数据库列更新聚合规则。 可以获取分配给至少第一数据库列的至少一部分的锁。 因此,可以基于多个更新来更新第一数据库列中的第一集合中的一个或多个值。 在实现中,第一组一个或多个值可以与第一锁相关联。

    Using specialized workers to improve performance in machine learning
    5.
    发明授权
    Using specialized workers to improve performance in machine learning 有权
    使用专业人员提高机器学习的绩效

    公开(公告)号:US09269057B1

    公开(公告)日:2016-02-23

    申请号:US14102718

    申请日:2013-12-11

    Applicant: Google Inc.

    CPC classification number: G06N99/005 G06F9/5066

    Abstract: Systems and techniques are disclosed for generating weighted machine learned models using multi-shard combiners. A learner in a machine learning system may receive labeled positive and negative examples and workers within the learner may be configured to receive either positive or negative examples. A positive and negative statistic may be calculated for a given feature and may either be applied separately in a model or may be combined to generate an overall statistic.

    Abstract translation: 公开了用于使用多分片组合器生成加权机器学习模型的系统和技术。 机器学习系统中的学习者可能会收到标签的正面和负面实例,学习者中的工作人员可能被配置为接受正面或者负面的例子。 可以为给定特征计算正和负统计量,并且可以在模型中单独应用,或者可以组合以产生总体统计量。

Patent Agency Ranking