发明申请
- 专利标题: SHUFFLING-TYPE GRADIENT METHOD FOR TRAINING MACHINE LEARNING MODELS WITH BIG DATA
-
申请号: US17109112申请日: 2020-12-01
-
公开(公告)号: US20220171996A1公开(公告)日: 2022-06-02
- 发明人: Lam Minh Nguyen , Dung Tien Phan
- 申请人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 申请人地址: US NY Armonk
- 专利权人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 当前专利权人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 当前专利权人地址: US NY Armonk
- 主分类号: G06K9/62
- IPC分类号: G06K9/62 ; G06N3/02 ; G06F17/16
摘要:
A computer-implemented method for a shuffling-type gradient for training a machine learning model using a stochastic gradient descent (SGD) includes the operations of uniformly randomly distributing data samples or coordinate updates of a training data, and calculating the learning rates for a no-shuffling scheme and a shuffling scheme. A combined operation of the no-shuffling scheme and the shuffling scheme of the training data is performed using a stochastic gradient descent (SGD) algorithm. The combined operation is switched to performing only the shuffling scheme from the no-shuffling scheme based on one or more predetermined criterion; and training the machine learning models with the training data based on the combined no-shuffling scheme and shuffling scheme.
公开/授权文献
信息查询