-
公开(公告)号:US11257005B2
公开(公告)日:2022-02-22
申请号:US16119585
申请日:2018-08-31
Applicant: Alibaba Group Holding Limited
Inventor: Jun Zhou
Abstract: A training method and a training system for a machine learning system are provided. The method includes allocating training data to a plurality of working machines; dividing training data allocated by each working machine into a plurality of data pieces; obtaining a local weight and a local loss function value calculated by each working machine based on each data piece; aggregating the local weight and the local loss function value calculated by each work machine based on each data piece to obtain a current weight and a current loss function value; performing model abnormality detection using the current weight and/or the current loss function value; inputting a weight and a loss function value of a previous aggregation to the machine learning system for training in response to a result of the model abnormality detection being a first type of abnormality; and modifying the current weight and/or the current loss function value to a current weight and/or a current loss function value within a first threshold in response to the result of the model abnormality detection being a second type of abnormality, and inputting thereof to the machine learning system for training.
-
公开(公告)号:US10769383B2
公开(公告)日:2020-09-08
申请号:US16743224
申请日:2020-01-15
Applicant: ALIBABA GROUP HOLDING LIMITED
Inventor: Shaosheng Cao , Xinxing Yang , Jun Zhou , Xiaolong Li
Abstract: Embodiments of the present application disclose a cluster-based word vector processing method, apparatus, and device. Solutions are include: in a cluster having a server cluster and a worker computer cluster, in which each worker computer in the worker computer cluster separately reads some corpuses in parallel, extracts a word and context words of the word from the read corpuses, obtains corresponding word vectors from a server in the server cluster, and trains the corresponding word vectors, and the server cluster updates word vectors of same words that are stored before the training according to training results of one or more respective worker computers with respect to the word vectors of the same words.
-
公开(公告)号:US20200126143A1
公开(公告)日:2020-04-23
申请号:US16390057
申请日:2019-04-22
Applicant: Alibaba Group Holding Limited
Inventor: Chaochao Chen , Jun Zhou
Abstract: An item rating and recommendation platform identifies rating data comprising respective ratings of multiple items with respect to multiple users, identifies user-feature data comprising multiple user features contributing to the respective ratings of the multiple items with respect to the multiple users, and receives, from a social network platform via a secret sharing scheme with a trusted initializer, manipulated social network data computed based on social network data and first input data from the trusted initializer. The social network data indicate social relationships between any two of the multiple users. In the secret sharing scheme with the trusted initializer, the social network platform shares with the item rating and recommendation platform the manipulated social network data without disclosing the social network data. The item rating and recommendation platform updates the user-feature data based on the rating data and the manipulated social network data.
-
公开(公告)号:US20190272472A1
公开(公告)日:2019-09-05
申请号:US16290208
申请日:2019-03-01
Applicant: Alibaba Group Holding Limited
Inventor: Chaochao Chen , Jun Zhou
Abstract: A client device determines a local user gradient value based on a current user preference vector and a local item gradient value based on a current item feature vector. The client device updates a user preference vector by using the local user gradient value and updates an item feature vector by using the local item gradient value. The client device determines a neighboring client device based on a predetermined adjacency relationship. The local item gradient value is sent by the client device to the neighboring client device. The client device receives a neighboring item gradient value sent by the neighboring client device. The client device updates the item feature vector by using the neighboring item gradient value. In response to the client device determining that a predetermined iteration stop condition is satisfied, the client device outputs the user preference vector and the item feature vector.
-
公开(公告)号:US20200349416A1
公开(公告)日:2020-11-05
申请号:US16812105
申请日:2020-03-06
Applicant: Alibaba Group Holding Limited
Inventor: Xinxing Yang , Longfei Li , Jun Zhou
Abstract: Implementations of the present specification provide a method for determining a computer-executed ensemble model. The method includes: obtaining a current ensemble model and a plurality of untrained candidate submodels; integrating each of the plurality of candidate submodels into the current ensemble model to obtain a plurality of first candidate ensemble models; training at least the plurality of first candidate ensemble models to obtain a plurality of second candidate ensemble models after this training; performing performance evaluation on each of the plurality of second candidate ensemble models to obtain corresponding performance evaluation results; determining, based on the performance evaluation results, an optimal candidate ensemble model with optimal performance from the plurality of second candidate ensemble models; and updating the current ensemble model with the optimal candidate ensemble model if the performance of the optimal candidate ensemble model satisfies a predetermined condition.
-
公开(公告)号:US20200293924A1
公开(公告)日:2020-09-17
申请号:US16889695
申请日:2020-06-01
Applicant: Alibaba Group Holding Limited
Inventor: Wenjing FANG , Jun Zhou , Licui GAO
Abstract: Implementations of the present specification disclose methods, devices, and apparatuses for determining a feature interpretation of a predicted label value of a user generated by a GBDT model. In one aspect, the method includes separately obtaining, from each of a predetermined quantity of decision trees ranked among top decision trees, a leaf node and a score of the leaf node; determining a respective prediction path of each leaf node; obtaining, for each parent node on each prediction path, a split feature and a score of the parent node; determining, for each child node on each prediction path, a feature corresponding to the child node and a local increment of the feature on the child node; obtaining a collection of features respectively corresponding to the child nodes; and obtaining a respective measure of relevance between the feature corresponding to the at least one child node and the predicted label value.
-
公开(公告)号:US10748524B2
公开(公告)日:2020-08-18
申请号:US16774422
申请日:2020-01-28
Applicant: Alibaba Group Holding Limited
Inventor: Zhiming Wang , Jun Zhou , Xiaolong Li
Abstract: A speech wakeup method, apparatus, and electronic device are disclosed in embodiments of this specification. The method includes: inputting speech data to a speech wakeup model trained with general speech data; and outputting, by the speech wakeup model, a result for determining whether to execute speech wakeup, wherein the speech wakeup model includes a Deep Neural Network (DNN) and a Connectionist Temporal Classifier (CTC).
-
公开(公告)号:US20200125745A1
公开(公告)日:2020-04-23
申请号:US16390147
申请日:2019-04-22
Applicant: Alibaba Group Holding Limited
Inventor: Chaochao Chen , Jun Zhou
Abstract: An item rating and recommendation platform identifies rating data including respective ratings of multiple items with respect to multiple users; identifies user-feature data including user features contributing to the respective ratings of the multiple items with respect to the multiple users; and receives, from a social network platform via a secret sharing scheme without a trusted initializer, manipulated social network data computed based on social network data and a first number of random variables. The social network data indicate social relationships between any two of the number of users. In the secret sharing scheme without the trust initializer, the social network platform shares with the item rating and recommendation platform manipulated social network data without disclosing the social network data. The item rating and recommendation platform updates the user-feature data based on the rating data and the manipulated social network data.
-
公开(公告)号:US20180307774A1
公开(公告)日:2018-10-25
申请号:US16019897
申请日:2018-06-27
Applicant: Alibaba Group Holding Limited
Inventor: Jun Zhou
IPC: G06F17/30
Abstract: Techniques for navigating webpages requested through short links are provided. In some implementations, a short link uniform resource locator (URL) is received, the short link URL is processed to extract a simplified short link and an address code, and a determination is made as to whether the simplified short link is associated with a long link URL representing an address of a webpage. In response to determining that the simplified short link is associated with a long link URL, the associated long link URL is provided. In response to determining that the simplified short link is not associated with a long link URL, a common long link URL associated with the address code is provided.
-
公开(公告)号:US10824819B2
公开(公告)日:2020-11-03
申请号:US16879316
申请日:2020-05-20
Applicant: ALIBABA GROUP HOLDING LIMITED
Inventor: Shaosheng Cao , Jun Zhou
IPC: G06F40/279 , G06F40/284 , G06N3/04 , G06N20/00 , G06F40/40 , G06F40/30 , G06N3/08
Abstract: Implementations of the present specification disclose methods, apparatuses, and devices for generating word vectors. The method includes: obtaining individual words by segmenting a corpus; establishing a feature vector of each word based on n-ary characters; training a recurrent neural network based on the feature vectors of the obtained words and feature vectors of context words associated with the obtained words in the corpus; and generating a word vector for each obtained word based on the feature vector of the obtained word and the trained recurrent neural network.
-
-
-
-
-
-
-
-
-