-
公开(公告)号:US10769136B2
公开(公告)日:2020-09-08
申请号:US15826279
申请日:2017-11-29
Applicant: Microsoft Technology Licensing, LLC
Inventor: Cagri Ozcaglar , Xianren Wu , Jaewon Yang , Abhishek Gupta , Anish Ramdas Nair
IPC: G06F16/242 , G06F16/25 , G06F16/248 , G06F16/9535 , G06Q50/00 , G06Q10/10 , G06F16/9536
Abstract: Techniques for improving search using generalized linear mixed models are disclosed herein. In some embodiments, a computer-implemented method comprises: receiving a search query comprising at least one search term and being associated with a user; extracting features from corresponding profiles of a plurality of candidates; for each one of the candidates, generating a corresponding score based on a generalized linear mixed model comprising a generalized linear query-based model and a random effects user-based model; selecting a subset of candidates from the plurality of candidates based on the corresponding scores; and causing the selected subset of candidates to be displayed to the user in a search results page for the search query.
-
公开(公告)号:US10324937B2
公开(公告)日:2019-06-18
申请号:US15221195
申请日:2016-07-27
Applicant: Microsoft Technology Licensing, LLC
Inventor: Jaewon Yang , Liang Tang , Bee-Chung Chen
IPC: G06N7/00 , G06F17/21 , H04L12/58 , H04L29/08 , G06F16/248 , G06F16/2457 , G06F16/9535
Abstract: A news feed system provided with an on-line social network system determines that a news feed is to be constructed for a viewer. The news feed system accesses the viewer's profile and other information associated with the viewer, accesses an inventory of activities that have been identified as potentially of interest to the viewer, and calculates relevance score for each item inventory of activities using the combined coefficients methodology. The activities are then arranged for presentation to the viewer via a news feed web page, using respective calculated relevance scores.
-
公开(公告)号:US11816636B2
公开(公告)日:2023-11-14
申请号:US17412753
申请日:2021-08-26
Applicant: Microsoft Technology Licensing, LLC
Inventor: Liwei Wu , Wenjia Ma , Jaewon Yang , Yanen Li
IPC: G06Q10/1053 , G06N20/20 , G06Q10/0631 , G06N5/01
CPC classification number: G06Q10/1053 , G06N5/01 , G06N20/20 , G06Q10/063112
Abstract: Techniques for mining training data for use in training a dependency model are disclosed herein. In some embodiments, a computer-implemented method comprises: obtaining training data comprising a plurality of reference skill pairs, each reference skill pair comprising a corresponding first reference skill and a corresponding second reference skill, the plurality of reference skill pairs being included in the training data based on a co-occurrence of the corresponding first and second reference skills for each reference skill pair in the plurality of reference skill pairs, the co-occurrence comprising the corresponding first and second reference skills co-occurring for a same entity; and training a dependency model with a machine learning algorithm using the training data, the dependency model comprising a logistic regression model or a data gradient boosted decision tree (GBDT) model. The dependency model may then be used to identify corresponding dependency relations for a plurality of target skill pairs.
-
公开(公告)号:US11604990B2
公开(公告)日:2023-03-14
申请号:US16902587
申请日:2020-06-16
Applicant: Microsoft Technology Licensing, LLC
Inventor: Xiao Yan , Wenjia Ma , Jaewon Yang , Jacob Bollinger , Qi He , Lin Zhu , How Jing
Abstract: In an example embodiment, a framework to infer a user's value for a particular attribute based upon a multi-task machine learning process with uncertainty weighting that incorporates signals from multiple contexts is provided. In an example embodiment, the framework aims to measure a level of a user attribute under a certain context. Rather than attempting to devise a universal, one-size-fits-all value for the attribute, the framework acknowledges that the user's value for that attribute can vary depending on context and factors in the context under which the user's attribute levels are measured. Multiple contexts are defined depending on different situations where users and entities such as companies and organizations need to evaluate user attribute levels. Signals for attribute levels are then collected for each context. Machine learning models are utilized to estimate attribute values for different contexts. Multi-task deep learning is used to level attributes from different contexts.
-
公开(公告)号:US20230075600A1
公开(公告)日:2023-03-09
申请号:US17468028
申请日:2021-09-07
Applicant: Microsoft Technology Licensing, LLC
Inventor: Yiming Ma , Lili Zhang , Wei Kang , Jaewon Yang
Abstract: Techniques for training and using a neural network to make predictions using trajectory modelling are disclosed herein. In some embodiments, a computer-implemented method comprises: training a first neural network with a first machine learning algorithm using training data, the first neural network being a recurrent neural network, the training data including a plurality of reference career trajectories, each reference career trajectory in the plurality of reference career trajectories comprising a sequence of reference career segments, each reference career segment in the sequence of reference career segments comprising reference profile data and reference time data indicating a position of the reference career segment within the sequence of reference career segments, the training data also including a corresponding set of reference skills for each reference career segment.
-
公开(公告)号:US11487947B2
公开(公告)日:2022-11-01
申请号:US16716402
申请日:2019-12-16
Applicant: Microsoft Technology Licensing, LLC
IPC: G06F17/00 , G06F40/30 , G06F16/93 , G06F40/289 , G06F40/284 , G06N20/00
Abstract: Techniques are provided for using machine learning techniques to analyze textual content. In one technique, a potential item is identified within a document. An analysis of the potential item is performed at multiple levels of granularity that includes two or more of a sentence level, a segment level, or a document level. The analysis produces multiple outputs, one for each level of granularity in the multiple levels of granularity. The outputs are input into a machine-learned model to generate a score for the potential item. Based on the score, the potential item is presented on a computing device. In response to user selection of the potential item, an association between the potential item and the document is created. The association may be used later to identify a set of users to which the document (or data thereof) is to be presented.
-
公开(公告)号:US20210303638A1
公开(公告)日:2021-09-30
申请号:US16836546
申请日:2020-03-31
Applicant: Microsoft Technology Licensing, LLC
Inventor: Peide Zhong , Feishe Chen , Weizhi Meng , Wei Kang , Feng Guo , Fei Chen , Jaewon Yang , Qi He
IPC: G06F16/903 , G06K9/62 , G06F40/30 , G06N20/00
Abstract: The disclosed embodiments provide a system for processing user-generated input. During operation, the system obtains a first embedding produced by an embedding model from an input string representing an entity and a hierarchy of clusters of embeddings generated by the embedding model from a set of standardized entities. Next, the system searches the hierarchy of clusters for a subset of the embeddings that are within a threshold proximity to the first embedding in a vector space. The system then calculates embedding match scores between the input string and a first subset of the standardized entities represented by the subset of the embeddings based on distances between the subset of the embeddings and the first embedding in the vector space. Finally, the system modifies, based on the embedding match scores, content outputted in response to the input string within a user interface of an online system.
-
公开(公告)号:US20210065047A1
公开(公告)日:2021-03-04
申请号:US16556097
申请日:2019-08-29
Applicant: Microsoft Technology Licensing, LLC
Inventor: Baoxu Shi , Jaewon Yang , Qi He
Abstract: Techniques for learning entity representations in a scalable manner are provided. A graph that comprises a plurality of nodes representing a set of entities is stored. A first subset of the set of entities and a second subset of the set of entities are identified. For each entity in the first subset of the set of entities, one or more machine learning techniques are used to generate a machine-learned embedding for the entity. For each entity in the second subset of the set of entities, a subset of entities in the first subset that are associated with the entity is identified. One or more embeddings are identified for the subset of entities. Based on the one or more embeddings, an inferred embedding is generated for the entity.
-
公开(公告)号:US20190130281A1
公开(公告)日:2019-05-02
申请号:US15799396
申请日:2017-10-31
Applicant: Microsoft Technology Licensing, LLC
Inventor: Jaewon Yang , Qi He , How Jing , Bee-Chung Chen , Liangyue Li
IPC: G06N5/02
Abstract: Techniques for predicting a next company and next title of a user are disclosed herein. In some embodiments, an encoder is used for encoding a representation of the user's profile. The encoding includes accessing discrete entities comprising context information included in the user's profile, constructing a plurality of embedding vectors from the context information, and generating a context vector from the plurality of embedding vectors. The plurality of embedding vectors including a skill embedding vector, a school embedding vector, and a location embedding vector. A decoder is for decoding a career path from the context vector. The decoding includes applying a long short-term memory (LSTM) model to the context vector to generate perform the prediction of the user's next company and next title for presentation in a user interface.
-
公开(公告)号:US11775778B2
公开(公告)日:2023-10-03
申请号:US17090776
申请日:2020-11-05
Applicant: Microsoft Technology Licensing, LLC
Inventor: Zhuliu Li , Xiao Yan , Yiming Wang , Jaewon Yang
IPC: G06F40/00 , G06F40/58 , G06N3/08 , G06N3/04 , G06F40/295
CPC classification number: G06F40/58 , G06F40/295 , G06N3/04 , G06N3/08
Abstract: Embodiments of the disclosed technologies incorporate taxonomy information into a cross-lingual entity graph and input the taxonomy-informed cross-lingual entity graph into a graph neural network. The graph neural network computes semantic alignment scores for node pairs. The semantic alignment scores are used to determine whether a node pair represents a valid machine translation.
-
-
-
-
-
-
-
-
-