发明申请
US20120323968A1 Learning Discriminative Projections for Text Similarity Measures
审中-公开
用于文本相似度量度的学习判别预测
- 专利标题: Learning Discriminative Projections for Text Similarity Measures
- 专利标题(中): 用于文本相似度量度的学习判别预测
-
申请号: US13160485申请日: 2011-06-14
-
公开(公告)号: US20120323968A1公开(公告)日: 2012-12-20
- 发明人: Wen-tau Yih , Kristina N. Toutanova , Christopher A. Meek , John C. Platt
- 申请人: Wen-tau Yih , Kristina N. Toutanova , Christopher A. Meek , John C. Platt
- 申请人地址: US WA Redmond
- 专利权人: Microsoft Corporation
- 当前专利权人: Microsoft Corporation
- 当前专利权人地址: US WA Redmond
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
A model for mapping the raw text representation of a text object to a vector space is disclosed. A function is defined for computing a similarity score given two output vectors. A loss function is defined for computing an error based on the similarity scores and the labels of pairs of vectors. The parameters of the model are tuned to minimize the loss function. The label of two vectors indicates a degree of similarity of the objects. The label may be a binary number or a real-valued number. The function for computing similarity scores may be a cosine, Jaccard, or differentiable function. The loss function may compare pairs of vectors to their labels. Each element of the output vector is a linear or non-linear function of the terms of an input vector. The text objects may be different types of documents and two different models may be trained concurrently.
信息查询