Invention Grant
- Patent Title: Computing numeric representations of words in a high-dimensional space
-
Application No.: US16363460Application Date: 2019-03-25
-
Publication No.: US10922488B1Publication Date: 2021-02-16
- Inventor: Tomas Mikolov , Kai Chen , Gregory S. Corrado , Jeffrey A. Dean
- Applicant: Google LLC
- Applicant Address: US CA Mountain View
- Assignee: Google LLC
- Current Assignee: Google LLC
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Main IPC: G10L15/00
- IPC: G10L15/00 ; G06F40/279 ; G10L15/06 ; G06N20/00 ; G06F40/30

Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for computing numeric representations of words. One of the methods includes obtaining a set of training data, wherein the set of training data comprises sequences of words; training a classifier and an embedding function on the set of training data, wherein training the embedding function comprises obtained trained values of the embedding function parameters; processing each word in the vocabulary using the embedding function in accordance with the trained values of the embedding function parameters to generate a respective numerical representation of each word in the vocabulary in the high-dimensional space; and associating each word in the vocabulary with the respective numeric representation of the word in the high-dimensional space.
Information query