Invention Grant
- Patent Title: Language-agnostic understanding
- Application No.: US15850382
- Application Date: 2017-12-21
- Publication No.: US10657332B2
- Publication Date: 2020-05-19
- Inventor: Ying Zhang, Reshef Shilon, Jing Zheng
- Applicant: Facebook, Inc.
- Applicant Address: US CA Menlo Park
- Assignee: FACEBOOK, INC.
- Current Assignee: FACEBOOK, INC.
- Current Assignee Address: US CA Menlo Park
- Main IPC: G06F40/49
- IPC: G06F40/49; G06F16/35; G06F40/30; G06F40/44; G06F40/58; G06F40/216; G06F40/284

Abstract:
Exemplary embodiments relate to techniques for classifying, or detecting the intent of, content written in a language for which a classifier does not exist. These techniques involve building a code-switching corpus via machine translation; generating a universal embedding for words in the code-switching corpus; training a classifier on the universal embeddings to generate an embedding mapping/table; accessing new content written in a language for which a specific classifier may not exist; and mapping entries in the embedding mapping/table to the universal embeddings. Using these techniques, a classifier can be applied to the universal embedding without needing to be trained on a particular language. Exemplary embodiments may be applied to recognize similarities between two content items, make recommendations, find similar documents, perform deduplication, and perform topic tagging for stories in foreign languages.
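The steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and every name in it is hypothetical: a toy dictionary (`EN_TO_ES`) stands in for machine translation, sparse co-occurrence counts stand in for learned universal embeddings, and a nearest-centroid rule stands in for the classifier. The point it illustrates is that a word and its translation occur in the same mixed-language contexts, so they receive matching vectors, and a classifier trained only on English labels can then score Spanish input.

```python
from collections import Counter, defaultdict
from itertools import product
import math

# Assumption: a toy word dictionary stands in for a machine translation system.
EN_TO_ES = {
    "loved": "encanto", "hated": "odie",
    "great": "genial", "good": "buena",
    "awful": "pesima", "bad": "mala",
    "movie": "pelicula", "film": "filme",
}

# Labeled training data exists only in English.
TRAIN_EN = [
    ("loved great movie", "pos"),
    ("loved good film", "pos"),
    ("hated awful movie", "neg"),
    ("hated bad film", "neg"),
]

def code_switch(sentence):
    """Yield every variant in which each word is kept or swapped for its translation."""
    options = [(w, EN_TO_ES[w]) if w in EN_TO_ES else (w,) for w in sentence.split()]
    for variant in product(*options):
        yield list(variant)

def build_embedding_table(sentences):
    """Map each word to a sparse co-occurrence vector over the code-switched corpus."""
    table = defaultdict(Counter)
    for sentence in sentences:
        for variant in code_switch(sentence):
            for i, word in enumerate(variant):
                for j, context in enumerate(variant):
                    if i != j:
                        table[word][context] += 1
    return dict(table)

def embed(text, table):
    """Sum the vectors of known words; unknown words are skipped."""
    total = Counter()
    for word in text.split():
        total.update(table.get(word, {}))
    return total

def cosine(a, b):
    dot = sum(v * b.get(k, 0) for k, v in a.items())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def train_centroids(labeled, table):
    """Nearest-centroid 'classifier': one summed embedding per label."""
    centroids = defaultdict(Counter)
    for text, label in labeled:
        centroids[label].update(embed(text, table))
    return dict(centroids)

def classify(text, centroids, table):
    vec = embed(text, table)
    return max(centroids, key=lambda label: cosine(vec, centroids[label]))

table = build_embedding_table(text for text, _ in TRAIN_EN)
centroids = train_centroids(TRAIN_EN, table)
```

In this sketch, `classify("encanto buena pelicula", centroids, table)` scores a Spanish word combination that never appears in the corpus, using centroids fit only on the English labels. A real system would presumably replace the co-occurrence counts with trained embeddings and the centroid rule with a learned model.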
Public/Granted literature
- US20190197119A1 LANGUAGE-AGNOSTIC UNDERSTANDING Public/Granted day: 2019-06-27