Cross-language models based on transfer learning
摘要:
A method for text classification involves generating, using a bilingual embedding model, source language embeddings for source language documents; obtaining source language document labels of the source language documents; and training a source language classifier model and a label embedding network, executing on a computing system, using the source language embeddings and the source language document labels. The method further involves generating pseudo-labels for unlabeled target language documents, by: generating, using the bilingual embedding model, target language embeddings for the unlabeled target language documents, and applying the source language classifier model and the label embedding network to the target language embeddings to obtain the pseudo-labels for the unlabeled target language documents. In addition, the method involves training a target language classifier model executing on the computing system using the target language embeddings and the pseudo labels.
信息查询
0/0