SENTENCE EMBEDDING METHOD AND APPARATUS BASED ON SUBWORD EMBEDDING AND SKIP-THOUGHTS

    公开(公告)号:US20200175119A1

    公开(公告)日:2020-06-04

    申请号:US16671773

    申请日:2019-11-01

    IPC分类号: G06F17/28 G06F17/27

    摘要: Provided are sentence embedding method and apparatus based on subword embedding and skip-thoughts. To integrate skip-thought sentence embedding learning methodology with a subword embedding technique, a skip-thought sentence embedding learning method based on subword embedding and methodology for simultaneously learning subword embedding learning and skip-thought sentence embedding learning, that is, multitask learning methodology, are provided as methodology for applying intra-sentence contextual information to subword embedding in the case of subword embedding learning. This makes it possible to apply a sentence embedding approach to agglutinative languages such as Korean in a bag-of-words form. Also, skip-thought sentence embedding learning methodology is integrated with a subword embedding technique such that intra-sentence contextual information can be used in the case of subword embedding learning. A proposed model minimizes additional training parameters based on sentence embedding such that most training results may be accumulated in a subword embedding parameter.