METHOD AND APPARATUS FOR SEQUENCE LABELING ON ENTITY TEXT, AND NON-TRANSITORY COMPUTER-READABLE RECORDING MEDIUM

    Publication (Announcement) No.: US20220164536A1

    Publication (Announcement) Date: 2022-05-26

    Application No.: US17455967

    Application Date: 2021-11-22

    IPC Classification: G06F40/295

    Abstract: A method and an apparatus for sequence labeling on an entity text, and a non-transitory computer-readable recording medium, are provided. In the method, a start position of an entity text within a target text is determined. A first matrix is then generated based on the start position of the entity text. Elements of the first matrix indicate the focusable weights of each word with respect to the other words in the target text. A named entity recognition model is then generated using the first matrix. The model is obtained by training on first training data, which includes word embeddings corresponding to the respective texts in a training text set; each of these texts has already been annotated with entity labels. Finally, the target text is input to the named entity recognition model, and a probability distribution of the entity label is output.
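
The abstract does not state how the first matrix is derived from the start position, so the following is only an illustrative sketch under an assumed rule, not the claimed construction. It assumes that each entity start position opens a new segment of the target text and that a word may focus only on words in its own segment. The function name `build_focus_matrix`, the 0/1 weights, and the sample sentence are all hypothetical.

```python
from bisect import bisect_right


def build_focus_matrix(num_words, start_positions):
    """Return an n x n matrix; entry [i][j] is 1.0 when word i may focus on word j.

    Assumption (not from the source): every entity start position opens a new
    segment, and a word may focus only on words inside its own segment.
    """
    starts = sorted(set(start_positions))
    # Segment id of a word = number of start positions at or before its index.
    segment = [bisect_right(starts, i) for i in range(num_words)]
    return [
        [1.0 if segment[i] == segment[j] else 0.0 for j in range(num_words)]
        for i in range(num_words)
    ]


# Hypothetical target text with entity texts starting at word 0 and word 3.
tokens = ["Alice", "Smith", "visited", "New", "York"]
matrix = build_focus_matrix(len(tokens), start_positions=[0, 3])

for token, row in zip(tokens, matrix):
    print(f"{token:>8}: {row}")
```

In an attention-based recognition model, a matrix of this kind would typically be applied as a mask or bias on the attention scores before normalization; whether the described method does so is not stated in the abstract.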