DOMAIN-SPECIFIC TEXT LABELLING USING NATURAL LANGUAGE INFERENCE MODEL

    公开(公告)号:EP4372606A1

    公开(公告)日:2024-05-22

    申请号:EP23196795.1

    申请日:2023-09-12

    申请人: Fujitsu Limited

    IPC分类号: G06F40/30 G06F40/169 G06N3/08

    CPC分类号: G06N3/08 G06F40/30 G06F40/169

    摘要: In an embodiment, a set of texts associated with a domain is received. A set of hypothesis statements associated with the domain is received. A pre-trained natural language inference (NLI) model is applied on each of the received set of texts and on each of the received set of hypothesis statements. A second text corpus associated with the domain is generated. The generated second text corpus corresponds to a set of labels associated with the domain. A few-shot learning model is applied on the generated second text corpus to generate a third text corpus associated with the domain. The generated third text corpus is configured to fine-tune the applied pre-trained NLI model, and the fine-tuned NLI model is configured to label an input text associated with the domain. A display of the labelled input text on a display device is controlled.