Methods and apparatus for creating domain-specific intended-meaning natural language processing pipelines

    公开(公告)号:US11681878B2

    公开(公告)日:2023-06-20

    申请号:US17982760

    申请日:2022-11-08

    IPC分类号: G06F17/00 G06F40/30

    CPC分类号: G06F40/30

    摘要: A method includes receiving a dataset that includes a plurality of input texts. Each input text from the plurality of texts is associated with a content category from a plurality of content categories based on a comparison between that input text and an intended meaning that is common for each comparison. For each model in a plurality of models, and for each content category from the plurality of content categories, that model is executed on each input text from the plurality of input texts to generate an average similarity/dissimilarity score for that content category. At least one model from the plurality of models is selected, based on the average similarity score for each content category from the plurality of content categories for each model in the plurality of models, to determine whether an input text is similar/dissimilar to the intended meaning.