REFERENCE DRIVEN NLP-BASED TOPIC CATEGORIZATION

    公开(公告)号:US20240169147A1

    公开(公告)日:2024-05-23

    申请号:US18056463

    申请日:2022-11-17

    IPC分类号: G06F40/20 G06F16/951

    CPC分类号: G06F40/20 G06F16/951

    摘要: A computer-implemented method includes: obtaining a record including text narration; inputting the text narration into an NLP model; generating, by the NLP model, encoded narration based on the text narration; determining similarity index between the encoded narration and each of encoded reference topics; comparing the similarity index between the encoded narration and each encoded reference topic to similarity threshold; and determining whether the similarity index between the encoded narration and each encoded reference topic is equal to or greater than the similarity threshold. When the similarity index is equal to or greater than the similarity threshold, a corresponding reference topic is added to a result group, and, when the similarity index is less than the similarity threshold, the corresponding reference topic is left out of the result group. The record is classified based on the reference topic included in the result group and corresponding to the encoded reference topic having greatest similarity index.

    PREDICTING RECORD TOPIC USING TRANSITIVE RELATIONS

    公开(公告)号:US20240169216A1

    公开(公告)日:2024-05-23

    申请号:US18056456

    申请日:2022-11-17

    IPC分类号: G06N5/02

    CPC分类号: G06N5/022

    摘要: A method includes generating dataset using topics associated with historical records, the dataset including pairs of data that are formed based on the topics, each of the pairs of data including an antecedent topic associated with a historical record corresponding to a preceding event and a consequent topic associated with a historical record corresponding to an event that occurred after the preceding event, the antecedent topic and the consequent topic forming a transitive relation for each of the pairs of data; inputting, into ML model, the pairs of data and input topic associated with a record of a user; generating, by the ML model, a prediction of a next record topic for a next record corresponding to the user, based on the consequent topic included in each of the pairs of data that include the antecedent topic corresponding to the input topic; and outputting the prediction.