-
Publication No.: US20230067435A1
Publication Date: 2023-03-02
Application No.: US17978421
Application Date: 2022-11-01
Inventor: Jaejong Heo, Kyungwon Lee, Hyoji Ha
IPC: G06F40/30, G06F40/166, G06F40/279, G06F40/268
Abstract: According to an embodiment, a method of segmenting content by topic preprocesses the text data that makes up the content and, based on the preprocessed data, divides a plurality of utterances into two topic segmented bodies. The preprocessing may be performed by processing the text data into a continuous sequence of the plurality of utterances and calculating a conceptual similarity between utterances from the processed data. The division into two topic segmented bodies may be performed by varying the segmentation point that separates them, calculating at each candidate point a similarity cohesion for the two bodies based on the conceptual similarity and a consistency metric, and determining the segmentation point based on that similarity cohesion.
-
Publication No.: US12073185B2
Publication Date: 2024-08-27
Application No.: US17978421
Application Date: 2022-11-01
Inventor: Jaejong Heo, Kyungwon Lee, Hyoji Ha
IPC: G06F40/30, G06F40/268, G06F40/279, G06F40/166
CPC classification number: G06F40/30, G06F40/166, G06F40/268, G06F40/279
Abstract: According to an embodiment, a method of segmenting content by topic preprocesses the text data that makes up the content and, based on the preprocessed data, divides a plurality of utterances into two topic segmented bodies. The preprocessing may be performed by processing the text data into a continuous sequence of the plurality of utterances and calculating a conceptual similarity between utterances from the processed data. The division into two topic segmented bodies may be performed by varying the segmentation point that separates them, calculating at each candidate point a similarity cohesion for the two bodies based on the conceptual similarity and a consistency metric, and determining the segmentation point based on that similarity cohesion.
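Illustrative sketch (not taken from the patent): the abstract describes trying each candidate segmentation point, scoring the two resulting bodies by similarity cohesion, and picking the point from those scores. The Python below is one possible reading under loud assumptions. The abstract does not define "conceptual similarity", "similarity cohesion", or the "consistency metric", so this sketch substitutes word-set (Jaccard) overlap for the similarity, takes cohesion as the sum of the mean pairwise similarity inside each body, picks the point with the highest score, and leaves the consistency metric out entirely. The names jaccard, mean_pairwise, and best_split are hypothetical.

```python
from itertools import combinations
from typing import Sequence


def jaccard(a: str, b: str) -> float:
    """Stand-in for conceptual similarity (assumption): word-set overlap."""
    wa, wb = set(a.lower().split()), set(b.lower().split())
    if not wa or not wb:
        return 0.0
    return len(wa & wb) / len(wa | wb)


def mean_pairwise(utterances: Sequence[str]) -> float:
    """Mean similarity over all utterance pairs in one body (0.0 if fewer than two)."""
    pairs = list(combinations(utterances, 2))
    if not pairs:
        return 0.0
    return sum(jaccard(a, b) for a, b in pairs) / len(pairs)


def best_split(utterances: Sequence[str]) -> int:
    """Return k such that utterances[:k] and utterances[k:] are the two bodies.

    Every candidate segmentation point is tried; the one with the highest
    assumed cohesion score (left mean + right mean) is returned.
    """
    if len(utterances) < 2:
        raise ValueError("need at least two utterances to split")

    def cohesion(k: int) -> float:
        return mean_pairwise(utterances[:k]) + mean_pairwise(utterances[k:])

    return max(range(1, len(utterances)), key=cohesion)


if __name__ == "__main__":
    talk = [
        "the cat sat on the mat",
        "the cat likes the mat",
        "stock prices rose today",
        "stock prices fell today",
    ]
    k = best_split(talk)
    print("body 1:", talk[:k])
    print("body 2:", talk[k:])
```

In this toy scoring a one-utterance body contributes zero cohesion, which biases the choice away from splits at the very edges; an actual consistency metric would presumably change that behavior.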
-