-
1.
Publication No.: US20240161019A1
Publication date: 2024-05-16
Application No.: US18507705
Filing date: 2023-11-13
Inventor: Heuiseok LIM, Gyeongmin KIM
IPC: G06N20/20, G06F16/215, G06F16/901
CPC classification number: G06N20/20, G06F16/215, G06F16/9014
Abstract: Disclosed herein is a method of generating a similarity determination model for programming code based on a cross-validation ensemble and a filtering strategy. The method is performed by a computing device including at least a processor and includes: performing preprocessing on raw data written in any one language; performing filtering on the preprocessed data; generating positive pairs and negative pairs for training; and training a pre-trained language model using the generated positive pairs and negative pairs.
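
The preprocessing, filtering, and pair-generation steps named in this abstract can be illustrated with a minimal sketch. The abstract does not say how pairs are formed, so everything concrete here is an assumption: snippets are grouped by a hypothetical problem ID, two solutions to the same problem form a positive pair, and solutions to different problems form a negative pair. The helper names `preprocess`, `filter_snippets`, and `make_pairs` are illustrative only, and the cross-validation ensemble and model training are not shown.

```python
import itertools
import random


def preprocess(code):
    """Drop blank lines and full-line '#' comments (an assumed normalization)."""
    lines = [ln.rstrip() for ln in code.splitlines()]
    return "\n".join(
        ln for ln in lines if ln.strip() and not ln.lstrip().startswith("#")
    )


def filter_snippets(snippets, max_chars=2000):
    """Remove empty, overly long, and duplicate snippets (an assumed filter)."""
    seen, kept = set(), []
    for s in snippets:
        if s and len(s) <= max_chars and s not in seen:
            seen.add(s)
            kept.append(s)
    return kept


def make_pairs(solutions_by_problem, negatives_per_positive=1, seed=0):
    """Return (code_a, code_b, label) triples: 1 = same problem, 0 = different."""
    rng = random.Random(seed)
    problems = sorted(solutions_by_problem)
    pairs = []
    for pid in problems:
        # Problems other than the current one that still have snippets left.
        others = [p for p in problems if p != pid and solutions_by_problem[p]]
        for a, b in itertools.combinations(solutions_by_problem[pid], 2):
            pairs.append((a, b, 1))
            for _ in range(negatives_per_positive):
                if not others:
                    break
                other = rng.choice(others)
                pairs.append((a, rng.choice(solutions_by_problem[other]), 0))
    return pairs


raw = {"p1": ["a = 1\n# note\n", "b = 2"], "p2": ["print(1)"]}
cleaned = {k: filter_snippets([preprocess(c) for c in v]) for k, v in raw.items()}
for triple in make_pairs(cleaned):
    print(triple)
```

The labeled triples are the kind of input a pair classifier built on a pre-trained language model would be fine-tuned on.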
-
2.
Publication No.: US20240289652A1
Publication date: 2024-08-29
Application No.: US18585166
Filing date: 2024-02-23
Inventor: Heuiseok LIM, Sugyeong EO, Hyeonseok MOON, Jinsung KIM, Yoona HUR, Jeongwook KIM
IPC: G06N5/04
CPC classification number: G06N5/04
Abstract: Disclosed are a device and a method for educational question-answer pair generation (QAG) considering type diversity. The method is performed by a computing device and includes: generating a query-focused summarization (QFS) for a passage; generating an initial answer based on the passage and the QFS; generating a question corresponding to the initial answer based on the initial answer, the passage, and an interrogative word; generating an answer corresponding to the question based on the question and the passage, and generating a question-answer (QA) pair; and deriving a final QA pair by selecting at least one QA pair from among the generated QA pairs.
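
The order of the generation steps in this abstract can be sketched as a small pipeline. The abstract does not name the underlying models, so each stage is passed in as a plain callable; the function name `generate_qa_pairs`, its parameters, and the choice to produce one candidate per interrogative word are all assumptions made for illustration.

```python
from typing import Callable, List, Sequence, Tuple

QAPair = Tuple[str, str]  # (question, answer)


def generate_qa_pairs(
    passage: str,
    summarize: Callable[[str], str],
    draft_answer: Callable[[str, str], str],
    ask: Callable[[str, str, str], str],
    answer: Callable[[str, str], str],
    select: Callable[[List[QAPair]], List[QAPair]],
    interrogatives: Sequence[str] = ("what", "who", "when", "where", "why", "how"),
) -> List[QAPair]:
    """Run the stages in the order the abstract lists them."""
    summary = summarize(passage)              # query-focused summarization (QFS)
    initial = draft_answer(passage, summary)  # initial answer from passage + QFS
    candidates: List[QAPair] = []
    for word in interrogatives:               # assumed: one candidate per word
        question = ask(initial, passage, word)
        candidates.append((question, answer(question, passage)))
    return select(candidates)                 # keep at least one final pair


# Toy stand-ins for the models, only to show the data flow.
demo = generate_qa_pairs(
    "Seoul is the capital of Korea. It is large.",
    summarize=lambda p: p.split(".")[0],
    draft_answer=lambda p, s: s,
    ask=lambda a, p, w: f"{w.capitalize()} does the passage say about '{a}'?",
    answer=lambda q, p: p.split(".")[0],
    select=lambda pairs: pairs[:2],
)
print(demo)
```

Varying the interrogative word is one plausible way to obtain the type diversity the title refers to; the selection step could equally be a learned ranker.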
-
3.
Publication No.: US20240249120A1
Publication date: 2024-07-25
Application No.: US18405051
Filing date: 2024-01-05
Inventor: Jinsung KIM, Heuiseok LIM
IPC: G06N3/0464
CPC classification number: G06N3/0464
Abstract: Disclosed are a device and a method for dialogue relation extraction using utterance-level graph computation. The method is performed by a computing device including at least a processor and includes: receiving a target conversation that includes a plurality of utterances and an argument pair that is the target of relation extraction; generating a graph (G=(A, X)) that includes an adjacency matrix (A) and a node feature matrix (X) based on the target conversation and the argument pair; and deriving a relation between the subject and the object included in the argument pair by inputting the graph to a graph convolutional network (GCN) trained to infer the relation of the argument pair.
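
To make the graph G=(A, X) and the GCN step concrete, here is a minimal pure-Python sketch. The abstract does not specify how edges or node features are defined, so the edge rule (self-loops, adjacent utterances, and utterances by the same speaker), the toy feature values, and the weights are assumptions. The layer uses the widely used symmetric normalization, H = ReLU(D^(-1/2) A D^(-1/2) X W), which may differ from the claimed network.

```python
import math


def build_adjacency(speakers):
    """One node per utterance; the edge rule below is an assumption."""
    n = len(speakers)
    A = [[0.0] * n for _ in range(n)]
    for i in range(n):
        A[i][i] = 1.0  # self-loop
        if i + 1 < n:
            A[i][i + 1] = A[i + 1][i] = 1.0  # consecutive utterances
        for j in range(i + 1, n):
            if speakers[i] == speakers[j]:
                A[i][j] = A[j][i] = 1.0  # same speaker
    return A


def matmul(P, Q):
    """Plain matrix product of two lists of rows."""
    return [[sum(p * q for p, q in zip(row, col)) for col in zip(*Q)] for row in P]


def gcn_layer(A, X, W):
    """One graph convolution: ReLU(D^-1/2 A D^-1/2 X W), A including self-loops."""
    deg = [sum(row) for row in A]
    n = len(A)
    A_hat = [[A[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)] for i in range(n)]
    H = matmul(matmul(A_hat, X), W)
    return [[max(0.0, h) for h in row] for row in H]


speakers = ["S1", "S2", "S3"]
A = build_adjacency(speakers)
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # toy 2-dimensional utterance features
W = [[0.5], [0.5]]                        # toy weights; learned in practice
print(gcn_layer(A, X, W))
```

In a full model, the node representations would be pooled or combined with the argument pair and passed to a classifier over relation labels.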
-
4.
Publication No.: US20220366894A1
Publication date: 2022-11-17
Application No.: US17739383
Filing date: 2022-05-09
Inventor: Heuiseok LIM, Chanjun PARK
IPC: G10L15/06, G10L13/00, G06F40/166
Abstract: Disclosed are a training data construction method and a speech recognition method using the same. The training data construction method is performed by a computing apparatus including at least one processor and includes: converting first text data including a plurality of sentences to first speech data; acquiring second speech data by adding noise to the first speech data; and converting the second speech data to second text data. The second text data includes a sentence corresponding to each of the plurality of sentences included in the first text data.
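
The text-to-speech, noise, and speech-to-text round trip in this abstract can be sketched as follows. The abstract does not state the noise type or the speech models, so additive Gaussian noise at a target signal-to-noise ratio is an assumption, and `tts` and `stt` are hypothetical callables standing in for real text-to-speech and speech recognition systems. Pairing each recognized sentence with its source sentence is likewise an assumed use of the output.

```python
import math
import random


def add_noise(samples, snr_db, seed=0):
    """Add zero-mean Gaussian noise so the result has roughly the given SNR (dB)."""
    if not samples:
        return []
    rng = random.Random(seed)
    signal_power = sum(s * s for s in samples) / len(samples)
    noise_power = signal_power / (10 ** (snr_db / 10))
    sigma = math.sqrt(noise_power)
    return [s + rng.gauss(0.0, sigma) for s in samples]


def build_pairs(sentences, tts, stt, snr_db=10.0):
    """Return (second_text, first_text) pairs, one per input sentence."""
    pairs = []
    for sentence in sentences:
        clean_audio = tts(sentence)                 # first speech data
        noisy_audio = add_noise(clean_audio, snr_db)  # second speech data
        pairs.append((stt(noisy_audio), sentence))  # second text vs. first text
    return pairs


# Toy stand-ins: "audio" is a list of floats, "recognition" just reports its length.
toy_tts = lambda text: [math.sin(0.1 * i) for i in range(10 * len(text))]
toy_stt = lambda audio: f"<{len(audio)} samples>"
print(build_pairs(["hello", "hi"], toy_tts, toy_stt))
```

Because every input sentence yields exactly one output sentence, the result stays aligned sentence by sentence, matching the correspondence the abstract describes.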