-
公开(公告)号:US20250086227A1
公开(公告)日:2025-03-13
申请号:US18519415
申请日:2023-11-27
Inventor: Jin Young MOON , Jonghee KIM , Muah SEOL
IPC: G06F16/732 , G06V10/82 , G06V20/40
Abstract: Provided is a system for detecting a video semantic interval. The system includes a communication module configured to receive a video and a query sentence, memory in which a program for outputting a semantic interval proposal from the video and the query sentence is stored, and a processor configured to execute the program stored in the memory. By executing the program, the processor outputs a semantic interval proposal having start timing and end timing, which is matched with the query sentence within the video, over a pre-trained video semantic interval detection network based on boundary refinements as the results of the detection of the semantic interval proposal, and outputs a semantic interval proposal having a variable boundary through the refinements of a predetermined semantic interval proposal.
-
公开(公告)号:US20210142063A1
公开(公告)日:2021-05-13
申请号:US17088185
申请日:2020-11-03
Inventor: Jin Young MOON , Yong Jin KWON , Hyung Il KIM , Jong Youl PARK , Kang Min BAE , Ki Min YUN
Abstract: An electronic device according to an embodiment disclosed herein may include a memory including at least one instruction and a processor. By executing the at least one instruction, the processor may check feature information corresponding to a video and including at least one of an appearance-related feature value and a motion-related feature value from the video, calculate at least one of a starting score related to a starting point of an action instance, an ending score related to an ending point of an action instance, and a relatedness score between action instances on the basis of the feature information corresponding to the video, the action instances being included in the video, and generate an action proposal included in the video on the basis of the at least one score.
-
公开(公告)号:US20230259741A1
公开(公告)日:2023-08-17
申请号:US18090211
申请日:2022-12-28
Inventor: Joong Won HWANG , Yong Jin KWON , Jin Young MOON , Yu Seok BAE , Sung Chan OH
Abstract: The present disclosure relates to a method and apparatus for constructing a network adaptable to consecutive/complex domains. An apparatus for constructing a domain adaptive network according to an embodiment of the present disclosure includes a memory configured to store data; and a processor configured to control the memory, wherein the processor is configured to determine a weight to be applied to one or more neural networks based on input data, construct a final neural network by applying the weight to the one or more neural networks, and output result data of the input data using the final neural network, wherein the one or more neural networks are trained using data for each prototype domain.
-
公开(公告)号:US20220067382A1
公开(公告)日:2022-03-03
申请号:US17411728
申请日:2021-08-25
Inventor: Jin Young MOON , Hyung Il KIM , Jong Youl PARK , Kang Min BAE , Ki Min YUN
IPC: G06K9/00
Abstract: Provided is an apparatus for online action detection, the apparatus including a feature extraction unit configured to extract a chunk-level feature of a video chunk sequence of a streaming video, a filtering unit configured to perform filtering on the chunk-level feature, and an action classification unit configured to classify an action class using the filtered chunk-level feature.
-
公开(公告)号:US20180285744A1
公开(公告)日:2018-10-04
申请号:US15945690
申请日:2018-04-04
Inventor: Kyu Chang KANG , Yongjin KWON , Jin Young MOON , Kyoung PARK , Jongyoul PARK , Yu Seok BAE , Sungchan OH , Jeun Woo LEE
Abstract: A system for generating a multimedia knowledge base uses a multimedia information detection unit to detect texted meta information from multimedia data including at least one combination of a text, a voice, an image and a video and allows a knowledge base shaping unit to use the texted meta information and context information of the multimedia data to divide the multimedia data into syntactic information representing extrinsic configuration information and semantic information representing intrinsic meaning information and may shape the syntactic information and the semantic information into the multimedia knowledge.
-
公开(公告)号:US20230083476A1
公开(公告)日:2023-03-16
申请号:US17881151
申请日:2022-08-04
Inventor: Jin Young MOON , Jung Kyoo SHIN
IPC: G06F16/732 , G06V20/40 , G06V10/82 , G06F40/279 , G06F40/30
Abstract: Provided is a method of detecting a semantics section in a video. The method includes extracting all video features by inputting an inputted video to a pre-trained first deep neural network algorithm, extracting a query sentence feature by inputting an inputted query sentence to a pre-trained second deep neural network algorithm, generating video-query relation integration feature information in which all of the video features and the query sentence feature have been integrated by inputting all of the video features and the query sentence feature to a plurality of scaled-dot product attention layers, and estimating a video segment corresponding to the query sentence in the video based on the video-query relation integration feature information.
-
公开(公告)号:US20200311389A1
公开(公告)日:2020-10-01
申请号:US16834500
申请日:2020-03-30
Inventor: Hyung Il KIM , Yong Jin KWON , Jin Young MOON , Jong Youl PARK , Sung Chan OH , Ki Min YUN , Jeun Woo LEE
Abstract: A domain adaptation-based object recognition apparatus includes a memory configured to store a domain adaptation-based object recognition program and a processor configured to execute the program. The processor learns a generative model for generating a feature or an image similar to a gallery image on the basis of domain adaptation in association with an input probe image and learns an object recognition classification model by using a learning database corresponding to the gallery image and the input probe image, thereby performing object recognition using the input probe image.
-
公开(公告)号:US20150324456A1
公开(公告)日:2015-11-12
申请号:US14602904
申请日:2015-01-22
Inventor: Young Rae KIM , Hyung Jik LEE , Jin Young MOON , Chang Seok BAE , Hyun Ki KIM
IPC: G06F17/30
CPC classification number: G06F17/30657 , G06F17/30011 , G06F17/30654 , G06F17/30696 , G06N5/00
Abstract: Provided is a question answering system with respect to a natural language question and a method thereof. The question answering system includes a candidate answer generating unit configured to extract a document mapped to an input natural language question, and generate candidate answers with respect to the natural language question from the extracted document, a text entailment recognizing unit configured to generate a text entailment recognition result representing a degree of association between multiple evidence sentences including the generated candidate answers and the natural language question, a list generating unit configured to generate a candidate answer list including the multiple evidence sentences in high association degree order on the basis of the text entailment recognition result, and an output unit configured to output the generated candidate answer list as a search result with respect to the natural language question.
Abstract translation: 提供了关于自然语言问题的问答系统及其方法。 所述问答系统包括:候选答案生成部,被配置为提取映射到输入自然语言问题的文档,并从所提取的文档生成关于自然语言问题的候选答案;文本携带识别单元,被配置为生成文本内容 识别结果表示包括所生成的候选答案和自然语言问题的多个证词之间的关联度;列表生成单元,被配置为基于文本含义生成包括高关联度顺序的多个证词的候选答案列表 识别结果,以及输出单元,被配置为输出所生成的候选答案列表作为关于自然语言问题的搜索结果。
-
公开(公告)号:US20230368499A1
公开(公告)日:2023-11-16
申请号:US18318159
申请日:2023-05-16
Inventor: Young Wan LEE , Jong Hee KIM , Jin Young MOON , Kang Min BAE , Yu Seok BAE , Je Seok HAM
CPC classification number: G06V10/7715 , G06V10/82 , G06V10/761 , G06V10/42
Abstract: The disclosure relates to a method of extracting image features based on a vision transformer, a method of performing embedding on an input image in units of patches and extracting visual features through global attention. An apparatus for extracting an image feature based on a vision transformer according to an embodiment of the disclosure includes a memory configured to store data and a processor configured to control the memory, wherein the processor is configured to perform embedding on multi-patches for an input image, extract feature maps for the embedding multi-patches, perform transformer encoding based on a neural network using the extracted feature maps, extract a feature of the input image through a final feature map extracted through the transformer encoding, and wherein the patches have different sizes.
-
公开(公告)号:US20230059462A1
公开(公告)日:2023-02-23
申请号:US17535486
申请日:2021-11-24
Inventor: Eun Woo KIM , Hyun Dong JIN , Ki Min YUN , Jin Young MOON
IPC: G06N3/04
Abstract: The present disclosure relates to a method and apparatus for performing multiple tasks based on task similarity by using artificial intelligence.
According to an embodiment of the present disclosure, a method for performing multi-task learning based on task similarity may include performing a similarity analysis between a first task and a second task and training a neural network for the second task based on a result of the similarity analysis. Herein, wherein in response to be determined that a first training dataset used for the first task and a second training dataset used for the second task are similar, the neural network may learn a second parameter allocated to the second training dataset based on a first parameter allocated to the first training dataset.
-
-
-
-
-
-
-
-
-