-
公开(公告)号:US20200286243A1
公开(公告)日:2020-09-10
申请号:US16292847
申请日:2019-03-05
Applicant: International Business Machines Corporation
Inventor: Chuang Gan , Yang Zhang , Sijia Liu , Dakuo Wang
Abstract: Embodiments of the present invention are directed to a computer-implemented method for action localization. A non-limiting example of the computer-implemented method includes receiving, by a processor, a video and segmenting, by the processor, the video into a set of video segments. The computer-implemented method classifies, by the processor, each video segment into a class and calculates, by the processor, importance scores for each video segment of a class within the set of video segments. The computer-implemented method determines, by the processor, a winning video segment of the class within the set of video segments based on the importance scores for each video segment within the class, stores, by the processor, the winning video segment from the set of video segments, and removes the winning video segment from the set of video segments.
-
公开(公告)号:US20200175281A1
公开(公告)日:2020-06-04
申请号:US16206683
申请日:2018-11-30
Applicant: International Business Machines Corporation
Inventor: Chuang Gan , Sijia Liu , Dakuo Wang , Yang Zhang
Abstract: A method (and structure and computer product) of temporal action localization in video data includes receiving a stream of video data and determining all proposals in the video data stream, the proposals being candidate regions for temporal action in the video data stream. Values for a pair-wise relation function are calculated for relating the proposals, wherein the pair-wise relation function calculates a scalar value representing a pair-wise relation weight for pairs of the proposals.
-
63.
公开(公告)号:US12249148B2
公开(公告)日:2025-03-11
申请号:US17656296
申请日:2022-03-24
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Zhenfang Chen , Chuang Gan , Bo Wu , Dakuo Wang
Abstract: According to one embodiment, a method, computer system, and computer program product for identifying one or more intrinsic physical properties of one or more objects is provided. The present invention may include identifying one or more objects in a video set, extracting observable physical properties of the identified one or more objects from the video set, including one or more trajectories, and inferring, by a property-based graph neural network, intrinsic properties of the one or more objects based on the trajectories.
-
公开(公告)号:US20250068821A1
公开(公告)日:2025-02-27
申请号:US18455745
申请日:2023-08-25
Applicant: International Business Machines Corporation
Inventor: Shun Zhang , Xin Zhang , Shaoze Fan , Ningyuan Cao , Jing Li , Xiaoxiao Guo , Chuang Gan
IPC: G06F30/398 , G06N20/20
Abstract: Methods and systems for circuit generation include generating a circuit design. Paths are extracted from the circuit design, with the paths representing sequences of connected circuit components from one terminal of the circuit to another. The extracted paths are embedded as respective vectors in a latent space. A property of the circuit design is determined using an ensemble of trained surrogate models that accept a sequence of the vectors as input.
-
公开(公告)号:US12182698B2
公开(公告)日:2024-12-31
申请号:US17039989
申请日:2020-09-30
Inventor: Dakuo Wang , Sijia Liu , Abel Valente , Chuang Gan , Bei Chen , Dongyu Liu , Yi Sun
Abstract: Use a computerized trained graph neural network model to classify an input instance with a predicted label. With a computerized graph neural network interpretation module, compute a gradient-based saliency matrix based on the input instance and the predicted label, by taking a partial derivative of class prediction with respect to an adjacency matrix of the model. With a computerized user interface, obtain user input responsive to the gradient-based saliency matrix. Optionally, modify the trained graph neural network model based on the user input; and re-classify the input instance with a new predicted label based on the modified trained graph neural network model.
-
公开(公告)号:US12175384B2
公开(公告)日:2024-12-24
申请号:US17381408
申请日:2021-07-21
Applicant: International Business Machines Corporation
Inventor: Bo Wu , Chuang Gan , Dakuo Wang , Zhenfang Chen
IPC: G06F17/00 , G06F40/205 , G06F40/284 , G06N5/02 , G06N5/04 , G06N20/20 , G06V20/40
Abstract: Mechanisms are provided for performing artificial intelligence-based video question answering. A video parser parses an input video data sequence to generate situation data structure(s), each situation data structure comprising data elements corresponding to entities, and first relationships between entities, identified by the video parser as present in images of the input video data sequence. First machine learning computer model(s) operate on the situation data structure(s) to predict second relationship(s) between the situation data structure(s). Second machine learning computer model(s) execute on a received input question to predict an executable program to execute to answer the received question. The program is executed on the situation data structure(s) and predicted second relationship(s). An answer to the question is output based on results of executing the program.
-
公开(公告)号:US20240420455A1
公开(公告)日:2024-12-19
申请号:US18451878
申请日:2023-08-18
Inventor: Pin-Yu Chen , I-Hsin Chung , Bo Wu , Chuang Gan , Tsung-Yi Ho , Sheng-Yen Chou
IPC: G06V10/774
Abstract: Techniques regarding generating a synthetic dataset of objects are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can include a defining component that can define a tractable forward process associated with a diffusion model, with defining the tractable forward process including inputting noise to compromise training data, resulting in compromised training data. The computer executable components can further include a training component that, using the compromised training data, trains the diffusion model to reverse process the tractable forward process, wherein the training results in a compromised diffusion model.
-
公开(公告)号:US20240170007A1
公开(公告)日:2024-05-23
申请号:US18053056
申请日:2022-11-07
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Kaizhi Qian , Yang Zhang , Chuang Gan , Dakuo Wang , Bo Wu
IPC: G10L25/30 , G10L21/0272
CPC classification number: G10L25/30 , G10L21/0272
Abstract: A method, computer system and computer program product is presented for providing a self-supervised speech representation. In one embodiment, audio input is received including speech utterances. A label sequence is generated from these speech utterances by a teacher label generator. A speech representation is generated of a partially masked version of the speech utterance using a speech representation network. The speech utterance is passed into two random transformations that alter only speaker information prior to the partial masking. A predictor will then predict the label sequence. In one embodiment performance-based assessment is made on a cross-entropy loss between the generated label sequence and a predicted label sequence.
-
公开(公告)号:US11989237B2
公开(公告)日:2024-05-21
申请号:US16551021
申请日:2019-08-26
Applicant: International Business Machines Corporation
Inventor: Dakuo Wang , Ming Tan , Chuang Gan , Haoyu Wang , Mo Yu
IPC: G06F16/9032 , G06N5/04
CPC classification number: G06F16/90332 , G06N5/04
Abstract: An artificial intelligence (AI) interaction method, system, and computer program product include selecting an artificial intelligence model to respond to a query to generating a response to the query using the selected artificial intelligence model, and receiving the response to the query from the selected artificial intelligence model.
-
70.
公开(公告)号:US20240111950A1
公开(公告)日:2024-04-04
申请号:US17936097
申请日:2022-09-28
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Zhenfang Chen , Chuang Gan , Bo Wu , Dakuo Wang
IPC: G06F40/205 , G06N3/04 , G06T7/12
CPC classification number: G06F40/205 , G06N3/0454 , G06T7/12
Abstract: A computer-implemented method for fine-grained referring expression comprehension is provided. The computer-implemented method includes receiving, at a processor, a textual expression and an image as inputs and executing, at the processor, fine-grained referring expression comprehension. The executing includes decomposing the textual expression into different textual modules, extracting visual regional proposals from the image, using language-guided graph neural networks to mine fine-grained object relations from the visual regional proposals and aggregating different matching similarities between the different textual modules and the fine-grained object relations.
-
-
-
-
-
-
-
-
-