Patent search ap:("INTERNATIONAL BUSINESS MACHINES CORPORATION") AND inv:"Chuang Gan" Page 7

61.

发明申请
ITERATIVE APPROACH FOR WEAKLY-SUPERVISED ACTION LOCALIZATION 审中-公开

公开(公告)号：US20200286243A1

公开(公告)日：2020-09-10

申请号：US16292847

申请日：2019-03-05

Applicant: International Business Machines Corporation

Inventor： Chuang Gan , Yang Zhang , Sijia Liu , Dakuo Wang

IPC: G06T7/215 , G06K9/62 , G06N3/04 , G06K9/00 , G06N20/00 , G06F16/75

Abstract: Embodiments of the present invention are directed to a computer-implemented method for action localization. A non-limiting example of the computer-implemented method includes receiving, by a processor, a video and segmenting, by the processor, the video into a set of video segments. The computer-implemented method classifies, by the processor, each video segment into a class and calculates, by the processor, importance scores for each video segment of a class within the set of video segments. The computer-implemented method determines, by the processor, a winning video segment of the class within the set of video segments based on the importance scores for each video segment within the class, stores, by the processor, the winning video segment from the set of video segments, and removes the winning video segment from the set of video segments.

62.

发明申请
RELATION ATTENTION MODULE FOR TEMPORAL ACTION LOCALIZATION 审中-公开

公开(公告)号：US20200175281A1

公开(公告)日：2020-06-04

申请号：US16206683

申请日：2018-11-30

Applicant: International Business Machines Corporation

Inventor： Chuang Gan , Sijia Liu , Dakuo Wang , Yang Zhang

IPC: G06K9/00 , G06K9/32 , G06K9/62 , G06F17/16 , G06F17/11

Abstract: A method (and structure and computer product) of temporal action localization in video data includes receiving a stream of video data and determining all proposals in the video data stream, the proposals being candidate regions for temporal action in the video data stream. Values for a pair-wise relation function are calculated for relating the proposals, wherein the pair-wise relation function calculates a scalar value representing a pair-wise relation weight for pairs of the proposals.

63.

发明授权
Object-centric and relation-centric graph neural networks for physical property discovery 有权

公开(公告)号：US12249148B2

公开(公告)日：2025-03-11

申请号：US17656296

申请日：2022-03-24

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor： Zhenfang Chen , Chuang Gan , Bo Wu , Dakuo Wang

IPC: G06V20/40 , G06T7/20 , G06V10/62 , G06V10/82 , G06V20/50

Abstract: According to one embodiment, a method, computer system, and computer program product for identifying one or more intrinsic physical properties of one or more objects is provided. The present invention may include identifying one or more objects in a video set, extracting observable physical properties of the identified one or more objects from the video set, including one or more trajectories, and inferring, by a property-based graph neural network, intrinsic properties of the one or more objects based on the trajectories.

64.

发明申请
CIRCUIT DESIGN WITH ENSEMBLE-BASED LEARNING 有权

公开(公告)号：US20250068821A1

公开(公告)日：2025-02-27

申请号：US18455745

申请日：2023-08-25

Applicant: International Business Machines Corporation

Inventor： Shun Zhang , Xin Zhang , Shaoze Fan , Ningyuan Cao , Jing Li , Xiaoxiao Guo , Chuang Gan

IPC: G06F30/398 , G06N20/20

Abstract: Methods and systems for circuit generation include generating a circuit design. Paths are extracted from the circuit design, with the paths representing sequences of connected circuit components from one terminal of the circuit to another. The extracted paths are embedded as respective vectors in a latent space. A property of the circuit design is determined using an ensemble of trained surrogate models that accept a sequence of the vectors as input.

65.

发明授权
Interpretable visualization system for graph neural network 有权

公开(公告)号：US12182698B2

公开(公告)日：2024-12-31

申请号：US17039989

申请日：2020-09-30

Applicant: International Business Machines Corporation , Massachusetts Institute of Technology

Inventor： Dakuo Wang , Sijia Liu , Abel Valente , Chuang Gan , Bei Chen , Dongyu Liu , Yi Sun

IPC: G06N3/04 , G06N3/08

Abstract: Use a computerized trained graph neural network model to classify an input instance with a predicted label. With a computerized graph neural network interpretation module, compute a gradient-based saliency matrix based on the input instance and the predicted label, by taking a partial derivative of class prediction with respect to an adjacency matrix of the model. With a computerized user interface, obtain user input responsive to the gradient-based saliency matrix. Optionally, modify the trained graph neural network model based on the user input; and re-classify the input instance with a new predicted label based on the modified trained graph neural network model.

66.

发明授权
Neural-symbolic action transformers for video question answering 有权

公开(公告)号：US12175384B2

公开(公告)日：2024-12-24

申请号：US17381408

申请日：2021-07-21

Applicant: International Business Machines Corporation

Inventor： Bo Wu , Chuang Gan , Dakuo Wang , Zhenfang Chen

IPC: G06F17/00 , G06F40/205 , G06F40/284 , G06N5/02 , G06N5/04 , G06N20/20 , G06V20/40

Abstract: Mechanisms are provided for performing artificial intelligence-based video question answering. A video parser parses an input video data sequence to generate situation data structure(s), each situation data structure comprising data elements corresponding to entities, and first relationships between entities, identified by the video parser as present in images of the input video data sequence. First machine learning computer model(s) operate on the situation data structure(s) to predict second relationship(s) between the situation data structure(s). Second machine learning computer model(s) execute on a received input question to predict an executable program to execute to answer the received question. The program is executed on the situation data structure(s) and predicted second relationship(s). An answer to the question is output based on results of executing the program.

67.

发明申请
GENERATING A TEST DIFFUSION MODEL 有权

公开(公告)号：US20240420455A1

公开(公告)日：2024-12-19

申请号：US18451878

申请日：2023-08-18

Applicant: International Business Machines Corporation , National Tsing Hua University

Inventor： Pin-Yu Chen , I-Hsin Chung , Bo Wu , Chuang Gan , Tsung-Yi Ho , Sheng-Yen Chou

IPC: G06V10/774

Abstract: Techniques regarding generating a synthetic dataset of objects are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can include a defining component that can define a tractable forward process associated with a diffusion model, with defining the tractable forward process including inputting noise to compromise training data, resulting in compromised training data. The computer executable components can further include a training component that, using the compromised training data, trains the diffusion model to reverse process the tractable forward process, wherein the training results in a compromised diffusion model.

68.

发明公开
SELF-SUPERVISED SPEECH REPRESENTATIONS BY DISENTANGLING SPEAKERS 审中-公开

公开(公告)号：US20240170007A1

公开(公告)日：2024-05-23

申请号：US18053056

申请日：2022-11-07

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor： Kaizhi Qian , Yang Zhang , Chuang Gan , Dakuo Wang , Bo Wu

IPC: G10L25/30 , G10L21/0272

CPC classification number: G10L25/30 , G10L21/0272

Abstract: A method, computer system and computer program product is presented for providing a self-supervised speech representation. In one embodiment, audio input is received including speech utterances. A label sequence is generated from these speech utterances by a teacher label generator. A speech representation is generated of a partially masked version of the speech utterance using a speech representation network. The speech utterance is passed into two random transformations that alter only speaker information prior to the partial masking. A predictor will then predict the label sequence. In one embodiment performance-based assessment is made on a cross-entropy loss between the generated label sequence and a predicted label sequence.

69.

发明授权
Natural language interaction with automated machine learning systems 有权

公开(公告)号：US11989237B2

公开(公告)日：2024-05-21

申请号：US16551021

申请日：2019-08-26

Applicant: International Business Machines Corporation

Inventor： Dakuo Wang , Ming Tan , Chuang Gan , Haoyu Wang , Mo Yu

IPC: G06F16/9032 , G06N5/04

CPC classification number: G06F16/90332 , G06N5/04

Abstract: An artificial intelligence (AI) interaction method, system, and computer program product include selecting an artificial intelligence model to respond to a query to generating a response to the query using the selected artificial intelligence model, and receiving the response to the query from the selected artificial intelligence model.

70.

发明公开
MODULARIZED ATTENTIVE GRAPH NETWORKS FOR FINE-GRAINED REFERRING EXPRESSION COMPREHENSION 审中-公开

公开(公告)号：US20240111950A1

公开(公告)日：2024-04-04

申请号：US17936097

申请日：2022-09-28

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor： Zhenfang Chen , Chuang Gan , Bo Wu , Dakuo Wang

IPC: G06F40/205 , G06N3/04 , G06T7/12

CPC classification number: G06F40/205 , G06N3/0454 , G06T7/12

Abstract: A computer-implemented method for fine-grained referring expression comprehension is provided. The computer-implemented method includes receiving, at a processor, a textual expression and an image as inputs and executing, at the processor, fine-grained referring expression comprehension. The executing includes decomposing the textual expression into different textual modules, extracting visual regional proposals from the image, using language-guided graph neural networks to mine fine-grained object relations from the visual regional proposals and aggregating different matching similarities between the different textual modules and the fine-grained object relations.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification