-
公开(公告)号:US11620582B2
公开(公告)日:2023-04-04
申请号:US16942247
申请日:2020-07-29
Applicant: International Business Machines Corporation
Inventor: Bei Chen , Long Vu , Syed Yousaf Shah , Xuan-Hong Dang , Peter Daniel Kirchner , Si Er Han , Ji Hui Yang , Jun Wang , Jing James Xu , Dakuo Wang , Dhavalkumar C. Patel , Gregory Bramble , Horst Cornelius Samulowitz , Saket Sathe , Chuang Gan
IPC: G06N20/20
Abstract: Techniques regarding one or more automated machine learning processes that analyze time series data are provided. For example, one or more embodiments described herein can comprise a system, which can comprise a memory that can store computer executable components. The system can also comprise a processor, operably coupled to the memory, and that can execute the computer executable components stored in the memory. The computer executable components can comprise a time series analysis component that selects a machine learning pipeline for meta transfer learning on time series data by sequentially allocating subsets of training data from the time series data amongst a plurality of machine learning pipeline candidates.
-
公开(公告)号:US20220392429A1
公开(公告)日:2022-12-08
申请号:US17337518
申请日:2021-06-03
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Kaizhi Qian , Yang Zhang , Shiyu Chang , Jinjun Xiong , Chuang Gan , David Cox
IPC: G10L13/10 , G06N20/00 , G10L21/013 , G10L17/04 , G10L25/63
Abstract: A computer-implemented method is provided of using a machine learning model for disentanglement of prosody in spoken natural language. The method includes encoding, by a computing device, the spoken natural language to produce content code. The method further includes resampling, by the computing device without text transcriptions, the content code to obscure the prosody by applying an unsupervised technique to the machine learning model to generate prosody-obscured content code. The method additionally includes decoding, by the computing device, the prosody-obscured content code to synthesize speech indirectly based upon the content code.
-
公开(公告)号:US20220374629A1
公开(公告)日:2022-11-24
申请号:US17315319
申请日:2021-05-09
Applicant: International Business Machines Corporation
Inventor: Bo Wu , Chuang Gan , Dakuo Wang , Kaizhi Qian
Abstract: A bi-directional spatial-temporal transformer neural network (BDSTT) is trained to predict original coordinates of a skeletal joint in a specific frame through relative relationships of the skeletal joint to other joints and to the state of the skeletal joint in other frames. Obtain a plurality of frames comprising coordinates of the skeletal joint and coordinates of other joints. Produce a spatially masked frame by masking the original coordinates of the skeletal joint. Provide the specific frame, the spatially masked frame, and at least one more frame to a coordinate prediction head of the BDSTT. Obtain, from the coordinate prediction head, a prediction of coordinates for the skeletal joint. Adjust parameters of the BDSTT until a mean-squared error, between the prediction of coordinates for the skeletal joint and the original coordinates of the skeletal joint, converges.
-
104.
公开(公告)号:US11488227B2
公开(公告)日:2022-11-01
申请号:US16936537
申请日:2020-07-23
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Qi Cheng Li , Lijun Mei , Hao Chen , Xin Zhou , Chuang Gan
IPC: G06Q30/00 , G06Q30/06 , G06Q10/00 , G06F16/9535 , G06F16/901 , G06Q10/06
Abstract: Techniques for topology based interoperability determination for an information technology (IT) infrastructure are described herein. An aspect includes receiving a replacement notification for an element of an IT infrastructure. A topology subgraph corresponding to the element is generated based on a topology graph of the IT infrastructure. A plurality of replacement candidates for the element is determined. A respective interoperability subgraph based on the topology subgraph is generated for each replacement candidate of the plurality of replacement candidates. A recommended replacement candidate is selected from the plurality of replacement candidates based on the generated interoperability subgraphs.
-
公开(公告)号:US11481425B2
公开(公告)日:2022-10-25
申请号:US17181397
申请日:2021-02-22
Applicant: International Business Machines Corporation
Inventor: Dakuo Wang , Yufang Hou , Xin Ru Wang , Yunfeng Zhang , Chuang Gan , Edward Sun
IPC: G06F16/43 , G06F16/438 , G06F16/34 , G06N20/00 , G06K9/62 , G06V30/416
Abstract: Systems and methods for creating presentation slides. A slide title is received and portions of source documents relevant to the title are identified based on a dense vector information retrieval machine learning process. An abstractive summary of the portions is generated based on a long form question answering machine learning process. A first presentation slide is created with the abstractive summary and the title. The first presentation slide is presented to an operator and an input indicating one of accepting or rejection the abstractive summary is received. Based on the input that indicating rejecting the abstractive summary, the abstractive summary is removed from the presentation slide and negative training feedback for the abstractive summary is provided to at least one of the dense vector information retrieval machine learning process or the long form question answering machine learning process.
-
公开(公告)号:US20220309278A1
公开(公告)日:2022-09-29
申请号:US17216605
申请日:2021-03-29
Applicant: International Business Machines Corporation
Inventor: Chuang Gan , Dakuo Wang , Antonio Jose Jimeno Yepes , Bo Wu
Abstract: Unsupervised learning for video classification. One or more features from one or more video clips are extracted using a spatial-temporal encoder. The one or more extracted features are processed, using a video instance discrimination task, to generate a classification label, the classification label indicating whether two of the video clips are from a same video. The one or more extracted features are processed, using a pair-wise speed discrimination task, to generate a comparison label, the comparison label indicating a relative playback speed between two given video clips. A search is performed in a video database for a video that is similar to a given video based on the comparison label.
-
公开(公告)号:US11442986B2
公开(公告)日:2022-09-13
申请号:US16792208
申请日:2020-02-15
Applicant: International Business Machines Corporation
Inventor: Chuang Gan , Sijia Liu , Subhro Das , Dakuo Wang , Yang Zhang
Abstract: Method and apparatus that includes receiving a query describing an aspect in a video, the video including a plurality of frames, identifying multiple proposals that potentially correspond to the query where each of the proposals includes a subset of the plurality of frames, ranking the proposals using a graph convolution network that identifies relationships between the proposals, and selecting, based on the ranking, one of the proposals as a video segment that correlates to the query.
-
公开(公告)号:US20220253714A1
公开(公告)日:2022-08-11
申请号:US17157077
申请日:2021-01-25
Inventor: Pin-Yu Chen , Chia-Yi Hsu , Songtao Lu , Sijia Liu , Chuang Gan , Chia-Mu Yu
Abstract: A trained machine learning model and a training dataset used to train the trained machine learning model can be received. Based on the training dataset, unsupervised adversarial examples can be generated. Robustness of the trained machine learning model can be determined using the generated unsupervised adversarial examples. The training dataset can be augmented with the generated unsupervised adversarial examples. The trained machine learning model can be retrained using the augmented training dataset.
-
公开(公告)号:US20220129679A1
公开(公告)日:2022-04-28
申请号:US17081239
申请日:2020-10-27
Applicant: International Business Machines Corporation
Inventor: Rameswar Panda , Chuang Gan , Pin-Yu Chen , Bo Wu
IPC: G06K9/00 , G06N20/00 , G06F16/783
Abstract: Machine learning-based techniques for summarizing collections of data such as image and video data leveraging side information obtained from related (e.g., video) data are provided. In one aspect, a method for video summarization includes: obtaining related videos having content related to a target video; and creating a summary of the target video using information provided by the target video and side information provided by the related videos to select portions of the target video to include in the summary. The side information can include video data, still image data, text, comments, natural language descriptions, and combinations thereof.
-
公开(公告)号:US11257222B2
公开(公告)日:2022-02-22
申请号:US16292847
申请日:2019-03-05
Applicant: International Business Machines Corporation
Inventor: Chuang Gan , Yang Zhang , Sijia Liu , Dakuo Wang
Abstract: Embodiments of the present invention are directed to a computer-implemented method for action localization. A non-limiting example of the computer-implemented method includes receiving, by a processor, a video and segmenting, by the processor, the video into a set of video segments. The computer-implemented method classifies, by the processor, each video segment into a class and calculates, by the processor, importance scores for each video segment of a class within the set of video segments. The computer-implemented method determines, by the processor, a winning video segment of the class within the set of video segments based on the importance scores for each video segment within the class, stores, by the processor, the winning video segment from the set of video segments, and removes the winning video segment from the set of video segments.
-
-
-
-
-
-
-
-
-