Patent search ap:("INTERNATIONAL BUSINESS MACHINES CORPORATION") AND inv:"Xiaodong Cui" Page 1

1.

发明授权
Updating of statistical sets for decentralized distributed training of a machine learning model 有权

公开(公告)号：US11636280B2

公开(公告)日：2023-04-25

申请号：US17159710

申请日：2021-01-27

Applicant: International Business Machines Corporation

Inventor： Xiaodong Cui , Wei Zhang , Mingrui Liu , Abdullah Kayi , Youssef Mroueh , Alper Buyuktosunoglu

IPC: G06K9/62 , G06F15/173 , G06N20/00 , G06N3/08

Abstract: Systems, computer-implemented methods, and computer program products to facilitate updating, such as averaging and/or training, of one or more statistical sets are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can include a computing component that averages a statistical set, provided by the system, with an additional statistical set, that is compatible with the statistical set, to compute an averaged statistical set, where the additional statistical set is obtained from a selected additional system of a plurality of additional systems. The computer executable components also can include a selecting component that selects the selected additional system according to a randomization pattern.

2.

发明申请
TRANSFORMER-BASED ENCODING INCORPORATING METADATA 有权

公开(公告)号：US20220358288A1

公开(公告)日：2022-11-10

申请号：US17308575

申请日：2021-05-05

Applicant: International Business Machines Corporation

Inventor： Hui Wan , Xiaodong Cui , Luis A. Lastras-Montano

IPC: G06F40/284 , G06F40/205 , G06F40/237 , G06F40/30 , G06F40/42 , G06K9/66

Abstract: From metadata of a corpus of natural language text documents, a relativity matrix is constructed, a row-column intersection in the relativity matrix corresponding to a relationship between two instances of a type of metadata. An encoder model is trained, generating a trained encoder model, to compute an embedding corresponding to a token of a natural language text document within the corpus and the relativity matrix, the encoder model comprising a first encoder layer, the first encoder layer comprising a token embedding portion, a relativity embedding portion, a token self-attention portion, a metadata self-attention portion, and a fusion portion, the training comprising adjusting a set of parameters of the encoder model.

3.

发明申请
DYNAMIC NETWORK BANDWIDTH IN DISTRIBUTED DEEP LEARNING TRAINING 有权

公开(公告)号：US20220012642A1

公开(公告)日：2022-01-13

申请号：US16925192

申请日：2020-07-09

Applicant: International Business Machines Corporation

Inventor： Wei Zhang , Xiaodong Cui , Abdullah Kayi , Alper Buyuktosunoglu

IPC: G06N20/20 , H04L12/24 , G06N3/02

Abstract: Embodiments of a method are disclosed. The method includes performing distributed deep learning training on a batch of training data. The method also includes determining training times representing an amount of time between a beginning batch time and an end batch time. Further, the method includes modifying a communication aspect of the communication straggler to reduce a future network communication time for the communication straggler to send a future result of the distributed deep learning training on a new batch of training data in response to the centralized parameter server determining that the learner is the communication straggler.

4.

发明申请
ADJUSTING A DEEP NEURAL NETWORK ACOUSTIC MODEL 审中-公开

公开(公告)号：US20180068655A1

公开(公告)日：2018-03-08

申请号：US15258836

申请日：2016-09-07

Applicant: International Business Machines Corporation

Inventor： Xiaodong Cui , Vaibhava Goel

IPC: G10L15/16 , G10L15/07 , G10L15/06

CPC classification number: G10L15/16 , G10L15/063 , G10L15/07 , G10L15/075 , G10L17/18

Abstract: A computer-implemented method according to one embodiment includes estimating a speaker dependent acoustic model utilizing test speech data and a hybrid estimation technique, transforming labeled speech data to create transformed speech data, utilizing the speaker dependent acoustic model and a nonlinear transformation, and adjusting a deep neural network (DNN) acoustic model, utilizing the transformed speech data.

5.

发明申请
CONTEXT-AWARE RELEVANCE MODELING IN CONVERSATIONAL SYSTEMS 有权

公开(公告)号：US20250068635A1

公开(公告)日：2025-02-27

申请号：US18453127

申请日：2023-08-21

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventor： Hui Wan , Xiaodong Cui , Songtao Lu , Marina Danilevsky Hailpern

IPC: G06F16/2457

Abstract: A method, computer system, and a computer program product are provided for a context-aware relevancy modelling in conversational systems. A user query is received. A latent static content d is selected from a corpus of content D. A latent set of context C from a set of external context Cu is also selected. A result is generated using a scoring function and using the latent static content d from a corpus D and the latent set of context C from the set of external contexts CU so as to provide a most relevant context-base search response to said user query q. The result provides a most relevant context-base search response to said user query q. A response is then generated based on said result using said scoring function result to said user query q.

6.

发明公开
LENGTH PERTURBATION TECHNIQUES FOR IMPROVING GENERALIZATION OF DEEP NEURAL NETWORK ACOUSTIC MODELS 审中-公开

公开(公告)号：US20240170005A1

公开(公告)日：2024-05-23

申请号：US18057967

申请日：2022-11-22

Applicant: International Business Machines Corporation

Inventor： Xiaodong Cui , Brian E. D. Kingsbury , George Andrei Saon

IPC: G10L21/04

CPC classification number: G10L21/04

Abstract: One or more systems, devices, computer program products and/or computer-implemented methods of use provided herein relate to length perturbation techniques for improving generalization of DNN acoustic models. A computer-implemented system can comprise a memory that can store computer executable components. The computer-implemented system can further comprise a processor that can execute the computer executable components stored in the memory, wherein the computer executable components can comprise a frame skipping component that can remove one or more frames from an acoustic utterance via frame skipping. The computer executable components can further comprise a frame insertion component that can insert one or more replacement frames into the acoustic utterance via frame insertion to replace the one or more frames with the one or more replacement frames to enable length perturbation of the acoustic utterance.

7.

发明公开
Bilevel Optimization Based Decentralized Framework for Personalized Client Learning 审中-公开

公开(公告)号：US20240095515A1

公开(公告)日：2024-03-21

申请号：US17943839

申请日：2022-09-13

Applicant: International Business Machines Corporation

Inventor： Songtao Lu , Xiaodong Cui , Mark S. Squillante , Brian E.D. Kingsbury , Lior Horesh

IPC: G06N3/08

CPC classification number: G06N3/08

Abstract: Decentralized bilevel optimization techniques for personalized learning over a heterogenous network are provided. In one aspect, a decentralized learning system includes: a distributed machine learning network with multiple nodes, and datasets associated with the nodes; and a bilevel learning structure at each of the nodes for optimizing one or more features from each of the datasets using a decentralized bilevel optimization solver, while maintaining distinct features from each of the datasets. A method for decentralized learning is also provided.

8.

发明授权
Transformer-based encoding incorporating metadata 有权

公开(公告)号：US11893346B2

公开(公告)日：2024-02-06

申请号：US17308575

申请日：2021-05-05

Applicant: International Business Machines Corporation

Inventor： Hui Wan , Xiaodong Cui , Luis A. Lastras-Montano

IPC: G06F40/284 , G06F40/205 , G06F40/30 , G06F40/42 , G06F40/237 , G06V30/194

CPC classification number: G06F40/284 , G06F40/205 , G06F40/237 , G06F40/30 , G06F40/42 , G06V30/194

Abstract: From metadata of a corpus of natural language text documents, a relativity matrix is constructed, a row-column intersection in the relativity matrix corresponding to a relationship between two instances of a type of metadata. An encoder model is trained, generating a trained encoder model, to compute an embedding corresponding to a token of a natural language text document within the corpus and the relativity matrix, the encoder model comprising a first encoder layer, the first encoder layer comprising a token embedding portion, a relativity embedding portion, a token self-attention portion, a metadata self-attention portion, and a fusion portion, the training comprising adjusting a set of parameters of the encoder model.

9.

发明授权
Deep learning to correct map and image features 有权

公开(公告)号：US11557053B2

公开(公告)日：2023-01-17

申请号：US16785469

申请日：2020-02-07

Applicant: International Business Machines Corporation

Inventor： Rui Zhang , Conrad M. Albrecht , Siyuan Lu , Wei Zhang , Ulrich Alfons Finkler , David S. Kung , Xiaodong Cui , Marcus Freitag

IPC: G06T7/55 , G06T7/33 , G06N3/08 , G06N20/20 , G06N3/04

Abstract: Techniques for image processing and transformation are provided. A plurality of images and a plurality of maps are received, and a system of neural networks is trained based on the plurality of images and the plurality of maps. A first image is received, and a first map is generated by processing the first image using the system of neural networks.

10.

发明申请
EXPLAINING OUTLIERS IN TIME SERIES AND EVALUATING ANOMALY DETECTION METHODS 有权

公开(公告)号：US20220253426A1

公开(公告)日：2022-08-11

申请号：US17170164

申请日：2021-02-08

Applicant: International Business Machines Corporation , The Board of Trustees of the University of Illinois

Inventor： Yada Zhu , Jinjun Xiong , Jingrui He , Lecheng Zheng , Xiaodong Cui

IPC: G06F16/23 , G06N3/04 , G06N3/08

Abstract: Time series data can be received. A machine learning model can be trained using the time series data. A contaminating process can be estimated based on the time series data, the contaminating process including outliers associated with the time series data. A parameter associated with the contaminating process can be determined. Based on the trained machine learning model and the parameter associated with the contaminating process, a single-valued metric can be determined, which represents an impact of the contaminating process on the machine learning model's future prediction. A plurality of different outlier detecting machine learning models can be used to estimate the contaminating process and the single-valued metric can be determined for each of the plurality of different outlier detecting machine learning models. The plurality of different outlier detecting machine learning models can be ranked according to the associated single-valued metric.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification