EXPLAINING OUTLIERS IN TIME SERIES AND EVALUATING ANOMALY DETECTION METHODS

    Publication Number: US20220253426A1

    Publication Date: 2022-08-11

    Application Number: US17170164

    Application Date: 2021-02-08

    IPC Classification: G06F16/23 G06N3/04 G06N3/08

    Abstract: Time series data can be received. A machine learning model can be trained using the time series data. A contaminating process can be estimated based on the time series data, the contaminating process including outliers associated with the time series data. A parameter associated with the contaminating process can be determined. Based on the trained machine learning model and the parameter associated with the contaminating process, a single-valued metric can be determined, which represents an impact of the contaminating process on the machine learning model's future prediction. A plurality of different outlier detecting machine learning models can be used to estimate the contaminating process, and the single-valued metric can be determined for each of the plurality of different outlier detecting machine learning models. The plurality of different outlier detecting machine learning models can be ranked according to the associated single-valued metric.
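
    One way such a single-valued impact metric could look is sketched below. This is a minimal illustration, not the patented method: it assumes an AR(1) surrogate forecaster, linear interpolation as the outlier cleanup, and the raw-minus-cleaned forecast error as the metric; `impact_metric` and `rank_detectors` are hypothetical names.

```python
import numpy as np

def impact_metric(series, outlier_mask, horizon=5):
    """Illustrative single-valued metric (an assumption, not the patent's
    formula): difference in held-out forecast error between a model fit on
    the raw series and one fit on the series with detected outliers
    replaced by interpolation."""
    def fit_ar1(y):
        # Least-squares slope of a one-step autoregressive model.
        x, t = y[:-1], y[1:]
        return np.dot(x, t) / np.dot(x, x)

    # Replace flagged outliers by interpolating over the clean points.
    cleaned = series.copy()
    bad = np.where(outlier_mask)[0]
    good = np.where(~outlier_mask)[0]
    cleaned[bad] = np.interp(bad, good, series[good])

    train, test = series[:-horizon], series[-horizon:]
    a_raw = fit_ar1(train)
    a_clean = fit_ar1(cleaned[:-horizon])
    err_raw = np.mean((test[1:] - a_raw * test[:-1]) ** 2)
    err_clean = np.mean((test[1:] - a_clean * test[:-1]) ** 2)
    # Larger value => the detector's cleanup helps future prediction more.
    return err_raw - err_clean

def rank_detectors(series, masks):
    """Rank outlier detectors (name -> boolean mask) by impact metric."""
    scores = {name: impact_metric(series, mask) for name, mask in masks.items()}
    return sorted(scores, key=scores.get, reverse=True)
```

    A detector that flags nothing leaves the series unchanged, so its metric is exactly zero; detectors whose cleanup improves the surrogate model's forecasts score higher and rank first.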

    LENGTH PERTURBATION TECHNIQUES FOR IMPROVING GENERALIZATION OF DEEP NEURAL NETWORK ACOUSTIC MODELS

    Publication Number: US20240170005A1

    Publication Date: 2024-05-23

    Application Number: US18057967

    Application Date: 2022-11-22

    IPC Classification: G10L21/04

    CPC Classification: G10L21/04

    Abstract: One or more systems, devices, computer program products and/or computer-implemented methods of use provided herein relate to length perturbation techniques for improving generalization of DNN acoustic models. A computer-implemented system can comprise a memory that can store computer executable components. The computer-implemented system can further comprise a processor that can execute the computer executable components stored in the memory, wherein the computer executable components can comprise a frame skipping component that can remove one or more frames from an acoustic utterance via frame skipping. The computer executable components can further comprise a frame insertion component that can insert one or more replacement frames into the acoustic utterance via frame insertion, replacing the one or more removed frames with the one or more replacement frames to enable length perturbation of the acoustic utterance.
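
    The frame-skipping and frame-insertion components could be sketched as a single augmentation function over a matrix of acoustic feature frames. The per-frame probabilities and the neighbor-averaging replacement frame are illustrative assumptions, not details from the patent.

```python
import numpy as np

def length_perturb(frames, skip_prob=0.1, insert_prob=0.1, seed=None):
    """Sketch of length perturbation on a (T, D) array of feature frames.
    Frames are randomly dropped (frame skipping) and replacement frames,
    here the average of two neighbors, are randomly inserted."""
    rng = np.random.default_rng(seed)
    out = []
    for t in range(len(frames)):
        if rng.random() < skip_prob:
            continue  # frame skipping: drop this frame
        out.append(frames[t])
        if t + 1 < len(frames) and rng.random() < insert_prob:
            # frame insertion: a replacement frame interpolated from
            # the current and next frame (an illustrative choice)
            out.append((frames[t] + frames[t + 1]) / 2.0)
    # Keep at least one frame so downstream models get a valid utterance.
    return np.stack(out) if out else frames[:1].copy()
```

    Applied during training, each pass over an utterance yields a slightly shorter or longer variant, which is the source of the regularization effect the abstract describes.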

    DYNAMIC COMPUTATION IN DECENTRALIZED DISTRIBUTED DEEP LEARNING TRAINING

    Publication Number: US20220012584A1

    Publication Date: 2022-01-13

    Application Number: US16925178

    Application Date: 2020-07-09

    IPC Classification: G06N3/08 G06N3/04

    Abstract: Embodiments of a method are disclosed. The method includes performing decentralized distributed deep learning training on a batch of training data. Additionally, the method includes determining a training time during which a learner performs the decentralized distributed deep learning training on the batch of training data. Further, the method includes generating a table of this training time and the processing times of other learners performing the decentralized distributed deep learning training on other batches of training data. The method also includes determining that the learner is a straggler based on the table and a threshold for the training time. Additionally, in response to determining that the learner is the straggler, the method includes modifying a processing aspect of the straggler to reduce its future training time for performing the decentralized distributed deep learning training on a new batch of training data.
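
    The table-and-threshold straggler check, and one possible "processing aspect" modification, could be sketched as follows. The mean-based threshold factor and the batch-size shrink are illustrative assumptions; the patent leaves both open.

```python
def find_stragglers(training_times, threshold_factor=1.5):
    """Sketch: the 'table' is a dict of learner -> training time for the
    current batch. A learner is flagged as a straggler when its time
    exceeds threshold_factor times the mean (factor is an assumption)."""
    mean_time = sum(training_times.values()) / len(training_times)
    threshold = threshold_factor * mean_time
    return {learner for learner, t in training_times.items() if t > threshold}

def shrink_straggler_batches(batch_sizes, stragglers, factor=0.8):
    """One possible 'processing aspect' to modify: give stragglers a
    smaller local batch so their next iteration finishes sooner."""
    return {learner: max(1, int(size * factor)) if learner in stragglers else size
            for learner, size in batch_sizes.items()}
```

    In a decentralized setting each learner can run this check itself after exchanging timing entries with its neighbors, which is what distinguishes this scheme from a parameter-server-driven one.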

    TRANSFORMER-BASED ENCODING INCORPORATING METADATA

    Publication Number: US20220358288A1

    Publication Date: 2022-11-10

    Application Number: US17308575

    Application Date: 2021-05-05

    Abstract: From metadata of a corpus of natural language text documents, a relativity matrix is constructed, a row-column intersection in the relativity matrix corresponding to a relationship between two instances of a type of metadata. An encoder model is trained, generating a trained encoder model, to compute an embedding corresponding to a token of a natural language text document within the corpus and the relativity matrix, the encoder model comprising a first encoder layer, the first encoder layer comprising a token embedding portion, a relativity embedding portion, a token self-attention portion, a metadata self-attention portion, and a fusion portion, the training comprising adjusting a set of parameters of the encoder model.
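
    The relativity matrix construction could be sketched as below. The abstract leaves the relationship between two metadata instances open; binary equality of the metadata value (e.g., same author) is an illustrative assumption here.

```python
import numpy as np

def relativity_matrix(metadata_instances):
    """Sketch: build a relativity matrix for one metadata type.
    Entry (i, j) encodes a relationship between metadata instance i and
    instance j; here it is 1.0 when the two values are equal (an
    assumed relationship), 0.0 otherwise."""
    n = len(metadata_instances)
    R = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            if metadata_instances[i] == metadata_instances[j]:
                R[i, j] = 1.0
    return R
```

    The relativity embedding portion of the first encoder layer would consume such a matrix alongside the token embeddings, with the metadata self-attention and fusion portions combining the two streams; those portions are not sketched here.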

    DYNAMIC NETWORK BANDWIDTH IN DISTRIBUTED DEEP LEARNING TRAINING

    Publication Number: US20220012642A1

    Publication Date: 2022-01-13

    Application Number: US16925192

    Application Date: 2020-07-09

    IPC Classification: G06N20/20 H04L12/24 G06N3/02

    Abstract: Embodiments of a method are disclosed. The method includes performing distributed deep learning training on a batch of training data. The method also includes determining training times representing an amount of time between a beginning batch time and an end batch time. Further, in response to a centralized parameter server determining that a learner is a communication straggler, the method includes modifying a communication aspect of the communication straggler to reduce a future network communication time for the communication straggler to send a future result of the distributed deep learning training on a new batch of training data.
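
    The parameter server's communication-straggler check, and one possible "communication aspect" modification, could be sketched as follows. The fixed threshold and the gradient-compression adjustment are illustrative assumptions, not details from the patent.

```python
def find_comm_stragglers(comm_times, threshold):
    """Sketch of the centralized parameter server's check: a learner is a
    communication straggler when its network communication time for
    sending its result exceeds a threshold (threshold is an assumption)."""
    return {learner for learner, t in comm_times.items() if t > threshold}

def increase_compression(ratios, stragglers, step=0.5):
    """One possible 'communication aspect' to modify: increase gradient
    compression for stragglers (a smaller ratio means fewer bytes sent),
    so their next result transfers faster."""
    return {learner: r * step if learner in stragglers else r
            for learner, r in ratios.items()}
```

    Unlike the decentralized variant above, here the centralized parameter server observes every learner's communication time directly and pushes the adjusted setting back to the flagged learner.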