-
1.
公开(公告)号:US11636280B2
公开(公告)日:2023-04-25
申请号:US17159710
申请日:2021-01-27
Applicant: International Business Machines Corporation
Inventor: Xiaodong Cui , Wei Zhang , Mingrui Liu , Abdullah Kayi , Youssef Mroueh , Alper Buyuktosunoglu
IPC: G06K9/62 , G06F15/173 , G06N20/00 , G06N3/08
Abstract: Systems, computer-implemented methods, and computer program products to facilitate updating, such as averaging and/or training, of one or more statistical sets are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can include a computing component that averages a statistical set, provided by the system, with an additional statistical set, that is compatible with the statistical set, to compute an averaged statistical set, where the additional statistical set is obtained from a selected additional system of a plurality of additional systems. The computer executable components also can include a selecting component that selects the selected additional system according to a randomization pattern.
-
公开(公告)号:US20220358288A1
公开(公告)日:2022-11-10
申请号:US17308575
申请日:2021-05-05
Applicant: International Business Machines Corporation
Inventor: Hui Wan , Xiaodong Cui , Luis A. Lastras-Montano
IPC: G06F40/284 , G06F40/205 , G06F40/237 , G06F40/30 , G06F40/42 , G06K9/66
Abstract: From metadata of a corpus of natural language text documents, a relativity matrix is constructed, a row-column intersection in the relativity matrix corresponding to a relationship between two instances of a type of metadata. An encoder model is trained, generating a trained encoder model, to compute an embedding corresponding to a token of a natural language text document within the corpus and the relativity matrix, the encoder model comprising a first encoder layer, the first encoder layer comprising a token embedding portion, a relativity embedding portion, a token self-attention portion, a metadata self-attention portion, and a fusion portion, the training comprising adjusting a set of parameters of the encoder model.
-
公开(公告)号:US20220012642A1
公开(公告)日:2022-01-13
申请号:US16925192
申请日:2020-07-09
Applicant: International Business Machines Corporation
Inventor: Wei Zhang , Xiaodong Cui , Abdullah Kayi , Alper Buyuktosunoglu
Abstract: Embodiments of a method are disclosed. The method includes performing distributed deep learning training on a batch of training data. The method also includes determining training times representing an amount of time between a beginning batch time and an end batch time. Further, the method includes modifying a communication aspect of the communication straggler to reduce a future network communication time for the communication straggler to send a future result of the distributed deep learning training on a new batch of training data in response to the centralized parameter server determining that the learner is the communication straggler.
-
公开(公告)号:US20180068655A1
公开(公告)日:2018-03-08
申请号:US15258836
申请日:2016-09-07
Applicant: International Business Machines Corporation
Inventor: Xiaodong Cui , Vaibhava Goel
CPC classification number: G10L15/16 , G10L15/063 , G10L15/07 , G10L15/075 , G10L17/18
Abstract: A computer-implemented method according to one embodiment includes estimating a speaker dependent acoustic model utilizing test speech data and a hybrid estimation technique, transforming labeled speech data to create transformed speech data, utilizing the speaker dependent acoustic model and a nonlinear transformation, and adjusting a deep neural network (DNN) acoustic model, utilizing the transformed speech data.
-
公开(公告)号:US20250068635A1
公开(公告)日:2025-02-27
申请号:US18453127
申请日:2023-08-21
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Hui Wan , Xiaodong Cui , Songtao Lu , Marina Danilevsky Hailpern
IPC: G06F16/2457
Abstract: A method, computer system, and a computer program product are provided for a context-aware relevancy modelling in conversational systems. A user query is received. A latent static content d is selected from a corpus of content D. A latent set of context C from a set of external context Cu is also selected. A result is generated using a scoring function and using the latent static content d from a corpus D and the latent set of context C from the set of external contexts CU so as to provide a most relevant context-base search response to said user query q. The result provides a most relevant context-base search response to said user query q. A response is then generated based on said result using said scoring function result to said user query q.
-
6.
公开(公告)号:US20240170005A1
公开(公告)日:2024-05-23
申请号:US18057967
申请日:2022-11-22
Applicant: International Business Machines Corporation
Inventor: Xiaodong Cui , Brian E. D. Kingsbury , George Andrei Saon
IPC: G10L21/04
CPC classification number: G10L21/04
Abstract: One or more systems, devices, computer program products and/or computer-implemented methods of use provided herein relate to length perturbation techniques for improving generalization of DNN acoustic models. A computer-implemented system can comprise a memory that can store computer executable components. The computer-implemented system can further comprise a processor that can execute the computer executable components stored in the memory, wherein the computer executable components can comprise a frame skipping component that can remove one or more frames from an acoustic utterance via frame skipping. The computer executable components can further comprise a frame insertion component that can insert one or more replacement frames into the acoustic utterance via frame insertion to replace the one or more frames with the one or more replacement frames to enable length perturbation of the acoustic utterance.
-
公开(公告)号:US20240095515A1
公开(公告)日:2024-03-21
申请号:US17943839
申请日:2022-09-13
Applicant: International Business Machines Corporation
Inventor: Songtao Lu , Xiaodong Cui , Mark S. Squillante , Brian E.D. Kingsbury , Lior Horesh
IPC: G06N3/08
CPC classification number: G06N3/08
Abstract: Decentralized bilevel optimization techniques for personalized learning over a heterogenous network are provided. In one aspect, a decentralized learning system includes: a distributed machine learning network with multiple nodes, and datasets associated with the nodes; and a bilevel learning structure at each of the nodes for optimizing one or more features from each of the datasets using a decentralized bilevel optimization solver, while maintaining distinct features from each of the datasets. A method for decentralized learning is also provided.
-
公开(公告)号:US11893346B2
公开(公告)日:2024-02-06
申请号:US17308575
申请日:2021-05-05
Applicant: International Business Machines Corporation
Inventor: Hui Wan , Xiaodong Cui , Luis A. Lastras-Montano
IPC: G06F40/284 , G06F40/205 , G06F40/30 , G06F40/42 , G06F40/237 , G06V30/194
CPC classification number: G06F40/284 , G06F40/205 , G06F40/237 , G06F40/30 , G06F40/42 , G06V30/194
Abstract: From metadata of a corpus of natural language text documents, a relativity matrix is constructed, a row-column intersection in the relativity matrix corresponding to a relationship between two instances of a type of metadata. An encoder model is trained, generating a trained encoder model, to compute an embedding corresponding to a token of a natural language text document within the corpus and the relativity matrix, the encoder model comprising a first encoder layer, the first encoder layer comprising a token embedding portion, a relativity embedding portion, a token self-attention portion, a metadata self-attention portion, and a fusion portion, the training comprising adjusting a set of parameters of the encoder model.
-
公开(公告)号:US11557053B2
公开(公告)日:2023-01-17
申请号:US16785469
申请日:2020-02-07
Applicant: International Business Machines Corporation
Inventor: Rui Zhang , Conrad M. Albrecht , Siyuan Lu , Wei Zhang , Ulrich Alfons Finkler , David S. Kung , Xiaodong Cui , Marcus Freitag
Abstract: Techniques for image processing and transformation are provided. A plurality of images and a plurality of maps are received, and a system of neural networks is trained based on the plurality of images and the plurality of maps. A first image is received, and a first map is generated by processing the first image using the system of neural networks.
-
公开(公告)号:US20220253426A1
公开(公告)日:2022-08-11
申请号:US17170164
申请日:2021-02-08
Applicant: International Business Machines Corporation , The Board of Trustees of the University of Illinois
Inventor: Yada Zhu , Jinjun Xiong , Jingrui He , Lecheng Zheng , Xiaodong Cui
Abstract: Time series data can be received. A machine learning model can be trained using the time series data. A contaminating process can be estimated based on the time series data, the contaminating process including outliers associated with the time series data. A parameter associated with the contaminating process can be determined. Based on the trained machine learning model and the parameter associated with the contaminating process, a single-valued metric can be determined, which represents an impact of the contaminating process on the machine learning model's future prediction. A plurality of different outlier detecting machine learning models can be used to estimate the contaminating process and the single-valued metric can be determined for each of the plurality of different outlier detecting machine learning models. The plurality of different outlier detecting machine learning models can be ranked according to the associated single-valued metric.
-
-
-
-
-
-
-
-
-