-
1.
公开(公告)号:US20240170005A1
公开(公告)日:2024-05-23
申请号:US18057967
申请日:2022-11-22
IPC分类号: G10L21/04
CPC分类号: G10L21/04
摘要: One or more systems, devices, computer program products and/or computer-implemented methods of use provided herein relate to length perturbation techniques for improving generalization of DNN acoustic models. A computer-implemented system can comprise a memory that can store computer executable components. The computer-implemented system can further comprise a processor that can execute the computer executable components stored in the memory, wherein the computer executable components can comprise a frame skipping component that can remove one or more frames from an acoustic utterance via frame skipping. The computer executable components can further comprise a frame insertion component that can insert one or more replacement frames into the acoustic utterance via frame insertion to replace the one or more frames with the one or more replacement frames to enable length perturbation of the acoustic utterance.
-
公开(公告)号:US20240095515A1
公开(公告)日:2024-03-21
申请号:US17943839
申请日:2022-09-13
IPC分类号: G06N3/08
CPC分类号: G06N3/08
摘要: Decentralized bilevel optimization techniques for personalized learning over a heterogenous network are provided. In one aspect, a decentralized learning system includes: a distributed machine learning network with multiple nodes, and datasets associated with the nodes; and a bilevel learning structure at each of the nodes for optimizing one or more features from each of the datasets using a decentralized bilevel optimization solver, while maintaining distinct features from each of the datasets. A method for decentralized learning is also provided.
-
公开(公告)号:US11893346B2
公开(公告)日:2024-02-06
申请号:US17308575
申请日:2021-05-05
发明人: Hui Wan , Xiaodong Cui , Luis A. Lastras-Montano
IPC分类号: G06F40/284 , G06F40/205 , G06F40/30 , G06F40/42 , G06F40/237 , G06V30/194
CPC分类号: G06F40/284 , G06F40/205 , G06F40/237 , G06F40/30 , G06F40/42 , G06V30/194
摘要: From metadata of a corpus of natural language text documents, a relativity matrix is constructed, a row-column intersection in the relativity matrix corresponding to a relationship between two instances of a type of metadata. An encoder model is trained, generating a trained encoder model, to compute an embedding corresponding to a token of a natural language text document within the corpus and the relativity matrix, the encoder model comprising a first encoder layer, the first encoder layer comprising a token embedding portion, a relativity embedding portion, a token self-attention portion, a metadata self-attention portion, and a fusion portion, the training comprising adjusting a set of parameters of the encoder model.
-
公开(公告)号:US11557053B2
公开(公告)日:2023-01-17
申请号:US16785469
申请日:2020-02-07
发明人: Rui Zhang , Conrad M. Albrecht , Siyuan Lu , Wei Zhang , Ulrich Alfons Finkler , David S. Kung , Xiaodong Cui , Marcus Freitag
摘要: Techniques for image processing and transformation are provided. A plurality of images and a plurality of maps are received, and a system of neural networks is trained based on the plurality of images and the plurality of maps. A first image is received, and a first map is generated by processing the first image using the system of neural networks.
-
公开(公告)号:US20220253426A1
公开(公告)日:2022-08-11
申请号:US17170164
申请日:2021-02-08
申请人: International Business Machines Corporation , The Board of Trustees of the University of Illinois
发明人: Yada Zhu , Jinjun Xiong , Jingrui He , Lecheng Zheng , Xiaodong Cui
摘要: Time series data can be received. A machine learning model can be trained using the time series data. A contaminating process can be estimated based on the time series data, the contaminating process including outliers associated with the time series data. A parameter associated with the contaminating process can be determined. Based on the trained machine learning model and the parameter associated with the contaminating process, a single-valued metric can be determined, which represents an impact of the contaminating process on the machine learning model's future prediction. A plurality of different outlier detecting machine learning models can be used to estimate the contaminating process and the single-valued metric can be determined for each of the plurality of different outlier detecting machine learning models. The plurality of different outlier detecting machine learning models can be ranked according to the associated single-valued metric.
-
公开(公告)号:US20220012584A1
公开(公告)日:2022-01-13
申请号:US16925178
申请日:2020-07-09
发明人: Wei Zhang , Xiaodong Cui , Abdullah Kayi , Alper Buyuktosunoglu
摘要: Embodiments of a method are disclosed. The method includes performing decentralized distributed deep learning training on a batch of training data. Additionally, the method includes determining a training time wherein the learner performs the decentralized distributed deep learning training on the batch of training data. Further, the method includes generating a table having the training time and other processing times for corresponding other learners performing the decentralized distributed deep learning training on corresponding other batches of other training data. The method also includes determining that the learner is a straggler based on the table and a threshold for the training time. Additionally, the method includes modifying a processing aspect of the straggler to reduce a future training time of the straggler for performing the decentralized distributed deep learning training on a new batch of training data in response to determining the learner is the straggler.
-
7.
公开(公告)号:US11636280B2
公开(公告)日:2023-04-25
申请号:US17159710
申请日:2021-01-27
发明人: Xiaodong Cui , Wei Zhang , Mingrui Liu , Abdullah Kayi , Youssef Mroueh , Alper Buyuktosunoglu
IPC分类号: G06K9/62 , G06F15/173 , G06N20/00 , G06N3/08
摘要: Systems, computer-implemented methods, and computer program products to facilitate updating, such as averaging and/or training, of one or more statistical sets are provided. According to an embodiment, a system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can include a computing component that averages a statistical set, provided by the system, with an additional statistical set, that is compatible with the statistical set, to compute an averaged statistical set, where the additional statistical set is obtained from a selected additional system of a plurality of additional systems. The computer executable components also can include a selecting component that selects the selected additional system according to a randomization pattern.
-
公开(公告)号:US20220358288A1
公开(公告)日:2022-11-10
申请号:US17308575
申请日:2021-05-05
发明人: Hui Wan , Xiaodong Cui , Luis A. Lastras-Montano
IPC分类号: G06F40/284 , G06F40/205 , G06F40/237 , G06F40/30 , G06F40/42 , G06K9/66
摘要: From metadata of a corpus of natural language text documents, a relativity matrix is constructed, a row-column intersection in the relativity matrix corresponding to a relationship between two instances of a type of metadata. An encoder model is trained, generating a trained encoder model, to compute an embedding corresponding to a token of a natural language text document within the corpus and the relativity matrix, the encoder model comprising a first encoder layer, the first encoder layer comprising a token embedding portion, a relativity embedding portion, a token self-attention portion, a metadata self-attention portion, and a fusion portion, the training comprising adjusting a set of parameters of the encoder model.
-
公开(公告)号:US20220012642A1
公开(公告)日:2022-01-13
申请号:US16925192
申请日:2020-07-09
发明人: Wei Zhang , Xiaodong Cui , Abdullah Kayi , Alper Buyuktosunoglu
摘要: Embodiments of a method are disclosed. The method includes performing distributed deep learning training on a batch of training data. The method also includes determining training times representing an amount of time between a beginning batch time and an end batch time. Further, the method includes modifying a communication aspect of the communication straggler to reduce a future network communication time for the communication straggler to send a future result of the distributed deep learning training on a new batch of training data in response to the centralized parameter server determining that the learner is the communication straggler.
-
公开(公告)号:US20180068655A1
公开(公告)日:2018-03-08
申请号:US15258836
申请日:2016-09-07
发明人: Xiaodong Cui , Vaibhava Goel
CPC分类号: G10L15/16 , G10L15/063 , G10L15/07 , G10L15/075 , G10L17/18
摘要: A computer-implemented method according to one embodiment includes estimating a speaker dependent acoustic model utilizing test speech data and a hybrid estimation technique, transforming labeled speech data to create transformed speech data, utilizing the speaker dependent acoustic model and a nonlinear transformation, and adjusting a deep neural network (DNN) acoustic model, utilizing the transformed speech data.
-
-
-
-
-
-
-
-
-