Integration of knowledge graph embedding into topic modeling with hierarchical Dirichlet process

    Publication No.: US11636355B2

    Publication date: 2023-04-25

    Application No.: US16427225

    Filing date: 2019-05-30

    Applicant: Baidu USA, LLC

    Abstract: Leveraging domain knowledge is an effective strategy for enhancing the quality of inferred low-dimensional representations of documents by topic models. Presented herein are embodiments of a Bayesian nonparametric model that employs knowledge graph (KG) embedding in the context of topic modeling to extract more coherent topics; embodiments of the model may be referred to as topic modeling with knowledge graph embedding (TMKGE). TMKGE embodiments are hierarchical Dirichlet process (HDP)-based models that flexibly borrow information from a KG to improve the interpretability of topics. Also, embodiments of a new, efficient online variational inference method based on a stick-breaking construction of the HDP were developed for TMKGE models, making TMKGE suitable for large document corpora and KGs. Experiments on datasets illustrate the superior performance of TMKGE in terms of topic coherence and document classification accuracy, compared to state-of-the-art topic modeling methods.
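
    The online variational inference mentioned in the abstract builds on the stick-breaking construction of the HDP. Below is a minimal Python sketch of the generic truncated stick-breaking construction of Dirichlet-process weights only; it is not the patented TMKGE inference procedure, and the concentration parameter, truncation level, and function name are illustrative assumptions.

```python
# Generic truncated stick-breaking construction of Dirichlet-process weights
# (a sketch of the building block, not the patented TMKGE inference).
import numpy as np

def stick_breaking_weights(alpha, truncation, seed=None):
    """Sethuraman construction: beta_k = v_k * prod_{j<k} (1 - v_j), v_k ~ Beta(1, alpha)."""
    rng = np.random.default_rng(seed)
    v = rng.beta(1.0, alpha, size=truncation)                       # stick proportions
    remaining = np.concatenate(([1.0], np.cumprod(1.0 - v)[:-1]))   # mass left before stick k
    weights = v * remaining
    weights[-1] = 1.0 - weights[:-1].sum()                          # fold leftover mass into the last stick
    return weights

if __name__ == "__main__":
    w = stick_breaking_weights(alpha=1.0, truncation=20, seed=0)
    print(w.round(4), w.sum())                                      # non-negative weights summing to 1
```

    Truncating at a fixed number of sticks and folding the leftover mass into the last weight is the standard device that makes the infinite mixture tractable for variational inference.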

    Learning latent structural relations with segmentation variational autoencoders

    Publication No.: US11816533B2

    Publication date: 2023-11-14

    Application No.: US16951158

    Filing date: 2020-11-18

    Applicant: Baidu USA, LLC

    IPC classification: G06N5/04 G06N3/088 G06N3/045

    CPC classification: G06N5/042 G06N3/045 G06N3/088

    Abstract: Learning disentangled representations is an important topic in machine learning for a wide range of applications. Disentangled latent variables represent interpretable semantic information and reflect separate factors of variation in data. Although generative models may learn latent representations and generate data samples, existing models may ignore the structural information among latent representations. Described in the present disclosure are embodiments that learn disentangled latent structural representations from data using decomposable variational autoencoders, which simultaneously learn component representations and encode component relationships. Embodiments of a novel structural prior for latent representations are disclosed to capture interactions among different data components. Embodiments are applied to data segmentation and to latent relation discovery among different data components. Experiments on several datasets demonstrate the utility of the present model embodiments.
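
    As a rough illustration of the decomposable encoding described above, the following sketch encodes each data component with its own encoder and decodes the codes jointly. The component count, layer sizes, and the standard-normal prior standing in for the disclosed structural prior are all illustrative assumptions, not the patented architecture.

```python
# Sketch of a decomposable VAE: one encoder per data component, a joint decoder.
# The standard-normal prior here stands in for the disclosed structural prior.
import torch
import torch.nn as nn

class ComponentVAE(nn.Module):
    def __init__(self, x_dim=784, z_dim=8, n_components=2):
        super().__init__()
        self.encoders = nn.ModuleList(
            nn.Sequential(nn.Linear(x_dim, 128), nn.ReLU(), nn.Linear(128, 2 * z_dim))
            for _ in range(n_components))
        self.decoder = nn.Sequential(
            nn.Linear(n_components * z_dim, 128), nn.ReLU(),
            nn.Linear(128, n_components * x_dim))

    def forward(self, components):
        codes, kl = [], 0.0
        for enc, x in zip(self.encoders, components):
            mu, logvar = enc(x).chunk(2, dim=-1)
            z = mu + torch.randn_like(mu) * (0.5 * logvar).exp()          # reparameterization trick
            kl = kl + 0.5 * (mu.pow(2) + logvar.exp() - 1.0 - logvar).sum(-1).mean()
            codes.append(z)
        recon = self.decoder(torch.cat(codes, dim=-1))                    # decode all components jointly
        return recon, kl

if __name__ == "__main__":
    model = ComponentVAE()
    parts = [torch.rand(4, 784), torch.rand(4, 784)]                      # two toy data components
    recon, kl = model(parts)
    loss = (recon - torch.cat(parts, dim=-1)).pow(2).mean() + 1e-3 * kl   # ELBO-style objective
    print(loss.item())
```

    Because each component keeps its own latent code while reconstruction is joint, relations among components can be read off the concatenated latent space.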

    Representation learning for input classification via topic sparse autoencoder and entity embedding

    Publication No.: US11615311B2

    Publication date: 2023-03-28

    Application No.: US16691554

    Filing date: 2019-11-21

    Applicant: Baidu USA, LLC

    Abstract: Described herein are embodiments of a unified neural network framework that integrates Topic modeling, Word embedding, and Entity Embedding (TWEE) for representation learning of inputs. In one or more embodiments, a novel topic sparse autoencoder is introduced to incorporate discriminative topics into the representation learning of the input. Topic distributions of inputs are generated from a global viewpoint and are utilized to enable the autoencoder to learn topical representations. A sparsity constraint may be added to ensure that the most discriminative representations are related to topics. In addition, both word- and entity-related information may be embedded into the network to help learn a more comprehensive input representation. Extensive empirical experiments show that embodiments of the TWEE framework outperform state-of-the-art methods on different datasets.
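
    A minimal sketch of the topic-sparsity idea follows: an autoencoder whose hidden code is pulled toward a document's topic distribution by a KL penalty. It omits the word and entity embedding channels of TWEE, and the layer sizes, loss weights, and randomly generated stand-in topic proportions are illustrative assumptions rather than the patented network.

```python
# Sketch of a topic-sparsity penalty on an autoencoder code (TWEE's word and
# entity embedding channels are omitted; sizes and weights are illustrative).
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopicSparseAutoencoder(nn.Module):
    def __init__(self, x_dim=2000, n_topics=20):
        super().__init__()
        self.encoder = nn.Linear(x_dim, n_topics)
        self.decoder = nn.Linear(n_topics, x_dim)

    def forward(self, x):
        h = torch.softmax(self.encoder(x), dim=-1)   # topic-shaped hidden code
        return self.decoder(h), h

def topic_sparse_loss(x, recon, h, topic_dist, sparsity_weight=1.0):
    """Reconstruction error plus KL(topic_dist || h), pulling the code toward the topics."""
    kl = (topic_dist * (topic_dist.clamp_min(1e-8).log() - h.clamp_min(1e-8).log())).sum(-1).mean()
    return F.mse_loss(recon, x) + sparsity_weight * kl

if __name__ == "__main__":
    model = TopicSparseAutoencoder()
    x = torch.rand(8, 2000)                              # toy bag-of-words inputs
    topics = torch.softmax(torch.randn(8, 20), dim=-1)   # stand-in for externally inferred topic proportions
    recon, h = model(x)
    print(topic_sparse_loss(x, recon, h, topics).item())
```

    The KL penalty is what keeps the most active code dimensions aligned with discriminative topics instead of arbitrary reconstruction features.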

    Systems and methods for mutual learning for topic discovery and word embedding

    Publication No.: US11568266B2

    Publication date: 2023-01-31

    Application No.: US16355622

    Filing date: 2019-03-15

    Applicant: Baidu USA, LLC

    Abstract: Described herein are embodiments of systems and methods for mutual machine learning with global topic discovery and local word embedding. Both topic modeling and word embedding map documents onto a low-dimensional space, with the former clustering words into a global topic space and the latter mapping words into a local continuous embedding space. Embodiments of the Topic Modeling and Sparse Autoencoder (TMSA) framework unify these two complementary patterns by constructing a mutual learning mechanism between word co-occurrence-based topic modeling and an autoencoder. In embodiments, word topics generated with topic modeling are passed into the autoencoder to impose topic sparsity, so the autoencoder learns topic-relevant word representations. In return, word embeddings learned by the autoencoder are sent back to the topic modeling component to improve the quality of topic generation. Performance evaluation on various datasets demonstrates the effectiveness of the disclosed TMSA framework in discovering topics and embedding words.
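
    The sketch below only illustrates the flavor of such a mutual learning loop: an NMF-style factorization of the word co-occurrence matrix and a small autoencoder take turns, with topics regularizing the autoencoder's code and the encoder weights blended back into the topic factors. The specific coupling terms, blending weights, and hyperparameters are illustrative assumptions and do not reproduce the patented TMSA updates.

```python
# Sketch of an alternating topic-model / autoencoder loop (the coupling terms
# and hyperparameters are illustrative, not the patented TMSA updates).
import torch

torch.manual_seed(0)
V, K, N = 300, 10, 100                       # vocabulary size, topics, documents
X = torch.rand(N, V)                         # toy document-word matrix
C = X.T @ X                                  # word co-occurrence matrix

W = torch.rand(V, K) + 1e-3                  # word-topic factors (NMF-style topic model)
H = torch.rand(K, V) + 1e-3
enc = torch.nn.Linear(V, K)                  # autoencoder encoder (columns of weight act as word embeddings)
dec = torch.nn.Linear(K, V)                  # autoencoder decoder
opt = torch.optim.Adam(list(enc.parameters()) + list(dec.parameters()), lr=1e-3)

for step in range(200):
    # Topic side: one multiplicative NMF update on the co-occurrence matrix,
    # then blend in the non-negative part of the current encoder weights.
    with torch.no_grad():
        W *= (C @ H.T) / (W @ H @ H.T + 1e-9)
        H *= (W.T @ C) / (W.T @ W @ H + 1e-9)
        W = 0.95 * W + 0.05 * enc.weight.T.clamp_min(1e-3)

    # Autoencoder side: reconstruction plus a pull of the code toward the
    # document-topic mixture implied by the current topics (topic-sparsity proxy).
    code = torch.softmax(enc(X), dim=-1)
    doc_topics = torch.softmax(X @ W, dim=-1)
    loss = (dec(code) - X).pow(2).mean() + 0.1 * (code - doc_topics).pow(2).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()

print("final loss:", loss.item())
```

    The alternation is the point: each side's latest estimate enters the other side's next update, which is the mutual learning pattern the abstract describes.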