-
公开(公告)号:US20180293499A1
公开(公告)日:2018-10-11
申请号:US15484577
申请日:2017-04-11
Applicant: SAP SE
Inventor: Ruidan He , Daniel Dahlmeier
IPC: G06N3/08
Abstract: Methods, systems, and computer-readable storage media for receiving a vocabulary, the vocabulary including text data that is provided as at least a portion of raw data, the raw data being provided in a computer-readable file, associating each word in the vocabulary with a feature vector, providing a sentence embedding for each sentence of the vocabulary based on a plurality of feature vectors to provide a plurality of sentence embeddings, providing a reconstructed sentence embedding for each sentence embedding based on a weighted parameter matrix to provide a plurality of reconstructed sentence embeddings, and training the unsupervised neural attention model based on the sentence embeddings and the reconstructed sentence embeddings to provide a trained neural attention model, the trained neural attention model being used to automatically determine aspects from the vocabulary.
-
公开(公告)号:US20180285344A1
公开(公告)日:2018-10-04
申请号:US15478363
申请日:2017-04-04
Applicant: SAP SE
Inventor: Ruidan He , Daniel Dahlmeier
IPC: G06F17/27
CPC classification number: G06F17/2785
Abstract: Methods, systems, and computer-readable storage media for receiving a vocabulary that includes text data that is provided as at least a portion of raw data, the raw data being provided in a computer-readable file, providing word embeddings based on the vocabulary, the word embeddings including word vectors for words included in the vocabulary, clustering word embeddings to provide a plurality of clusters, each cluster representing an aspect inferred from the vocabulary, determining a respective association score between each word in the vocabulary and a respective aspect, and providing a word ranking for each aspect based on the respective association scores.
-
公开(公告)号:US10755174B2
公开(公告)日:2020-08-25
申请号:US15484577
申请日:2017-04-11
Applicant: SAP SE
Inventor: Ruidan He , Daniel Dahlmeier
IPC: G06N3/08 , G06N3/04 , G06F40/30 , G06F40/216 , G06F40/289
Abstract: Methods, systems, and computer-readable storage media for receiving a vocabulary, the vocabulary including text data that is provided as at least a portion of raw data, the raw data being provided in a computer-readable file, associating each word in the vocabulary with a feature vector, providing a sentence embedding for each sentence of the vocabulary based on a plurality of feature vectors to provide a plurality of sentence embeddings, providing a reconstructed sentence embedding for each sentence embedding based on a weighted parameter matrix to provide a plurality of reconstructed sentence embeddings, and training the unsupervised neural attention model based on the sentence embeddings and the reconstructed sentence embeddings to provide a trained neural attention model, the trained neural attention model being used to automatically determine aspects from the vocabulary.
-
公开(公告)号:US20200159863A1
公开(公告)日:2020-05-21
申请号:US16196008
申请日:2018-11-20
Applicant: SAP SE
Inventor: Wenya Wang , Daniel Dahlmeier , Sinno Jialin Pan
Abstract: Methods, systems, and computer-readable storage media for receiving input data including a set of sentences, each sentence including computer-readable text as a sequence of tokens, providing a memory network with coupled attentions (MNCA), the coupled attentions including an aspect attention and an opinion attention that are coupled by tensor operators for each sentence in the set of sentences, processing the input data through the MNCA to identify a set of aspect terms, and a set of opinion terms, and simultaneously assign a category to each aspect term and each opinion term from a set of categories, and outputting the set of aspect terms with respective categories, and the set of opinion terms with respective categories.
-
公开(公告)号:US10223354B2
公开(公告)日:2019-03-05
申请号:US15478363
申请日:2017-04-04
Applicant: SAP SE
Inventor: Ruidan He , Daniel Dahlmeier
Abstract: Methods, systems, and computer-readable storage media for receiving a vocabulary that includes text data that is provided as at least a portion of raw data, the raw data being provided in a computer-readable file, providing word embeddings based on the vocabulary, the word embeddings including word vectors for words included in the vocabulary, clustering word embeddings to provide a plurality of clusters, each cluster representing an aspect inferred from the vocabulary, determining a respective association score between each word in the vocabulary and a respective aspect, and providing a word ranking for each aspect based on the respective association scores.
-
-
-
-