UNSUPERVISED NEURAL ATTENTION MODEL FOR ASPECT EXTRACTION

    公开(公告)号:US20180293499A1

    公开(公告)日:2018-10-11

    申请号:US15484577

    申请日:2017-04-11

    Applicant: SAP SE

    Abstract: Methods, systems, and computer-readable storage media for receiving a vocabulary, the vocabulary including text data that is provided as at least a portion of raw data, the raw data being provided in a computer-readable file, associating each word in the vocabulary with a feature vector, providing a sentence embedding for each sentence of the vocabulary based on a plurality of feature vectors to provide a plurality of sentence embeddings, providing a reconstructed sentence embedding for each sentence embedding based on a weighted parameter matrix to provide a plurality of reconstructed sentence embeddings, and training the unsupervised neural attention model based on the sentence embeddings and the reconstructed sentence embeddings to provide a trained neural attention model, the trained neural attention model being used to automatically determine aspects from the vocabulary.

    UNSUPERVISED ASPECT EXTRACTION FROM RAW DATA USING WORD EMBEDDINGS

    公开(公告)号:US20180285344A1

    公开(公告)日:2018-10-04

    申请号:US15478363

    申请日:2017-04-04

    Applicant: SAP SE

    CPC classification number: G06F17/2785

    Abstract: Methods, systems, and computer-readable storage media for receiving a vocabulary that includes text data that is provided as at least a portion of raw data, the raw data being provided in a computer-readable file, providing word embeddings based on the vocabulary, the word embeddings including word vectors for words included in the vocabulary, clustering word embeddings to provide a plurality of clusters, each cluster representing an aspect inferred from the vocabulary, determining a respective association score between each word in the vocabulary and a respective aspect, and providing a word ranking for each aspect based on the respective association scores.

    Unsupervised neural attention model for aspect extraction

    公开(公告)号:US10755174B2

    公开(公告)日:2020-08-25

    申请号:US15484577

    申请日:2017-04-11

    Applicant: SAP SE

    Abstract: Methods, systems, and computer-readable storage media for receiving a vocabulary, the vocabulary including text data that is provided as at least a portion of raw data, the raw data being provided in a computer-readable file, associating each word in the vocabulary with a feature vector, providing a sentence embedding for each sentence of the vocabulary based on a plurality of feature vectors to provide a plurality of sentence embeddings, providing a reconstructed sentence embedding for each sentence embedding based on a weighted parameter matrix to provide a plurality of reconstructed sentence embeddings, and training the unsupervised neural attention model based on the sentence embeddings and the reconstructed sentence embeddings to provide a trained neural attention model, the trained neural attention model being used to automatically determine aspects from the vocabulary.

    MEMORY NETWORKS FOR FINE-GRAIN OPINION MINING

    公开(公告)号:US20200159863A1

    公开(公告)日:2020-05-21

    申请号:US16196008

    申请日:2018-11-20

    Applicant: SAP SE

    Abstract: Methods, systems, and computer-readable storage media for receiving input data including a set of sentences, each sentence including computer-readable text as a sequence of tokens, providing a memory network with coupled attentions (MNCA), the coupled attentions including an aspect attention and an opinion attention that are coupled by tensor operators for each sentence in the set of sentences, processing the input data through the MNCA to identify a set of aspect terms, and a set of opinion terms, and simultaneously assign a category to each aspect term and each opinion term from a set of categories, and outputting the set of aspect terms with respective categories, and the set of opinion terms with respective categories.

    Unsupervised aspect extraction from raw data using word embeddings

    公开(公告)号:US10223354B2

    公开(公告)日:2019-03-05

    申请号:US15478363

    申请日:2017-04-04

    Applicant: SAP SE

    Abstract: Methods, systems, and computer-readable storage media for receiving a vocabulary that includes text data that is provided as at least a portion of raw data, the raw data being provided in a computer-readable file, providing word embeddings based on the vocabulary, the word embeddings including word vectors for words included in the vocabulary, clustering word embeddings to provide a plurality of clusters, each cluster representing an aspect inferred from the vocabulary, determining a respective association score between each word in the vocabulary and a respective aspect, and providing a word ranking for each aspect based on the respective association scores.

Patent Agency Ranking