Invention Grant
- Patent Title: Unsupervised topic modeling for short texts
-
Application No.: US15401446Application Date: 2017-01-09
-
Publication No.: US09928231B2Publication Date: 2018-03-27
- Inventor: Vivek Kumar Rangarajan Sridhar
- Applicant: AT&T Intellectual Property I, L.P.
- Applicant Address: US GA Atlanta
- Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.
- Current Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.
- Current Assignee Address: US GA Atlanta
- Main IPC: G06F17/27
- IPC: G06F17/27 ; G10L25/30 ; H04W4/14

Abstract:
Topics are determined for short text messages using an unsupervised topic model. In a training corpus created from a number of short text messages, a vocabulary of words is identified, and for each word a distributed vector representation is obtained by processing windows of the corpus having a fixed length. The corpus is modeled as a Gaussian mixture model in which Gaussian components represent topics. To determine a topic of a sample short text message, a posterior distribution over the corpus topics is obtained using the Gaussian mixture model.
Public/Granted literature
- US20170116178A1 Unsupervised Topic Modeling For Short Texts Public/Granted day:2017-04-27
Information query