Abstract:
A system and method for analyzing social media data by obtaining social media data from a social media platform, where the social media data includes documents from multiple users of the social media platform; classifying the documents using a sentiment classifier; tokenizing the documents into terms; associating a sentiment with each term; detecting a first event based on a number of occurrences of a first term in the documents; and providing information associated with the event to a user, where the information includes the first term and a sentiment associated with the first term.
Abstract:
A system and method for analyzing social media data by obtaining social media data from a social media platform, where the social media data includes documents from multiple users of the social media platform; classifying the documents using a sentiment classifier; tokenizing the documents into terms; associating a sentiment with each term; detecting a first event based on a number of occurrences of a first term in the documents; and providing information associated with the event to a user, where the information includes the first term and a sentiment associated with the first term.
Abstract:
Methods, systems and processor-readable media for simultaneous sentiment analysis and topic classification with multiple labels. A sentiment and topic associated with a post can be classified at similar time and a result can be incorporated to predict a feature so that a label of two (or more) tasks can promote and reinforce each other iteratively. A feature extraction and selection can be performed on the tasks and a multi-task multi-label classification model can be trained for each task with maximum entropy utilizing multiple labels to ascertain information derived from an extra label and to manage class ambiguities. Each task has a separate classification model with different predicting features and they can be trained collectively which allows flexibility in model construction. The multi-task multi-label classification model produces a probabilistic result and the classes can be ranked by the probabilistic result and the post can be classified with the multi-label.