Abstract:
The technical solution under the present disclosure automatically analyzes conversations between users by receiving a training dataset having a text sequence including sentences of a conversation between the users; extracting feature(s) from the training dataset based on features; providing equation(s) for a plurality of tasks, the equation(s) being a mathematical function for calculating value of a parameter for each of the tasks based on the extracted feature; determining value of the parameter for tasks by processing the equation(s); assigning label(s) to each of the sentences based on the determined value of the parameter, a first label being selected from a plurality of first labels, and a second label being selected from a number of second labels; and storing and maintaining with the database a pre-defined value of the parameter, first labels, conversations, second labels, a test dataset, equation(s), and pre-defined features.
Abstract:
A method, non-transitory computer readable medium, and apparatus for recommending a topic-cohesive and interactive implicit community are disclosed. For example, the method receives a request for customer care, selects an implicit community identified from a plurality of individual users of a social media website based upon a relevance score related to a topic of the request for customer care and recommends the implicit community in response to the request for customer care.
Abstract:
The disclosed embodiments illustrate methods and systems for detecting personal life events of users. The method includes training classifiers based on a set of features extracted from each of an annotated first set of social media data. The first set of social media is associated with one or more first categories. Further, the first set of social media data are annotated by one or more crowdworkers based on one or more second categories. The method further includes extracting a second set of social media data of one or more users, associated with the one or more first categories, from the one or more social media platforms. The method further includes categorizing the extracted second set of social media data into the one or more second categories by use of the trained classifiers. The categorization is further utilized to detect the personal life events of the one or more users.
Abstract:
Systems and methods of data analytics, which in various embodiments enable business analysts to apply certain machine learning and analytics algorithms in a self-service manner by binding them to generic business questions that they can be used to answer in particular domains. The general approach may be to define the application of an algorithm to solve specific problems (questions) for particular combinations of a business domain and a data category. At design time, the algorithm may be linked to canonical data within a data category and programmed to run with this canonical data set. At runtime, given a dataset and its category, and a business domain, a user may choose from the corresponding questions and the system may run the algorithm bound to that question.
Abstract:
A method for assigning a topic to a collection of microblog posts may include, by an acquisition module, receiving from at least one messaging service server, a plurality of posts, wherein each of the plurality of posts comprise post content; by a generation module, analyzing the posts and extract, from at least one of the posts, a link with an address to an external document; and, by the acquisition module, accessing the external document that is associated with the address and fetch external content associated with the document. The method may also include by the generation module: analyzing the post content to identify at least one label for each post, for each post that includes a link, analyzing the external content to identify a topic, and using a topic modeling technique to generate a trained topic model comprising a plurality of topics and a plurality of associated words.
Abstract:
The disclosed embodiments illustrate methods and systems for searching for a first user. The one or more inputs pertaining to one or more first attributes of the first user are received. Further, the one or more first attributes of the first user are ranked based on at least a presence of the one or more first attributes among one or more second attributes pertaining to one or more second users. Thereafter, one or more search strings comprising at least one attribute selected from the ranked one or more first attributes are generated, wherein the one or more search strings are utilizable to search for the first user. Finally, a list of third users is obtained from one or more search engines in response to the one or more search strings.
Abstract:
Systems and methods of data analytics, which in various embodiments enable business analysts to apply certain machine learning and analytics algorithms in a self-service manner by binding them to generic business questions that they can be used to answer in particular domains. The general approach may be to define the application of an algorithm to solve specific problems (questions) for particular combinations of a business domain and a data category. At design time, the algorithm may be linked to canonical data within a data category and programmed to run with this canonical data set. At runtime, given a dataset and its category, and a business domain, a user may choose from the corresponding questions and the system may run the algorithm bound to that question.
Abstract:
A method and a system are provided for correlation detection in multiple spatio-temporal datasets for event sensing in a geographical area. The method includes extracting datasets, comprising information about one or more events, from one or more data sources. The method further includes identifying a primary data source and secondary data sources from the one or more data sources. The method further includes extracting primary features from the datasets associated with the primary data source and secondary features from the datasets associated with the secondary data sources. The primary features are categorized into one or more categories. The method further includes training classifiers based on the primary features and/or the one or more categories. The method further includes detecting a correlation among the information associated with the one or more events based on a category transfer distribution from the primary data source to the secondary data sources.
Abstract:
The disclosed embodiments illustrate methods and systems for identifying a set of users for a marketing campaign. The method includes retrieving one or more first keywords from one or more messages shared by one or more first users, or from a user profile of each of one or more first users. The one or more first keywords are indicative of one or more events associated with one or more first users, and one or more intents of said one or more first users. The method further includes receiving one or more second keywords, pertaining to marketing campaign, from a computing device of a second user. Thereafter, the method includes identifying said set of users from said one or more first users based on a correlation between said one or more first keywords and said one or more second keywords. The method is performed by one or more microprocessors.
Abstract:
Embodiments of a computer-implemented method for automatically analyzing a conversational sequence between multiple users are disclosed. The method includes receiving signals corresponding to a training dataset including multiple conversational sequences; extracting a feature from the training dataset based on predefined feature categories; formulating multiple tasks for being learned from the training dataset based on the extracted feature, each task related to a predefined label; and providing a model for each formulated task, the model including a set of parameters common to the tasks. The set includes an explicit parameter, which is explicitly shared with each of the formulated tasks. The method further includes optimizing a value of the explicit parameter to create an optimized model; creating a trained model for the formulated tasks using the optimized value of the explicit parameter; and assigning predefined labels for the formulated tasks to a live dataset based on the corresponding trained model.