SYSTEMS AND METHODS FOR DETECTING SENTIMENT-BASED TOPICS
    1.
    发明申请
    SYSTEMS AND METHODS FOR DETECTING SENTIMENT-BASED TOPICS 有权
    用于检测基于敏感度的主题的系统和方法

    公开(公告)号:US20110137906A1

    公开(公告)日:2011-06-09

    申请号:US12634269

    申请日:2009-12-09

    IPC分类号: G06F17/30

    CPC分类号: G06F17/2785

    摘要: A method for analyzing sentiment comprising: collecting an object from an external content repository, the collected objects forming a content database; extracting a snippet related to the subject from the content database; calculating a sentiment score for the snippet; classifying the snippet into a sentiment category; creating sentiment taxonomy using the sentiment categories, the sentiment taxonomy classifying the snippets as positive, negative or neutral; identifying topic words within the sentiment taxonomy; classifying the topic words as a sentiment topic word candidates or a non-sentiment topic word candidate, filtering the non-sentiment topic word candidates; identifying the frequency of the non-sentiment topic words in each of the sentiment categories; identifying the importance of the non-sentiment topic word for each of the sentiment categories; and, ranking the topic word, wherein the rank is calculated by combining the frequency of the topic words in each of the categories with its importance.

    摘要翻译: 一种用于分析情绪的方法,包括:从外部存储库收集对象,所收集的对象形成内容数据库; 从所述内容数据库提取与所述主题相关的片段; 计算该片段的情绪分数; 将该片段分类为情感类别; 使用情绪类别创建情感分类,情绪分类法将片段分为正,负或中立; 识别情感分类学中的主题词; 将主题词分类为情感主题词候选词或非情绪主题词候选,过滤非情绪主题词候选词; 识别每个情绪类别中非情绪主题词的频率; 确定每个情绪类别的非情绪主题词的重要性; 并且对主题词进行排名,其中通过将每个类别中的主题词的频率与其重要性组合来计算排名。

    METHOD OF MONITORING ELECTRONIC MEDIA
    2.
    发明申请
    METHOD OF MONITORING ELECTRONIC MEDIA 有权
    监控电子媒体的方法

    公开(公告)号:US20090119275A1

    公开(公告)日:2009-05-07

    申请号:US11927238

    申请日:2007-10-29

    IPC分类号: G06F7/06 G06F17/30

    CPC分类号: G06Q30/02

    摘要: Consumer-generated media (CGM) and/or other media are monitored to allow an organization to become aware of, and respond to, issues that may affect how it is perceived by the public. An extract, transform, load (ETL) engine is used to process CGM and other media content, and an analytical engine utilizes a multi-step progressive filtering approach to identify those documents that are most relevant. The filtering approach includes executing broad queries to extract relevant content from different CGM and other sources, extracting text snippets from the relevant content and performing de-duplication, defining organizational identity (e.g., brand name, trade name, or company name) and hot-topic models using a rule-based and statistical-based approach, and using the models together in an orthogonal filtering approach to effectively generate alerts and reports. The methodology is found to be substantially more effective compared to a conventional keyword based approach.

    摘要翻译: 对消费者生成的媒体(CGM)和/或其他媒体进行监控,以使组织能够意识到可能影响公众感知的问题并对其进行回应。 提取,变换,加载(ETL)引擎用于处理CGM和其他媒体内容,分析引擎利用多步逐步过滤方法来识别最相关的文档。 过滤方法包括执行广泛查询,从不同的CGM和其他来源提取相关内容,从相关内容中提取文本片段,执行重复数据删除,定义组织身份(例如品牌名称,商品名称或公司名称) 主题模型使用基于规则和统计的方法,并以正交过滤方法将模型一起使用,以有效地生成警报和报告。 与常规的基于关键词的方法相比,发现该方法显着更有效。

    CLASSIFYING DOCUMENTS ACCORDING TO READERSHIP
    3.
    发明申请
    CLASSIFYING DOCUMENTS ACCORDING TO READERSHIP 有权
    根据读者分类文件

    公开(公告)号:US20120226695A1

    公开(公告)日:2012-09-06

    申请号:US13473136

    申请日:2012-05-16

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30707

    摘要: A system for classifying documents in a collection of documents according to their intended readerships includes: a computer configured to select a document in the collection of documents; and a computer to determine a characteristic of the selected document, the characteristic being: misleading when the document includes one or more features that are determined to be for a purpose other than reading the document; commercial when the document includes features that are presented for a commercial purpose; or personal when the document includes features of a personal opinion. A computer classifies the selected document as misleading, commercial, or personal according to its determined characteristic; and a computer repeats the steps of select document, determines a characteristic of the selected document, and classifies the selected document for additional documents in the collection. At least some documents are classified as misleading, some as commercial, and at least some as personal.

    摘要翻译: 一种用于根据其预期读取者对文档集合中的文档进行分类的系统包括:被配置为在文档集合中选择文档的计算机; 以及计算机,用于确定所选择的文档的特征,其特征在于:当所述文档包括确定为用于除了阅读所述文档之外的目的的一个或多个特征时具有误导性; 当文档包含为商业目的呈现的特征时,商业广告; 或个人,当文件包含个人意见的特征。 计算机根据其确定的特征将所选择的文档分类为具有误导性,商业性或个人性的; 并且计算机重复选择文档的步骤,确定所选择的文档的特征,并对所选择的文档进行分类以用于该集合中的附加文档。 至少有一些文件被归类为误导性的,有些则是商业的,至少有些是个人的。

    CLASSIFYING DOCUMENTS ACCORDING TO READERSHIP
    4.
    发明申请
    CLASSIFYING DOCUMENTS ACCORDING TO READERSHIP 有权
    根据读者分类文件

    公开(公告)号:US20110276553A1

    公开(公告)日:2011-11-10

    申请号:US12776779

    申请日:2010-05-10

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30707

    摘要: One embodiment is a computer-implemented method for classifying documents in a collection of documents according to their intended readerships. The method comprises using a computer to select a document in the collection of documents; and using a computer to determine a characteristic of the selected document, the characteristic being: misleading when the document includes one or more features that are determined to be for a purpose other than reading the document; commercial when the document includes features that are presented for a commercial purpose; or personal when the document includes features of a personal opinion. The method further includes using a computer to classify the selected document as misleading, commercial, or personal according to its determined characteristic; and using a computer to repeat the steps of select document, determine a characteristic of the selected document, and classify the selected document for additional documents in the collection. At least some documents are classified as misleading, at least some documents are classified as commercial, and at least some documents are classified as personal. Other methods and computer program products are also disclosed according to even more embodiments.

    摘要翻译: 一个实施例是一种计算机实现的方法,用于根据其预期的读者对文档集合中的文档进行分类。 该方法包括使用计算机在文档集合中选择文档; 并且使用计算机来确定所选择的文档的特征,其特征是:当所述文档包括被确定为用于除了阅读所述文档之外的目的的一个或多个特征时具有误导性; 当文档包含为商业目的呈现的特征时,商业广告; 或个人,当文件包含个人意见的特征。 该方法还包括使用计算机根据其确定的特征将所选择的文档分类为误导,商业或个人; 并使用计算机重复选择文档的步骤,确定所选文档的特征,并将所选择的文档分类到集合中的附加文档。 至少有些文件被归类为误导性的,至少有些文件被归类为商业,至少有一些文件被归类为个人。 根据甚至更多的实施例还公开了其它方法和计算机程序产品。

    Method of monitoring electronic media
    5.
    发明授权
    Method of monitoring electronic media 有权
    监控电子媒体的方法

    公开(公告)号:US08010524B2

    公开(公告)日:2011-08-30

    申请号:US11927238

    申请日:2007-10-29

    IPC分类号: G06F17/30

    CPC分类号: G06Q30/02

    摘要: Consumer-generated media (CGM) and/or other media are monitored to allow an organization to become aware of, and respond to, issues that may affect how it is perceived by the public. An extract, transform, load (ETL) engine is used to process CGM and other media content, and an analytical engine utilizes a multi-step progressive filtering approach to identify those documents that are most relevant. The filtering approach includes executing broad queries to extract relevant content from different CGM and other sources, extracting text snippets from the relevant content and performing de-duplication, defining organizational identity (e.g., brand name, trade name, or company name) and hot-topic models using a rule-based and statistical-based approach, and using the models together in an orthogonal filtering approach to effectively generate alerts and reports. The methodology is found to be substantially more effective compared to a conventional keyword based approach.

    摘要翻译: 对消费者生成的媒体(CGM)和/或其他媒体进行监控,以使组织能够意识到可能影响公众感知的问题并对其进行回应。 提取,变换,加载(ETL)引擎用于处理CGM和其他媒体内容,分析引擎利用多步逐步过滤方法来识别最相关的文档。 过滤方法包括执行广泛查询,从不同的CGM和其他来源提取相关内容,从相关内容中提取文本片段,执行重复数据删除,定义组织身份(例如品牌名称,商品名称或公司名称) 主题模型使用基于规则和统计的方法,并以正交过滤方法将模型一起使用,以有效地生成警报和报告。 与常规的基于关键词的方法相比,发现该方法显着更有效。

    Interactive process for recognition and evaluation of a partial search query and display of interactive results
    7.
    发明授权
    Interactive process for recognition and evaluation of a partial search query and display of interactive results 失效
    用于识别和评估部分搜索查询和交互结果显示的交互过程

    公开(公告)号:US06862713B1

    公开(公告)日:2005-03-01

    申请号:US09387668

    申请日:1999-08-31

    IPC分类号: G06F3/00 G06F3/16 G06F17/30

    摘要: A method for presenting to an end-user the intermediate matching search results of a keyword search in an index list of information. The method comprising the steps of: coupling to a search engine a graphical user interface for accepting keyword search terms for searching the indexed list of information with the search engine; receiving one or more keyword search terms with one or more separation characters separating there between; performing a keyword search with the one or more keyword search terms received when a separation character is received; and presenting the number of documents matching the keyword search terms to the end-user, and presenting a graphical menu item on a display. In accordance with another embodiment of the present invention, an information processing system and computer readable storage medium carries out the above method.

    摘要翻译: 一种用于向最终用户呈现信息索引列表中的关键词搜索的中间匹配搜索结果的方法。 该方法包括以下步骤:将与图形用户界面相耦合的图形用户接口,用于接收关键词搜索项,用于搜索索引的信息列表; 接收一个或多个关键词搜索词与一个或多个分离字符之间的分隔符; 当接收到分离字符时,用所接收的一个或多个关键字搜索项执行关键字搜索; 以及向所述最终用户呈现与所述关键词搜索项匹配的文档的数量,以及在显示器上呈现图形菜单项。 根据本发明的另一实施例,信息处理系统和计算机可读存储介质执行上述方法。