METHOD OF MONITORING ELECTRONIC MEDIA
    1.
    发明申请
    METHOD OF MONITORING ELECTRONIC MEDIA 有权
    监控电子媒体的方法

    公开(公告)号:US20090119275A1

    公开(公告)日:2009-05-07

    申请号:US11927238

    申请日:2007-10-29

    IPC分类号: G06F7/06 G06F17/30

    CPC分类号: G06Q30/02

    摘要: Consumer-generated media (CGM) and/or other media are monitored to allow an organization to become aware of, and respond to, issues that may affect how it is perceived by the public. An extract, transform, load (ETL) engine is used to process CGM and other media content, and an analytical engine utilizes a multi-step progressive filtering approach to identify those documents that are most relevant. The filtering approach includes executing broad queries to extract relevant content from different CGM and other sources, extracting text snippets from the relevant content and performing de-duplication, defining organizational identity (e.g., brand name, trade name, or company name) and hot-topic models using a rule-based and statistical-based approach, and using the models together in an orthogonal filtering approach to effectively generate alerts and reports. The methodology is found to be substantially more effective compared to a conventional keyword based approach.

    摘要翻译: 对消费者生成的媒体(CGM)和/或其他媒体进行监控,以使组织能够意识到可能影响公众感知的问题并对其进行回应。 提取,变换,加载(ETL)引擎用于处理CGM和其他媒体内容,分析引擎利用多步逐步过滤方法来识别最相关的文档。 过滤方法包括执行广泛查询,从不同的CGM和其他来源提取相关内容,从相关内容中提取文本片段,执行重复数据删除,定义组织身份(例如品牌名称,商品名称或公司名称) 主题模型使用基于规则和统计的方法,并以正交过滤方法将模型一起使用,以有效地生成警报和报告。 与常规的基于关键词的方法相比,发现该方法显着更有效。

    Method of monitoring electronic media
    2.
    发明授权
    Method of monitoring electronic media 有权
    监控电子媒体的方法

    公开(公告)号:US08010524B2

    公开(公告)日:2011-08-30

    申请号:US11927238

    申请日:2007-10-29

    IPC分类号: G06F17/30

    CPC分类号: G06Q30/02

    摘要: Consumer-generated media (CGM) and/or other media are monitored to allow an organization to become aware of, and respond to, issues that may affect how it is perceived by the public. An extract, transform, load (ETL) engine is used to process CGM and other media content, and an analytical engine utilizes a multi-step progressive filtering approach to identify those documents that are most relevant. The filtering approach includes executing broad queries to extract relevant content from different CGM and other sources, extracting text snippets from the relevant content and performing de-duplication, defining organizational identity (e.g., brand name, trade name, or company name) and hot-topic models using a rule-based and statistical-based approach, and using the models together in an orthogonal filtering approach to effectively generate alerts and reports. The methodology is found to be substantially more effective compared to a conventional keyword based approach.

    摘要翻译: 对消费者生成的媒体(CGM)和/或其他媒体进行监控,以使组织能够意识到可能影响公众感知的问题并对其进行回应。 提取,变换,加载(ETL)引擎用于处理CGM和其他媒体内容,分析引擎利用多步逐步过滤方法来识别最相关的文档。 过滤方法包括执行广泛查询,从不同的CGM和其他来源提取相关内容,从相关内容中提取文本片段,执行重复数据删除,定义组织身份(例如品牌名称,商品名称或公司名称) 主题模型使用基于规则和统计的方法,并以正交过滤方法将模型一起使用,以有效地生成警报和报告。 与常规的基于关键词的方法相比,发现该方法显着更有效。

    Using rule induction to identify emerging trends in unstructured text streams
    3.
    发明授权
    Using rule induction to identify emerging trends in unstructured text streams 失效
    使用规则归纳来识别非结构化文本流中的新趋势

    公开(公告)号:US08712926B2

    公开(公告)日:2014-04-29

    申请号:US12126829

    申请日:2008-05-23

    IPC分类号: G06F15/18

    CPC分类号: G06N99/005

    摘要: A method for identifying emerging concepts in unstructured text streams comprises: selecting a subset V of documents from a set U of documents; generating at least one Boolean combination of terms that partitions the set U into a plurality of categories that represent a generalized, statistically based model of the selected subset V wherein the categories are disjoint inasmuch as each document of U is included in only one category of the partition; and generating a descriptive label for each of the disjoint categories from the Boolean combination of terms for that category.

    摘要翻译: 用于识别非结构化文本流中新出现的概念的方法包括:从文档集合U中选择文档的子集V; 生成将集合U分成多个类别的术语的至少一个布尔组合,所述多个类别表示所选择的子集V的广义的,统计学上的模型,其中,所述类别是不相交的,因为U的每个文档仅包括在 划分; 以及从该类别的术语的布尔组合中为每个不相交类别生成描述性标签。

    USING RULE INDUCTION TO IDENTIFY EMERGING TRENDS IN UNSTRUCTURED TEXT STREAMS
    4.
    发明申请
    USING RULE INDUCTION TO IDENTIFY EMERGING TRENDS IN UNSTRUCTURED TEXT STREAMS 失效
    使用规则诱导来识别未经修订的文本流程中的新兴趋势

    公开(公告)号:US20090292660A1

    公开(公告)日:2009-11-26

    申请号:US12126829

    申请日:2008-05-23

    IPC分类号: G06F15/18

    CPC分类号: G06N99/005

    摘要: A method for identifying emerging concepts in unstructured text streams comprises: selecting a subset V of documents from a set U of documents; generating at least one Boolean combination of terms that partitions the set U into a plurality of categories that represent a generalized, statistically based model of the selected subset V wherein the categories are disjoint inasmuch as each document of U is included in only one category of the partition; and generating a descriptive label for each of the disjoint categories from the Boolean combination of terms for that category.

    摘要翻译: 用于识别非结构化文本流中新出现的概念的方法包括:从文档集合U中选择文档的子集V; 生成将集合U分成多个类别的术语的至少一个布尔组合,所述多个类别表示所选择的子集V的广义的,统计学上的模型,其中,所述类别是不相交的,因为U的每个文档仅包括在 划分; 以及从该类别的术语的布尔组合中为每个不相交类别生成描述性标签。

    Simplified entity relationship model to access structure data
    5.
    发明授权
    Simplified entity relationship model to access structure data 失效
    简化的实体关系模型来访问结构数据

    公开(公告)号:US08572124B2

    公开(公告)日:2013-10-29

    申请号:US12715997

    申请日:2010-03-02

    IPC分类号: G06F7/00 G06F17/30 G06F3/048

    摘要: Data may be modeled as an undirected graph. A set of entities and a set of attributes may be defined. A set of relationships may be defined to represent semantic associations with each association connecting at least two entities. Attributes may be associated with entities rather than with relationships. A hierarchical query language with a set of atomic operations on modeled data may be employed. The modeled data may be displayed on a display unit.

    摘要翻译: 数据可以被建模为无向图。 可以定义一组实体和一组属性。 可以定义一组关系以表示与连接至少两个实体的每个关联的语义关联。 属性可能与实体相关联,而不是与关系关联。 可以采用具有对建模数据的一组原子操作的分层查询语言。 建模的数据可以显示在显示单元上。

    SYSTEMS AND METHODS FOR DETECTING SENTIMENT-BASED TOPICS
    6.
    发明申请
    SYSTEMS AND METHODS FOR DETECTING SENTIMENT-BASED TOPICS 有权
    用于检测基于敏感度的主题的系统和方法

    公开(公告)号:US20110137906A1

    公开(公告)日:2011-06-09

    申请号:US12634269

    申请日:2009-12-09

    IPC分类号: G06F17/30

    CPC分类号: G06F17/2785

    摘要: A method for analyzing sentiment comprising: collecting an object from an external content repository, the collected objects forming a content database; extracting a snippet related to the subject from the content database; calculating a sentiment score for the snippet; classifying the snippet into a sentiment category; creating sentiment taxonomy using the sentiment categories, the sentiment taxonomy classifying the snippets as positive, negative or neutral; identifying topic words within the sentiment taxonomy; classifying the topic words as a sentiment topic word candidates or a non-sentiment topic word candidate, filtering the non-sentiment topic word candidates; identifying the frequency of the non-sentiment topic words in each of the sentiment categories; identifying the importance of the non-sentiment topic word for each of the sentiment categories; and, ranking the topic word, wherein the rank is calculated by combining the frequency of the topic words in each of the categories with its importance.

    摘要翻译: 一种用于分析情绪的方法,包括:从外部存储库收集对象,所收集的对象形成内容数据库; 从所述内容数据库提取与所述主题相关的片段; 计算该片段的情绪分数; 将该片段分类为情感类别; 使用情绪类别创建情感分类,情绪分类法将片段分为正,负或中立; 识别情感分类学中的主题词; 将主题词分类为情感主题词候选词或非情绪主题词候选,过滤非情绪主题词候选词; 识别每个情绪类别中非情绪主题词的频率; 确定每个情绪类别的非情绪主题词的重要性; 并且对主题词进行排名,其中通过将每个类别中的主题词的频率与其重要性组合来计算排名。

    CLASSIFYING DOCUMENTS ACCORDING TO READERSHIP
    7.
    发明申请
    CLASSIFYING DOCUMENTS ACCORDING TO READERSHIP 有权
    根据读者分类文件

    公开(公告)号:US20120226695A1

    公开(公告)日:2012-09-06

    申请号:US13473136

    申请日:2012-05-16

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30707

    摘要: A system for classifying documents in a collection of documents according to their intended readerships includes: a computer configured to select a document in the collection of documents; and a computer to determine a characteristic of the selected document, the characteristic being: misleading when the document includes one or more features that are determined to be for a purpose other than reading the document; commercial when the document includes features that are presented for a commercial purpose; or personal when the document includes features of a personal opinion. A computer classifies the selected document as misleading, commercial, or personal according to its determined characteristic; and a computer repeats the steps of select document, determines a characteristic of the selected document, and classifies the selected document for additional documents in the collection. At least some documents are classified as misleading, some as commercial, and at least some as personal.

    摘要翻译: 一种用于根据其预期读取者对文档集合中的文档进行分类的系统包括:被配置为在文档集合中选择文档的计算机; 以及计算机,用于确定所选择的文档的特征,其特征在于:当所述文档包括确定为用于除了阅读所述文档之外的目的的一个或多个特征时具有误导性; 当文档包含为商业目的呈现的特征时,商业广告; 或个人,当文件包含个人意见的特征。 计算机根据其确定的特征将所选择的文档分类为具有误导性,商业性或个人性的; 并且计算机重复选择文档的步骤,确定所选择的文档的特征,并对所选择的文档进行分类以用于该集合中的附加文档。 至少有一些文件被归类为误导性的,有些则是商业的,至少有些是个人的。

    CLASSIFYING DOCUMENTS ACCORDING TO READERSHIP
    8.
    发明申请
    CLASSIFYING DOCUMENTS ACCORDING TO READERSHIP 有权
    根据读者分类文件

    公开(公告)号:US20110276553A1

    公开(公告)日:2011-11-10

    申请号:US12776779

    申请日:2010-05-10

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30707

    摘要: One embodiment is a computer-implemented method for classifying documents in a collection of documents according to their intended readerships. The method comprises using a computer to select a document in the collection of documents; and using a computer to determine a characteristic of the selected document, the characteristic being: misleading when the document includes one or more features that are determined to be for a purpose other than reading the document; commercial when the document includes features that are presented for a commercial purpose; or personal when the document includes features of a personal opinion. The method further includes using a computer to classify the selected document as misleading, commercial, or personal according to its determined characteristic; and using a computer to repeat the steps of select document, determine a characteristic of the selected document, and classify the selected document for additional documents in the collection. At least some documents are classified as misleading, at least some documents are classified as commercial, and at least some documents are classified as personal. Other methods and computer program products are also disclosed according to even more embodiments.

    摘要翻译: 一个实施例是一种计算机实现的方法,用于根据其预期的读者对文档集合中的文档进行分类。 该方法包括使用计算机在文档集合中选择文档; 并且使用计算机来确定所选择的文档的特征,其特征是:当所述文档包括被确定为用于除了阅读所述文档之外的目的的一个或多个特征时具有误导性; 当文档包含为商业目的呈现的特征时,商业广告; 或个人,当文件包含个人意见的特征。 该方法还包括使用计算机根据其确定的特征将所选择的文档分类为误导,商业或个人; 并使用计算机重复选择文档的步骤,确定所选文档的特征,并将所选择的文档分类到集合中的附加文档。 至少有些文件被归类为误导性的,至少有些文件被归类为商业,至少有一些文件被归类为个人。 根据甚至更多的实施例还公开了其它方法和计算机程序产品。

    Method, server and system for converging desktop application and web application

    公开(公告)号:US11240287B2

    公开(公告)日:2022-02-01

    申请号:US12675456

    申请日:2008-08-28

    IPC分类号: H04L29/08 G06F9/50 H04L29/06

    摘要: The present invention provides a method, Web server and computer system for converging a desktop application and a Web application. The method may comprise: in response to a request from a client user for using a target desktop application, starting a desktop application initialization process on the Web server and determining an appropriate corresponding hosting server for the user; preparing and provisioning desktop application environment on the corresponding hosting server and starting the target desktop application; transmitting the corresponding hosting server's address to the client so as to make desktop application interaction between the client and the corresponding hosting server; and in response to the completion of the desktop application interaction, stopping and exiting the target desktop application on the corresponding hosting server.