METHODS AND SYSTEMS FOR DATA PROCESSING
    48.
    发明申请

    公开(公告)号:US20170293597A1

    公开(公告)日:2017-10-12

    申请号:US15092941

    申请日:2016-04-07

    CPC classification number: G06F17/2775 G06F16/35 G06F17/2705

    Abstract: This invention relates to methods and systems for message analysis and classification. It is particularly applicable to analysis and classification of very short messages such as “Tweets”. Embodiments of the invention provide methods for unbiased enriched representation for messages which can be used to transform very short messages into comparatively longer text. These methods can make use of word context information in addition to word information itself. This can provide text with enough information for analysis and classification without changing the information in the original message. Embodiments of the invention also provide a statistical learning mechanism which does not require pre-defined keywords, and can automatically detect inherent frequent words and word patterns. These methods can provide satisfactory classification accuracy even for very short messages.

Patent Agency Ranking