Method and system for incremental collection of forum replies
    1.
    发明授权
    Method and system for incremental collection of forum replies 有权
    增量收集论坛答复的方法和系统

    公开(公告)号:US09552435B2

    公开(公告)日:2017-01-24

    申请号:US13997257

    申请日:2011-12-22

    申请人: Xinli Wu Jianwu Yang

    发明人: Xinli Wu Jianwu Yang

    IPC分类号: G06F17/30 G06Q10/10

    摘要: The present application discloses methods and systems for incrementally collecting replies in a forum and belongs to the technical field of collecting network information. The method comprises periodically determining whether there is a newly-established post and a post having new replies in all forum list pages needed to be collected: if yes, extracting a main post and reply information from the newly-established post, and extracting the information of the new replies from the post having new replies. The system comprises a determining device (11) for periodically determining whether there is a newly-established post and a post having new replies in all forum list pages needed to be collected; and an extracting device (12) for extracting a main post and reply information from the newly-established post, and extracting the information of the new replies from the post having new replies. The present application can quickly, accurately and completely collect all main post/replies of a post, so that the drawback that the information of turned pages of a post are missed to be searched or cannot be searched through a general search engine may be overcome.

    摘要翻译: 本申请公开了在论坛中递增收集回复的方法和系统,属于收集网络信息的技术领域。 所述方法包括:定期确定是否存在新建立的帖子和在所有论坛列表页面中需要收集的具有新的回复的帖子:如果是,从新建立的帖子中提取主要帖子和回复信息,并且提取信息 从有新的回复的帖子的新回复。 该系统包括用于周期性地确定是否存在新建立的帖子的确定装置(11)和在所有论坛列表页面中需要收集的具有新的回复的帖子; 以及用于从新建立的帖子中提取主帖子和回复信息的提取装置(12),并且从具有新的回复的帖子中提取新的回复的信息。 本申请可以快速,准确和完整地收集所有主要的帖子/回复,从而可以克服错过通过一般搜索引擎搜索或不能通过一般搜索引擎搜索的帖子的翻页信息的缺点。

    Webpage information detection method and system
    2.
    发明授权
    Webpage information detection method and system 有权
    网页信息检测方法和系统

    公开(公告)号:US09519718B2

    公开(公告)日:2016-12-13

    申请号:US13997251

    申请日:2011-12-22

    IPC分类号: G06F17/30

    摘要: The present application provides web information detecting method and system. The method according to the present application comprises: pre-extracting keywords from a web page; storing a corresponding relationship between the extracted keywords and a URL of the web page in a database; obtaining a source file of a web page to be detected; searching the database for keywords that have the same URL as that of the web page to be detected; comparing the searched keywords to the source file information of the web page to be detected; and determining the existence of information of the web page to be detected according to a matching degree. The present application increases the accuracy of web information detection.

    摘要翻译: 本申请提供了网络信息检测方法和系统。 根据本申请的方法包括:从网页预提取关键字; 在提取的关键字和数据库中的网页的URL之间存储对应的关系; 获取要检测的网页的源文件; 在数据库中搜索与要检测的网页具有相同URL的关键字; 将搜索到的关键字与要检测的网页的源文件信息进行比较; 以及根据匹配度确定要检测的网页的信息的存在。 本申请提高了网络信息检测的准确性。

    METHOD, DEVICE AND SYSTEM FOR PROCESSING PUBLIC OPINION TOPICS
    3.
    发明申请
    METHOD, DEVICE AND SYSTEM FOR PROCESSING PUBLIC OPINION TOPICS 审中-公开
    方法,处理公众意见主题的设备和系统

    公开(公告)号:US20140052753A1

    公开(公告)日:2014-02-20

    申请号:US13997137

    申请日:2011-12-21

    IPC分类号: G06F17/30

    CPC分类号: G06F16/951 G06F16/958

    摘要: The application relates to communication technology field, in particular to a method, an apparatus and a system for processing a public sentiment topic. The method comprises steps of searching a public sentiment topic containing public sentiment information through a network; acquiring characteristic information of the searched public sentiment topic; determining whether the acquired characteristic information of the public sentiment topic meets an alarming condition; and if yes, storing the public sentiment topic and its characteristic information. According to the method, apparatus and system for processing the public sentiment topic provided in embodiments of the present application, it can be determined whether or not an alarm for the public sentiment topic shall be issued by acquiring and determining the characteristic information of the public sentiment topic. In addition, the public sentiment topic can be managed and tracked sequentially to acquire the change trend of propagation, hits and comments thereof, so that the public sentiment topic can be understood comprehensively. The public sentiment topic can also be analyzed and arranged to generate a public sentiment brief report.

    摘要翻译: 本申请涉及通信技术领域,特别涉及用于处理公众情感话题的方法,装置和系统。 该方法包括通过网络搜索包含公众情感信息的公众情感话题的步骤; 获取搜索到的公众情感话题的特征信息; 确定获得的公众情感话题的特征信息是否满足报警条件; 如果是,存储公众情感话题及其特征信息。 根据本申请实施例提供的用于处理公众情感话题的方法,装置和系统,可以通过获取和确定公众情绪的特征信息来确定是否发出公众情感话题的报警 话题。 此外,公众情感话题可以依次进行管理和跟踪,以获取传播,命中和评论的变化趋势,全面了解公众情感话题。 公众情感话题也可以分析安排,形成公众情绪简报。

    METHOD AND SYSTEM FOR INCREMENTAL COLLECTION OF FORUM REPLIES
    4.
    发明申请
    METHOD AND SYSTEM FOR INCREMENTAL COLLECTION OF FORUM REPLIES 有权
    用于收集论文集的收集方法和系统

    公开(公告)号:US20150127644A1

    公开(公告)日:2015-05-07

    申请号:US13997257

    申请日:2011-12-22

    申请人: Xinli Wu Jianwu Yang

    发明人: Xinli Wu Jianwu Yang

    IPC分类号: G06F17/30

    摘要: The present application discloses methods and systems for incrementally collecting replies in a forum and belongs to the technical field of collecting network information. The method comprises periodically determining whether there is a newly-established post and a post having new replies in all forum list pages needed to be collected: if yes, extracting a main post and reply information from the newly-established post, and extracting the information of the new replies from the post having new replies. The system comprises a determining device (11) for periodically determining whether there is a newly-established post and a post having new replies in all forum list pages needed to be collected; and an extracting device (12) for extracting a main post and reply information from the newly-established post, and extracting the information of the new replies from the post having new replies. The present application can quickly, accurately and completely collect all main post/replies of a post, so that the drawback that the information of turned pages of a post are missed to be searched or cannot be searched through a general search engine may be overcome.

    摘要翻译: 本申请公开了在论坛中递增收集回复的方法和系统,属于收集网络信息的技术领域。 所述方法包括:定期确定是否存在新建立的帖子和在所有论坛列表页面中需要收集的具有新的回复的帖子:如果是,从新建立的帖子中提取主要帖子和回复信息,并且提取信息 从有新的回复的帖子的新回复。 该系统包括用于周期性地确定是否存在新建立的帖子的确定装置(11)和在所有论坛列表页面中需要收集的具有新的回复的帖子; 以及用于从新建立的帖子中提取主帖子和回复信息的提取装置(12),并且从具有新的回复的帖子中提取新的回复的信息。 本申请可以快速,准确和完整地收集所有主要的帖子/回复,从而可以克服错过通过一般搜索引擎搜索或不能通过一般搜索引擎搜索的帖子的翻页信息的缺点。

    WEBPAGE INFORMATION DETECTION METHOD AND SYSTEM
    5.
    发明申请
    WEBPAGE INFORMATION DETECTION METHOD AND SYSTEM 有权
    网格信息检测方法与系统

    公开(公告)号:US20140067784A1

    公开(公告)日:2014-03-06

    申请号:US13997251

    申请日:2011-12-22

    IPC分类号: G06F17/30

    摘要: The present application provides web information detecting method and system. The method according to the present application comprises: pre-extracting keywords from a web page; storing a corresponding relationship between the extracted keywords and a URL of the web page in a database; obtaining a source file of a web page to be detected; searching the database for keywords that have the same URL as that of the web page to be detected; comparing the searched keywords to the source file information of the web page to be detected; and determining the existence of information of the web page to be detected according to a matching degree. The present application increases the accuracy of web information detection.

    摘要翻译: 本申请提供了网络信息检测方法和系统。 根据本申请的方法包括:从网页预提取关键字; 在提取的关键字和数据库中的网页的URL之间存储对应的关系; 获取要检测的网页的源文件; 在数据库中搜索与要检测的网页具有相同URL的关键字; 将搜索到的关键字与要检测的网页的源文件信息进行比较; 以及根据匹配度确定要检测的网页的信息的存在。 本申请提高了网络信息检测的准确性。

    METHOD AND DEVICE FOR FILTERING HARMFUL INFORMATION
    6.
    发明申请
    METHOD AND DEVICE FOR FILTERING HARMFUL INFORMATION 审中-公开
    用于过滤有害信息的方法和设备

    公开(公告)号:US20140013221A1

    公开(公告)日:2014-01-09

    申请号:US13997666

    申请日:2011-12-26

    IPC分类号: G06F17/24

    摘要: The application discloses a method and a device for filtering bad information on Internet relating to the computer information process technology and the information filtering technology. Embodiments of the application provide a method for filtering bad information on Internet, comprising: obtaining texts to be filtered, system advanced-research model and a user feedback model; pre-processing the obtained texts; obtaining a first matching result through performing feature information matching between the pre-processed information and the system advanced-research model information; obtaining a second matching result through performing feature information matching between the pre-processed information and the user feedback model information; and performing filtering process on the information of the obtained texts based on the first and second matching results. Through the technical solution disclosed in the application, the performance for automatically filtering bad information can be improved, and the system information can be updated automatically.

    摘要翻译: 本申请公开了一种用于过滤因特网上涉及计算机信息处理技术和信息过滤技术的不良信息的方法和装置。 该应用的实施例提供了一种用于过滤因特网上的不良信息的方法,包括:获得要过滤的文本,系统高级研究模型和用户反馈模型; 预处理所获得的文本; 通过执行预处理信息和系统高级研究模型信息之间的特征信息匹配来获得第一匹配结果; 通过执行预处理信息和用户反馈模型信息之间的特征信息匹配来获得第二匹配结果; 并且基于第一和第二匹配结果对所获取的文本的信息执行滤波处理。 通过应用中公开的技术方案,可以改善自动过滤不良信息的性能,并可以自动更新系统信息。