Invention Grant
- Patent Title: Webpage classification method and apparatus, calculation device and machine readable storage medium
-
Application No.: US15505851Application Date: 2016-05-05
-
Publication No.: US10997256B2Publication Date: 2021-05-04
- Inventor: Jie Liang , Haihong Zheng , Hongcai Zou
- Applicant: GUANGZHOU UCWEB COMPUTER TECHNOLOGY CO., LTD.
- Applicant Address: CN Guangdong
- Assignee: GUANGZHOU UCWEB COMPUTER TECHNOLOGY CO., LTD.
- Current Assignee: GUANGZHOU UCWEB COMPUTER TECHNOLOGY CO., LTD.
- Current Assignee Address: CN Guangdong
- Agency: Sheppard Mullin Richter & Hampton LLP
- Priority: CN201510230951.2 20150508
- International Application: PCT/CN2016/081139 WO 20160505
- International Announcement: WO2016/180270 WO 20161117
- Main IPC: G06K9/62
- IPC: G06K9/62 ; G06F16/951 ; G06F16/954 ; G06F40/258 ; G06F17/16 ; G06F16/958 ; G06F16/953 ; G06F16/9532 ; G06F16/9538

Abstract:
A webpage classification method and apparatus, a computing device and a machine readable storage medium are disclosed. Each corpus word in a corpus is converted into a vector by using a word-to-vector tool word2vec, and therefore a processing process such as comparison between corpus words or similarity analysis is converted into vector calculation, so as to more conveniently implement computer automation, thereby improving webpage classification efficiency. Moreover, corresponding corpus words are screened according to preset classification seed words, and a corpus word unrelated to a webpage type may be removed, thereby improving webpage classification accuracy.
Public/Granted literature
- US20180218241A1 WEBPAGE CLASSIFICATION METHOD AND APPARATUS, CALCULATION DEVICE AND MACHINE READABLE STORAGE MEDIUM Public/Granted day:2018-08-02
Information query