-
公开(公告)号:US10387545B2
公开(公告)日:2019-08-20
申请号:US14549394
申请日:2014-11-20
Applicant: Alibaba Group Holding Limited
Inventor: Sha Chen , Menghui Chen , Honghua He , Zhang Liu , Yining Chen
IPC: G06F17/22 , G06F16/951
Abstract: Example methods and devices for processing a page are described. One or more pages of a designated website are acquired. The one or more pages are clustered to obtain one or more classes in accordance with page features of the pages. At least one class is selected as a list page set according to a page linking relationship between the one or more classes. It is not necessary to require an operator to manually involve in the process of establishing the list page set. The present techniques have simple operations and high accuracy rate, thereby improving an efficiency and reliability of establishing a list page library.
-
公开(公告)号:US20150143214A1
公开(公告)日:2015-05-21
申请号:US14549394
申请日:2014-11-20
Applicant: Alibaba Group Holding Limited
Inventor: Sha Chen , Menghui Chen , Honghua He , Zhang Liu , Yining Chen
IPC: G06F17/22
CPC classification number: G06F17/2235 , G06F16/951
Abstract: Example methods and devices for processing a page are described. One or more pages of a designated website are acquired. The one or more pages are clustered to obtain one or more classes in accordance with page features of the pages. At least one class is selected as a list page set according to a page linking relationship between the one or more classes. It is not necessary to require an operator to manually involve in the process of establishing the list page set. The present techniques have simple operations and high accuracy rate, thereby improving an efficiency and reliability of establishing a list page library.
Abstract translation: 描述用于处理页面的示例方法和设备。 获取一个或多个指定网站的页面。 一个或多个页面被聚类以根据页面的页面特征来获得一个或多个类别。 根据一个或多个类之间的页面链接关系,至少选择一个类作为列表页面集合。 没有必要要求操作者手动参与建立列表页面集合的过程。 本技术操作简单,准确率高,提高了建立列表页库的效率和可靠性。
-