发明申请
US20080275890A1 System and method for smoothing hierarchical data using isotonic regression 有权
使用等渗回归平滑分层数据的系统和方法

System and method for smoothing hierarchical data using isotonic regression
摘要:
An improved system and method is provided for detecting a web page template. A web page template detector may be provided for performing page-level template detection on a web page. In general, the web page template classifier may be trained using automatically generated training data, and then the web page template classifier may be applied to web pages to identify web page templates. A web page template may be detected by classifying segments of a web page as template structures, by assigning classification scores to the segments of the web page classified as template structures, and then by smoothing the classification scores assigned to the segments of the web page. Generalized isotonic regression may be applied for smoothing scores associated with the nodes of a hierarchy by minimizing an optimization function using dynamic programming.
信息查询
0/0