Apparatus and methods for classification of web sites
    1.
    发明授权
    Apparatus and methods for classification of web sites 有权
    网站分类的设备和方法

    公开(公告)号:US07792951B2

    公开(公告)日:2010-09-07

    申请号:US10315705

    申请日:2002-12-10

    IPC分类号: G06F15/173

    CPC分类号: G06F17/3071

    摘要: Apparatus and methods for classifying web sites are provided. With the apparatus and methods, traffic data is obtained for a plurality of web sites. This patterns, or templates, for each web site are generated based on this traffic data and the patterns are clustered into classes of web sites using a clustering algorithm. The clusters, or classes, are then profiled to generate a template for each class. The template for each class is generated by first shifting the patterns for each web site that is part of the class to compensate for effects like time zone differences, if any, and then identifying a pattern that is most similar to all of the patterns in the class. Once the template for each class is generated, this template is then used with traffic data from a new web site to classify the new web site into one of the existing classes. In other words, when traffic data for a new web site is received, a pattern for the traffic data of the new web site is generated and compared to the templates for the various classes. If a matching class template is identified, the new web site is classified into the corresponding class. If the pattern for the new web site does not match any of the existing templates, a new template and class may be generated based on the pattern for the new web site.

    摘要翻译: 提供了分类网站的装置和方法。 利用该装置和方法,获得多个网站的交通数据。 基于该流量数据生成每个网站的这种模式或模板,并且使用聚类算法将模式聚类成网站类。 然后,对集群或类进行概要分析以为每个类生成一个模板。 每个类的模板是通过首先移动作为类的一部分的每个网站的模式来生成的,以补偿诸如时​​区差异的效果(如果有的话),然后识别最相似于所有模式中的模式 类。 一旦生成了每个类的模板,该模板随后与来自新网站的流量数据一起使用,将新网站分类到现有的一个类中。 换句话说,当接收到新的网站的交通数据时,生成用于新网站的交通数据的模式,并与各种类别的模板进行比较。 如果识别出匹配的类模板,则将新的网站分类到相应的类中。 如果新网站的模式与任何现有模板不匹配,则可能会根据新网站的模式生成新的模板和类。

    System for wireless push and pull based services
    2.
    发明授权
    System for wireless push and pull based services 失效
    无线推挽式服务系统

    公开(公告)号:US07058691B1

    公开(公告)日:2006-06-06

    申请号:US09591746

    申请日:2000-06-12

    IPC分类号: G06F9/00

    摘要: The present invention relates to a method and system for providing Web content from pull and push based services running on Web content providers to mobile users. A proxy gateway connects the mobile users to the Web content providers. A prefetching module is used at the proxy gateway to optimize performance of the pull services by reducing average access latency. The average access latency can be reduced by using at least three factors: one related to the frequency of access to the pull content; second, the update cycle of the pull content determined by the Web content providers; and third, the response delay for fetching pull content from the content provider to the proxy gateway. Pull content, such as documents, having the greatest average access latency are sorted and a predetermined number of the documents are prefetched into the cache. Push services are optimized by iteratively estimating a state of each of the mobile users to determine relevant push content to be forward to the mobile user.

    摘要翻译: 本发明涉及一种从在Web内容提供商上运行的基于拉和推的服务向移动用户提供Web内容的方法和系统。 代理网关将移动用户连接到Web内容提供商。 代理网关使用预取模块,通过减少平均访问延迟来优化拉服务的性能。 通过使用至少三个因素可以减少平均访问延迟:一个与访问拉取内容的频率相关; 第二,Web内容提供商确定的拉取内容的更新周期; 以及第三,从内容提供商提取到代理网关的提取响应的响应延迟。 将具有最大平均访问延迟的诸如文档的拉取内容排序并将预定数量的文档预取到高速缓存中。 推送服务通过迭代地估计每个移动用户的状态来优化,以确定要向移动用户转发的相关推送内容。

    System for wireless push and pull based services
    4.
    发明授权
    System for wireless push and pull based services 有权
    无线推挽式服务系统

    公开(公告)号:US07284035B2

    公开(公告)日:2007-10-16

    申请号:US11118527

    申请日:2005-04-29

    IPC分类号: G06F9/00

    摘要: The present invention relates to a method and system for providing Web content from pull and push based services running on Web content providers to mobile users. A proxy gateway connects the mobile users to the Web content providers. A prefetching module is used at the proxy gateway to optimize performance of the pull services by reducing average access latency. The average access latency can be reduced by using at least three factors: one related to the frequency of access to the pull content; second, the update cycle of the pull content determined by the Web content providers; and third, the response delay for fetching pull content from the content provider to the proxy gateway. Pull content, such as documents, having the greatest average access latency are sorted and a predetermined number of the documents are prefetched into the cache. Push services are optimized by iteratively estimating a state of each of the mobile users to determine relevant push content to be forward to the mobile user.

    摘要翻译: 本发明涉及一种从在Web内容提供商上运行的基于拉和推的服务向移动用户提供Web内容的方法和系统。 代理网关将移动用户连接到Web内容提供商。 代理网关使用预取模块,通过减少平均访问延迟来优化拉服务的性能。 通过使用至少三个因素可以减少平均访问延迟:一个与访问拉取内容的频率相关; 第二,Web内容提供商确定的拉取内容的更新周期; 以及第三,从内容提供商提取到代理网关的提取响应的响应延迟。 将具有最大平均访问延迟的诸如文档的拉取内容排序并将预定数量的文档预取到高速缓存中。 推送服务通过迭代地估计每个移动用户的状态来优化,以确定要向移动用户转发的相关推送内容。

    Apparatus and methods for co-location and offloading of web site traffic based on traffic pattern recognition
    5.
    发明授权
    Apparatus and methods for co-location and offloading of web site traffic based on traffic pattern recognition 有权
    基于流量模式识别的网站流量共同定位和卸载的装置和方法

    公开(公告)号:US07386611B2

    公开(公告)日:2008-06-10

    申请号:US10315335

    申请日:2002-12-10

    IPC分类号: G06F15/173

    摘要: Apparatus and methods for identifying traffic patterns to web sites based on templates that characterize the arrival of traffic to the web sites are provided. Based on these templates, determinations are made as to which web sites should be co-located so as to optimize resource allocation. Specifically, web sites whose templates are complimentary, i.e. a first web site having a peak in arrival traffic at time t1 and a second web site that has a trough in arrival traffic at time t1, are designated as being candidates for co-location. In addition, the present invention uses the templates identified for the traffic patterns of web sites to determine thresholds for offloading traffic to other servers. These thresholds include a first threshold at which offloading should be performed, a second threshold that takes into consideration the lead time needed to begin offloading, and a third threshold that takes into consideration a lag time needed to stop all offloading of traffic to the other servers.

    摘要翻译: 提供了基于表征网站到达流量的模板来识别网站的流量模式的装置和方法。 基于这些模板,确定哪些网站应该位于同一位置,以优化资源分配。 具体来说,其模板是免费的网站,即在时间t 1处具有到达业务峰值的第一网站和在时间t 1处具有到达业务波谷的第二网站被指定为用于共同定位的候选 。 此外,本发明使用为网站的流量模式识别的模板来确定将流量卸载到其他服务器的阈值。 这些阈值包括应该执行卸载的第一阈值,考虑到开始卸载所需的前置时间的第二阈值,以及考虑到停止对其他服务器的所有卸载流量所需的滞后时间的第三阈值 。

    APPARATUS AND METHODS FOR CO-LOCATION AND OFFLOADING OF WEB SITE TRAFFIC BASED ON TRAFFIC PATTERN RECOGNITION
    6.
    发明申请
    APPARATUS AND METHODS FOR CO-LOCATION AND OFFLOADING OF WEB SITE TRAFFIC BASED ON TRAFFIC PATTERN RECOGNITION 失效
    基于交通图案识别的网站交通协同放置的装置和方法

    公开(公告)号:US20080091826A1

    公开(公告)日:2008-04-17

    申请号:US11952641

    申请日:2007-12-07

    IPC分类号: G06F15/173

    摘要: Apparatus and methods for identifying traffic patterns to web sites based on templates that characterize the arrival of traffic to the web sites are provided. Based on these templates, determinations are made as to which web sites should be co-located so as to optimize resource allocation. Specifically, web sites whose templates are complimentary, i.e. a first web site having a peak in arrival traffic at time t1 and a second web site that has a trough in arrival traffic at time t1, are designated as being candidates for co-location. In addition, the present invention uses the templates identified for the traffic patterns of web sites to determine thresholds for offloading traffic to other servers. These thresholds include a first threshold at which offloading should be performed, a second threshold that takes into consideration the lead time needed to begin offloading, and a third threshold that takes into consideration a lag time needed to stop all offloading of traffic to the other servers.

    摘要翻译: 提供了基于表征网站到达流量的模板来识别网站的流量模式的装置和方法。 基于这些模板,确定哪些网站应该位于同一位置,以优化资源分配。 具体来说,其模板是免费的,即在时间t 1处具有到达业务峰值的第一网站和在时间t 1处具有到达业务的波谷的第二网站被指定为用于共同定位的候选 。 此外,本发明使用为网站的流量模式识别的模板来确定将流量卸载到其他服务器的阈值。 这些阈值包括应当执行卸载的第一阈值,考虑到开始卸载所需的前置时间的第二阈值,以及考虑到停止对其他服务器的所有卸载流量所需的滞后时间的第三阈值 。