- 专利标题: System and method for load shedding in data mining and knowledge discovery from stream data
-
申请号: US11058944申请日: 2005-02-16
-
公开(公告)号: US20060184527A1公开(公告)日: 2006-08-17
- 发明人: Yun Chi , Haixun Wang , Philip Yu
- 申请人: Yun Chi , Haixun Wang , Philip Yu
- 申请人地址: US NY Armonk
- 专利权人: IBM Corporation
- 当前专利权人: IBM Corporation
- 当前专利权人地址: US NY Armonk
- 主分类号: H04L27/28
- IPC分类号: H04L27/28
摘要:
Load shedding schemes for mining data streams. A scoring function is used to rank the importance of stream elements, and those elements with high importance are investigated. In the context of not knowing the exact feature values of a data stream, the use of a Markov model is proposed herein for predicting the feature distribution of a data stream. Based on the predicted feature distribution, one can make classification decisions to maximize the expected benefits. In addition, there is proposed herein the employment of a quality of decision (QoD) metric to measure the level of uncertainty in decisions and to guide load shedding. A load shedding scheme such as presented herein assigns available resources to multiple data streams to maximize the quality of classification decisions. Furthermore, such a load shedding scheme is able to learn and adapt to changing data characteristics in the data streams.
公开/授权文献
信息查询