Granted invention patent
- Patent title: Single pass workload directed clustering of XML documents
- Patent title (Chinese): 单通道工作负载定向聚类XML文档
- Application number: US10703250
- Filing date: 2003-11-07
- Publication number: US07512615B2
- Publication date: 2009-03-31
- Inventors: Rajesh Bordawekar, Sriram K. Padmanabhan, Oded Shmueli
- Applicants: Rajesh Bordawekar, Sriram K. Padmanabhan, Oded Shmueli
- Applicant address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current assignee: International Business Machines Corporation
- Current assignee address: US NY Armonk
- Patent agency: Ference & Associates LLC
- Main classification: G06F7/00
- IPC classification: G06F7/00; G06F17/00
Abstract:
A method and system for clustering of XML documents is disclosed. The method operates under specified memory-use constraints. The system implements the method and scans an XML document, assigns edge-weights according to the application workload, and maps clusters of XML nodes to disk pages, all in a single parser-controlled pass over the XML data. Application workload information is used to generate XML clustering solutions that lead to substantial reduction in page faults for the workload under consideration. Several approaches for representing workload information are disclosed. For example, the workload may list the XPath operators invoked during the application along with their invocation frequencies. The application workload can be further refined by incorporating additional features such as query importance or query compilation costs. XML access patterns could also be modeled using stochastic approaches.
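The idea in the abstract can be illustrated with a small Python sketch. This is not the patented method, only a simplified greedy stand-in under stated assumptions: the workload is reduced to a hypothetical dictionary mapping (parent tag, child tag) pairs to traversal frequencies, every node is assumed to occupy one unit of page space, and `page_capacity` is an assumed page size in nodes. The function name `cluster_single_pass` is illustrative. The sketch streams the document once, weights each parent-child edge from the workload, and merges the most heavily used children into the parent's page first.

```python
import io
import xml.etree.ElementTree as ET


def cluster_single_pass(xml_source, workload, page_capacity=4):
    """Greedy sketch: return a list of pages, each a list of node labels.

    workload: dict mapping (parent_tag, child_tag) -> access frequency
              (an assumed, simplified workload representation).
    page_capacity: assumed maximum number of nodes per disk page.
    """
    pages = []    # finished clusters, one per page
    stack = []    # open elements: (label, tag, [(weight, cluster), ...])
    counter = 0   # document-order node number, used to build labels

    for event, elem in ET.iterparse(xml_source, events=("start", "end")):
        if event == "start":
            counter += 1
            stack.append((f"{elem.tag}#{counter}", elem.tag, []))
            continue

        # "end" event: all children of this element are already clustered.
        label, tag, children = stack.pop()
        cluster = [label]
        # Consider the heaviest edges first (ties keep document order).
        for weight, child_cluster in sorted(children, key=lambda c: -c[0]):
            fits = len(cluster) + len(child_cluster) <= page_capacity
            if weight > 0 and fits:
                cluster.extend(child_cluster)   # co-locate with the parent
            else:
                pages.append(child_cluster)     # flush to its own page

        if stack:
            # Weight of the edge from the enclosing element to this one.
            weight = workload.get((stack[-1][1], tag), 0)
            stack[-1][2].append((weight, cluster))
        else:
            pages.append(cluster)               # root cluster
        elem.clear()                            # drop parsed content

    return pages


# Hypothetical workload: titles are read most often, then authors.
example_workload = {("book", "title"): 10, ("book", "author"): 5, ("lib", "book"): 1}
example_xml = (b"<lib><book><title/><author/><review/></book>"
               b"<book><title/><author/><review/></book></lib>")
pages = cluster_single_pass(io.BytesIO(example_xml), example_workload, page_capacity=3)
print(pages)
```

Note that this sketch keeps every child cluster of an open element in memory until that element closes, so it does not enforce the memory-use constraints the abstract mentions; a faithful implementation would need to flush clusters earlier when memory runs low.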