发明授权
- 专利标题: Finding partition boundaries for parallel processing of markup language documents
- 专利标题(中): 查找用于并行处理标记语言文档的分区边界
-
申请号: US12893248申请日: 2010-09-29
-
公开(公告)号: US09477651B2公开(公告)日: 2016-10-25
- 发明人: Manoj K. Agarwal , Amir Bar-Or , Manish Anand Bhide , Sebastian Ertel , Sriram K. Padmanabhan
- 申请人: Manoj K. Agarwal , Amir Bar-Or , Manish Anand Bhide , Sebastian Ertel , Sriram K. Padmanabhan
- 申请人地址: US NY Armonk
- 专利权人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 当前专利权人: INTERNATIONAL BUSINESS MACHINES CORPORATION
- 当前专利权人地址: US NY Armonk
- 代理机构: Susan Murray;SVL IPLaw
- 主分类号: G06F17/27
- IPC分类号: G06F17/27 ; G06F9/45 ; G06F17/22
摘要:
A method, a computer program product and a system identify partition locations within an extended markup language (XML) document without parsing so as to process portions of said document in parallel. The XML document includes sections required to remain continuous. The document is scanned for continuous sections without parsing, and boundaries of the initial partitions are adjusted to reside outside the continuous sections to determine resulting partitions for the document. The resulting partitions may be processed in parallel to provide the document information for storage.
公开/授权文献
信息查询