Invention Grant
- Patent Title: Method for computing frequency distribution for many fields in one pass in parallel
- Patent Title (中): 并行计算一次通过多个场的频率分布的方法
-
Application No.: US11271047Application Date: 2005-11-10
-
Publication No.: US07565349B2Publication Date: 2009-07-21
- Inventor: Michael James Beckerle , Jerry Lee Callen
- Applicant: Michael James Beckerle , Jerry Lee Callen
- Applicant Address: US NY Armonk
- Assignee: International Business Machines Corporation
- Current Assignee: International Business Machines Corporation
- Current Assignee Address: US NY Armonk
- Agency: Konrad Raynes & Victor LLP
- Agent Janaki K. Davda
- Main IPC: G06F7/00
- IPC: G06F7/00 ; G06F17/30

Abstract:
Provided are a techniques for determining a frequency distribution for a set of records. A count table of frequency distributions is built in memory for each field in the set of records, wherein each record of each count table includes a field identifier, a field value, and a count of a number of times the field value occurs in the set of records, and wherein the field identifier concatenated with the field value comprises a composite key value. It is determined that at least one count table of frequency distributions is approaching a maximum amount of memory allocated to that count table. The records of the at least one count table that is approaching the maximum amount of memory are sent for sorting and additional counting, wherein the records include composite key values.
Public/Granted literature
- US20070106666A1 Computing frequency distribution for many fields in one pass in parallel Public/Granted day:2007-05-10
Information query