- 专利标题: System And Method For Large-Scale Data Processing Using An Application-Independent Framework
-
申请号: US18137695申请日: 2023-04-21
-
公开(公告)号: US20230385262A1公开(公告)日: 2023-11-30
- 发明人: Jeffrey Dean , Sanjay Ghemawat
- 申请人: Google LLC
- 申请人地址: US CA Mountain View
- 专利权人: Google LLC
- 当前专利权人: Google LLC
- 当前专利权人地址: US CA Mountain View
- 主分类号: G06F16/22
- IPC分类号: G06F16/22 ; G06F16/23 ; G06F16/2453 ; G06F9/48 ; G06F9/54
摘要:
A method performs large-scale data processing in a distributed and parallel processing environment. The method defines application-independent map and reduce operations, each invoking one or more library functions that automatically handle data partitioning, parallelization of computations, and fault tolerance. A user specifies a map operation, which calls one or more of the application-independent map operators to perform data read and write operations. A user also specifies a reduce operation, which calls one or more of the application-independent reduce operators to perform data read and write operations. The method executes application-independent map worker processes. Each map worker process executes the user-specified map operation to read designated portions of input files and store intermediate data values in intermediate data structures. The method also executes application-independent reduce worker processes. Each reduce worker process executes the user-specified reduce operation to read intermediate data values from the intermediate data structures and produce final output data.
信息查询