-
公开(公告)号:US20220357985A1
公开(公告)日:2022-11-10
申请号:US17738909
申请日:2022-05-06
Applicant: Google LLC
Inventor: Jeffrey Adgate Dean , Sudip Roy , Michael Acheson Isard , Aakanksha Chowdhery , Brennan Saeta , Chandramohan Amyangot Thekkath , Daniel William Hurt , Hyeontaek Lim , Laurent El Shafey , Parker Edward Schuh , Paul Ronald Barham , Ruoming Pang , Ryan Sepassi , Sanjay Ghemawat , Yonghui Wu
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for distributing machine learning workloads, e.g., computations for training a neural network or computing an inference using a neural network, across multiple hardware accelerators. One of the systems comprises a plurality of accelerator islands, each hardware accelerator island comprising a respective plurality of hardware devices that include a plurality of hardware accelerators and a corresponding host for each of the plurality of hardware accelerators; and a respective scheduler for each of the accelerator islands that is configured to schedule workloads across the plurality of accelerators and corresponding hosts in the accelerator island, wherein the system is configured to: receive data representing a machine learning workload; and assign a respective portion of the machine learning workload to each of the plurality of accelerator islands for scheduling by the respective scheduler for the accelerator island.
-
公开(公告)号:US11087216B2
公开(公告)日:2021-08-10
申请号:US17015196
申请日:2020-09-09
Applicant: Google LLC
Inventor: Vijay Vasudevan , Jeffrey Adgate Dean , Sanjay Ghemawat
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for modifying a computational graph to include send and receive nodes. Communication between unique devices performing operations of different subgraphs of the computational graph can be handled efficiently by inserting send and receive nodes into each subgraph. When executed, the operations that these send and receive nodes represent may enable pairs of unique devices to conduct communication with each other in a self-sufficient manner. This shifts the burden of coordinating communication away from the backend, which affords the system that processes this computational graph representation the opportunity to perform one or more other processes while devices are executing subgraphs.
-
13.
公开(公告)号:US20210117401A1
公开(公告)日:2021-04-22
申请号:US17134862
申请日:2020-12-28
Applicant: Google LLC
Inventor: Jeffrey Dean , Sanjay Ghemawat
IPC: G06F16/22 , G06F9/48 , G06F16/2453 , G06F16/23 , G06F9/54
Abstract: A method performs large-scale data processing in a distributed and parallel processing environment. The method defines application-independent map and reduce operations, each invoking one or more library functions that automatically handle data partitioning, parallelization of computations, and fault tolerance. A user specifies a map operation, which calls one or more of the application-independent map operators to perform data read and write operations. A user also specifies a reduce operation, which calls one or more of the application-independent reduce operators to perform data read and write operations. The method executes application-independent map worker processes. Each map worker process executes the user-specified map operation to read designated portions of input files and store intermediate data values in intermediate data structures. The method also executes application-independent reduce worker processes. Each reduce worker process executes the user-specified reduce operation to read intermediate data values from the intermediate data structures and produce final output data.
-
公开(公告)号:US10783435B2
公开(公告)日:2020-09-22
申请号:US15338225
申请日:2016-10-28
Applicant: Google LLC
Inventor: Vijay Vasudevan , Jeffrey Adgate Dean , Sanjay Ghemawat
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for modifying a computational graph to include send and receive nodes. Communication between unique devices performing operations of different subgraphs of the computational graph can be handled efficiently by inserting send and receive nodes into each subgraph. When executed, the operations that these send and receive nodes represent may enable pairs of unique devices to conduct communication with each other in a self-sufficient manner. This shifts the burden of coordinating communication away from the backend, which affords the system that processes this computational graph representation the opportunity to perform one or more other processes while devices are executing subgraphs.
-
公开(公告)号:US10534997B2
公开(公告)日:2020-01-14
申请号:US15965742
申请日:2018-04-27
Applicant: Google LLC
Inventor: Paul A. Tucker , Jeffrey Adgate Dean , Sanjay Ghemawat , Yuan Yu
IPC: G06E1/00 , G06E3/00 , G06F15/18 , G06G7/00 , G06N3/08 , G06F9/50 , G06N3/063 , G06N3/04 , G06N5/04
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for receiving a request from a client to process a computational graph; obtaining data representing the computational graph, the computational graph comprising a plurality of nodes and directed edges, wherein each node represents a respective operation, wherein each directed edge connects a respective first node to a respective second node that represents an operation that receives, as input, an output of an operation represented by the respective first node; identifying a plurality of available devices for performing the requested operation; partitioning the computational graph into a plurality of subgraphs, each subgraph comprising one or more nodes in the computational graph; and assigning, for each subgraph, the operations represented by the one or more nodes in the subgraph to a respective available device in the plurality of available devices for operation.
-
公开(公告)号:US10354186B2
公开(公告)日:2019-07-16
申请号:US15965745
申请日:2018-04-27
Applicant: Google LLC
Inventor: Vijay Vasudevan , Jeffrey Adgate Dean , Sanjay Ghemawat
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for modifying a computational graph to include send and receive nodes. Communication between unique devices performing operations of different subgraphs of the computational graph can be handled efficiently by inserting send and receive nodes into each subgraph. When executed, the operations that these send and receive nodes represent may enable pairs of unique devices to conduct communication with each other in a self-sufficient manner. This shifts the burden of coordinating communication away from the backend, which affords the system that processes this computational graph representation the opportunity to perform one or more other processes while devices are executing subgraphs.
-
17.
公开(公告)号:US10296500B2
公开(公告)日:2019-05-21
申请号:US15479228
申请日:2017-04-04
Applicant: Google LLC
Inventor: Jeffrey Dean , Sanjay Ghemawat
Abstract: A method performs large-scale data processing in a distributed and parallel processing environment. The method defines application-independent map and reduce operations, each invoking one or more library functions that automatically handle data partitioning, parallelization of computations, and fault tolerance. A user specifies a map operation, which calls one or more of the application-independent map operators to perform data read and write operations. A user also specifies a reduce operation, which calls one or more of the application-independent reduce operators to perform data read and write operations. The method executes application-independent map worker processes. Each map worker process executes the user-specified map operation to read designated portions of input files and store intermediate data values in intermediate data structures. The method also executes application-independent reduce worker processes. Each reduce worker process executes the user-specified reduce operation to read intermediate data values from the intermediate data structures and produce final output data.
-
公开(公告)号:US10204110B2
公开(公告)日:2019-02-12
申请号:US15269788
申请日:2016-09-19
Applicant: GOOGLE LLC
Inventor: Yasushi Saito , Sanjay Ghemawat , Jeffrey Adgate Dean
IPC: G06F17/30
Abstract: A method for deleting obsolete files from a file system is provided. The method includes receiving a request to delete a reference to a first target file of a plurality of target files stored in a file system, the first target file having a first target file name. A first reference file whose file name includes the first target file name is identified. The first reference file is deleted from the file system. The method further includes determining whether the file system includes at least one reference file, distinct from the first reference file, whose file name includes the first target file name. In accordance with a determination that the file system does not include the at least one reference file, the first target file is deleted from the file system.
-
公开(公告)号:US20180247198A1
公开(公告)日:2018-08-30
申请号:US15965745
申请日:2018-04-27
Applicant: Google LLC
Inventor: Vijay Vasudevan , Jeffrey Adgate Dean , Sanjay Ghemawat
CPC classification number: G06N3/082 , G06F9/5038 , G06F9/5066 , G06F9/547 , G06F17/16 , G06N3/04 , G06N3/0454 , G06N3/063 , G06N3/084
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for modifying a computational graph to include send and receive nodes. Communication between unique devices performing operations of different subgraphs of the computational graph can be handled efficiently by inserting send and receive nodes into each subgraph. When executed, the operations that these send and receive nodes represent may enable pairs of unique devices to conduct communication with each other in a self-sufficient manner. This shifts the burden of coordinating communication away from the backend, which affords the system that processes this computational graph representation the opportunity to perform one or more other processes while devices are executing subgraphs.
-
公开(公告)号:US11822521B2
公开(公告)日:2023-11-21
申请号:US17671068
申请日:2022-02-14
Applicant: Google LLC
Inventor: Jeffrey Adgate Dean , Sanjay Ghemawat , Andrew Fikes , Yasushi Saito
IPC: G06F16/182 , G06F16/22 , G06F9/50 , G06F16/13 , H04L67/1001 , H04L67/1004 , H04L67/1029
CPC classification number: G06F16/182 , G06F9/5083 , G06F16/13 , G06F16/184 , G06F16/22 , H04L67/1001 , H04L67/1004 , H04L67/1029
Abstract: A method of accessing data includes storing a table that includes a plurality of tablets corresponding to distinct non-overlapping table portions. Respective pluralities of tablet access objects and application objects are stored in a plurality of servers. A distinct application object and distinct tablet are associated with each tablet access object. Each application object corresponds to a distinct instantiation of an application associated with the table. The tablet access objects and associated application objects are redistributed among the servers in accordance with a first load-balancing criterion. A first request directed to a respective tablet is received from a client. In response, the tablet access object associated with the respective tablet is used to perform a data access operation on the respective tablet, and the application object associated with the respective tablet is used to perform an additional computational operation to produce a result to be returned to the client.
-
-
-
-
-
-
-
-
-