摘要:
A computer program product for provisioning resources in a balanced data transfer pipeline may associate a first requested task with a first category, allocate a first resource of a data path to the first requested task based on the first resource corresponding to the first category; and allocate a second resource as part of the data path to the first requested task based on the second resource corresponding to the first category, wherein the first resource corresponds to a first segment of the data path and the second resource corresponds to a second segment of the data path.
摘要:
A computer program product for provisioning resources in a balanced data transfer pipeline may associate a first requested task with a first category, allocate a first resource of a data path to the first requested task based on the first resource corresponding to the first category; and allocate a second resource as part of the data path to the first requested task based on the second resource corresponding to the first category, wherein the first resource corresponds to a first segment of the data path and the second resource corresponds to a second segment of the data path.
摘要:
Methods, systems and computer program products for data fetching in a networked computing environment. In one embodiment, the method comprises receiving a request from an analytical node for a set of data for a defined job, and identifying in networked storage a subset of the data for the job. The subset of data is loaded to the analytical node based on the sequence in which the data are projected to be accessed in the job. In an embodiment, the request includes a specification for the job, and the specification is analyzed to identify the subset of data. In one embodiment, the subset of data is identified by identifying another job having a relationship to the defined job, and identifying the data used for that other job. In an embodiment, the networked computing environment is a cloud computing environment, and the defined job is an analytics job.
摘要:
Methods, systems and computer program products for data fetching in a networked computing environment. In one embodiment, the method comprises receiving a request from an analytical node for a set of data for a defined job, and identifying in networked storage a subset of the data for the job. The subset of data is loaded to the analytical node based on the sequence in which the data are projected to be accessed in the job. In an embodiment, the request includes a specification for the job, and the specification is analyzed to identify the subset of data. In one embodiment, the subset of data is identified by identifying another job having a relationship to the defined job, and identifying the data used for that other job. In an embodiment, the networked computing environment is a cloud computing environment, and the defined job is an analytics job.
摘要:
A computer program product for provisioning resources in a balanced data transfer pipeline may associate a first requested task with a first category, allocate a first resource of a data path to the first requested task based on the first resource corresponding to the first category; and allocate a second resource as part of the data path to the first requested task based on the second resource corresponding to the first category, wherein the first resource corresponds to a first segment of the data path and the second resource corresponds to a second segment of the data path.
摘要:
Methods, systems and computer program products for data fetching in a networked computing environment. In one embodiment, the method comprises receiving a request from an analytical node for a set of data for a defined job, and identifying in networked storage a subset of the data for the job. The subset of data is loaded to the analytical node based on the sequence in which the data are projected to be accessed in the job. In an embodiment, the request includes a specification for the job, and the specification is analyzed to identify the subset of data. In one embodiment, the subset of data is identified by identifying another job having a relationship to the defined job, and identifying the data used for that other job. In an embodiment, the networked computing environment is a cloud computing environment, and the defined job is an analytics job.
摘要:
Methods, systems and computer program products for data fetching in a networked computing environment. In one embodiment, the method comprises receiving a request from an analytical node for a set of data for a defined job, and identifying in networked storage a subset of the data for the job. The subset of data is loaded to the analytical node based on the sequence in which the data are projected to be accessed in the job. In an embodiment, the request includes a specification for the job, and the specification is analyzed to identify the subset of data. In one embodiment, the subset of data is identified by identifying another job having a relationship to the defined job, and identifying the data used for that other job. In an embodiment, the networked computing environment is a cloud computing environment, and the defined job is an analytics job.
摘要:
A computer program product for provisioning resources in a balanced data transfer pipeline may associate a first requested task with a first category, allocate a first resource of a data path to the first requested task based on the first resource corresponding to the first category; and allocate a second resource as part of the data path to the first requested task based on the second resource corresponding to the first category, wherein the first resource corresponds to a first segment of the data path and the second resource corresponds to a second segment of the data path.