Abstract:
Executing a plurality of transform stages in an extract, transform and load (ETL) job including, for each of the transform stages, receiving a plurality of input row identifiers (RIDs) corresponding to a first plurality of source database table rows in a source database table. Data is retrieved directly from a subset of the source database table columns in the first plurality of source database table rows based on the input RIDs and transform logic. Partial row data including data from the subset of the source database table columns is generated for each of the first plurality of source database table rows. Transformed data is generated based on the partial row data and the transform logic. Output RIDs corresponding to a second plurality of rows in the source database table that include at least a subset of the transformed data are output to a downstream stage.
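The RID-passing idea in this abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the abstract: the in-memory `table` dictionary stands in for the source database table, and `run_stage`, the column names, and the two example stages are hypothetical. Each stage reads only the columns its logic needs, writes its result back into the table, and hands RIDs (not rows) to the next stage.

```python
from typing import Callable, Dict, Iterable, List, Optional

Row = Dict[str, object]
Table = Dict[int, Row]  # RID -> full row; a stand-in for the source database table


def run_stage(
    table: Table,
    input_rids: Iterable[int],
    columns: List[str],
    transform: Callable[[Row], Optional[Row]],
) -> List[int]:
    """Run one hypothetical transform stage and return the output RIDs."""
    output_rids = []
    for rid in input_rids:
        # Partial row data: only the columns this stage's logic needs.
        partial = {col: table[rid][col] for col in columns}
        result = transform(partial)
        if result is None:
            continue  # the transform logic dropped this row
        # Store the transformed data in the table row and pass on the RID only.
        table[rid].update(result)
        output_rids.append(rid)
    return output_rids


table: Table = {
    1: {"name": "bolt", "price": 2.5, "qty": 10, "notes": "long free-text column"},
    2: {"name": "gear", "price": 40.0, "qty": 3, "notes": "long free-text column"},
    3: {"name": "shaft", "price": 120.0, "qty": 1, "notes": "long free-text column"},
}

# Stage 1: compute a total from two columns; "name" and "notes" are never read.
rids_after_total = run_stage(
    table, table.keys(), ["price", "qty"],
    lambda r: {"total": r["price"] * r["qty"]},
)

# Stage 2: keep only large orders and tag them; reads just the "total" column.
rids_after_filter = run_stage(
    table, rids_after_total, ["total"],
    lambda r: {"tier": "large"} if r["total"] >= 100 else None,
)
```

In this sketch the stages exchange only small lists of integers, so wide columns such as `notes` never travel between stages. A real implementation would issue column-restricted reads against the database rather than index a dictionary.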
Abstract:
A method, computer program product, and system for enabling parallel processing of an XML document without pre-parsing, utilizing metadata associated with the XML document and created at the same time as the XML document. The metadata is used to generate partitions of the XML document at the time of parallel processing, without requiring system-intensive pre-parsing.
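One way to picture metadata-driven partitioning is the sketch below. The abstract does not say what the metadata contains, so this example assumes it is a list of byte spans, one per record element, recorded while the document is written. The function names (`write_document`, `make_partitions`, `process_partition`) and the `<items>`/`<item>` layout are hypothetical.

```python
import xml.etree.ElementTree as ET
from concurrent.futures import ThreadPoolExecutor


def write_document(records):
    """Build the XML bytes and, in the same pass, record each <item>'s byte span."""
    header = b"<items>"
    parts = [header]
    spans = []  # assumed metadata: (start, end) byte offsets of every <item>
    pos = len(header)
    for name, value in records:
        # Sketch only: names and values are assumed not to need XML escaping.
        chunk = f'<item name="{name}">{value}</item>'.encode("utf-8")
        spans.append((pos, pos + len(chunk)))
        parts.append(chunk)
        pos += len(chunk)
    parts.append(b"</items>")
    return b"".join(parts), spans


def make_partitions(spans, n):
    """Group the recorded spans into at most n contiguous partitions."""
    size = max(1, -(-len(spans) // n))  # ceiling division, never zero
    return [spans[i:i + size] for i in range(0, len(spans), size)]


def process_partition(doc, spans):
    """Parse only the slices in this partition and sum their integer values."""
    total = 0
    for start, end in spans:
        elem = ET.fromstring(doc[start:end])
        total += int(elem.text)
    return total


doc, spans = write_document([("a", 1), ("b", 2), ("c", 3), ("d", 4), ("e", 5)])
partitions = make_partitions(spans, 2)
with ThreadPoolExecutor(max_workers=2) as pool:
    partial_sums = list(pool.map(lambda p: process_partition(doc, p), partitions))
```

No worker scans the whole document to find record boundaries; the spans captured at write time tell each worker exactly which bytes to parse. Threads are used here only to keep the sketch short; a process pool or separate machines would follow the same pattern.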
Abstract:
Methods and arrangements for extracting tuples from a streaming XML document. A query twig is applied to the XML document stream, tuples are extracted from the XML document stream based on the query twig, and the quantity of extracted tuples is limited by forgoing extraction of duplicate tuples and extraction of tuples that do not satisfy the query twig criteria.
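As a rough illustration of twig-based extraction over a stream, the sketch below uses a deliberately simplified, one-level twig: a root tag plus a list of required child tags. This is an assumption for the example, not the algorithm in the abstract, and `extract_tuples` is a hypothetical name. It relies on the standard library's `xml.etree.ElementTree.iterparse` to read the document incrementally.

```python
import io
import xml.etree.ElementTree as ET
from itertools import product


def extract_tuples(stream, root_tag, branch_tags):
    """Yield distinct tuples matching a one-level twig: root_tag with branch_tags children."""
    seen = set()
    for _event, elem in ET.iterparse(stream, events=("end",)):
        if elem.tag != root_tag:
            continue
        # One list of text values per twig branch.
        branches = [[child.text for child in elem.findall(tag)] for tag in branch_tags]
        # If any branch is empty, product() yields nothing, so elements that
        # do not satisfy the twig produce no tuples at all.
        for tup in product(*branches):
            if tup not in seen:  # skip duplicates instead of emitting them
                seen.add(tup)
                yield tup
        elem.clear()  # release the subtree once it has been handled


xml_bytes = (
    b"<library>"
    b"<book><title>A</title><author>X</author><author>Y</author></book>"
    b"<book><title>B</title></book>"
    b"<book><title>A</title><author>X</author></book>"
    b"<book><title>C</title><author>Z</author></book>"
    b"</library>"
)

tuples = list(extract_tuples(io.BytesIO(xml_bytes), "book", ["title", "author"]))
```

Here the second `<book>` has no `<author>` and contributes nothing, and the third repeats a tuple already seen, so `tuples` holds three entries. The `seen` set grows with the number of distinct tuples, which a production design would need to bound or spill.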
Abstract:
Method, system, and computer program product for performing an operation, the operation including broadcasting one or more media streams to one or more remote devices, the remote devices configured to provide feedback related to the one or more media streams, the remote devices configured to display a media stream on a television coupled to the remote device. The operation then aggregates, based on a predefined aggregation criterion, feedback received from the remote devices related to a first media stream. The operation then, upon determining that the aggregated feedback of the first media stream exceeds a predefined threshold for the predefined aggregation criterion, marks the first media stream based on the predefined aggregation criterion, the remote devices configured to, responsive to receiving the marked first media stream, perform a predefined operation to effect prioritized display of the first media stream on the television.
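The aggregate, threshold, and mark steps can be sketched as follows. The abstract leaves the aggregation criterion, the threshold, and the device-side operation open, so this example assumes a simple count of feedback of one kind (for example "like") and assumes a device that lists marked streams first. `Stream`, `aggregate`, `mark_streams`, and `device_display_order` are hypothetical names.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Stream:
    stream_id: str
    marks: set = field(default_factory=set)  # criteria this stream was marked for


def aggregate(feedback, criterion):
    """Count feedback items of the given kind per stream (assumed criterion)."""
    return Counter(stream_id for stream_id, kind in feedback if kind == criterion)


def mark_streams(streams, feedback, criterion, threshold):
    """Mark every stream whose aggregated feedback exceeds the threshold."""
    counts = aggregate(feedback, criterion)
    for stream in streams.values():
        if counts[stream.stream_id] > threshold:
            stream.marks.add(criterion)


def device_display_order(streams):
    """Assumed device behavior: marked streams first, original order otherwise."""
    ordered = sorted(streams.values(), key=lambda s: not s.marks)  # stable sort
    return [s.stream_id for s in ordered]


streams = {sid: Stream(sid) for sid in ("news", "sports", "movie")}
feedback = [
    ("sports", "like"), ("sports", "like"), ("sports", "like"),
    ("news", "like"),
    ("movie", "dislike"), ("movie", "dislike"),
]

mark_streams(streams, feedback, criterion="like", threshold=2)
order = device_display_order(streams)
```

With a threshold of 2, only `sports` (three likes) is marked, so it moves to the front of the display order while the other streams keep their relative positions.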