摘要:
Some embodiments of the invention employ algorithms enabling the calculation of one or more statistical moments in a single pass of a dataset. For example, some embodiments may apply algorithms for calculating statistical moments to a dataset using a map-reduce framework, whereby an input dataset is partitioned into multiple shards, a separate map process is used to apply an algorithm enabling calculation of one or more statistical moments in a single scan to each shard, and one or more reduce processes consolidate the results generated by the map processes to calculate the one or more statistical moments across the entire dataset. In other embodiments of the invention, a map-reduce framework may be employed to apply algorithms enabling calculation of a covariance between data elements expressed in a dataset, instead of or in addition to one or more statistical moments.
摘要:
A canopy clustering process merges at least one set of multiple single-center canopies together into a merged multi-center canopy. Multi-center canopies, as well as the single-center canopies, can then be used to partition data objects in a dataset. The multi-center canopies allow a canopy assignment condition constraint to be relaxed without risk of leaving any data objects in a dataset outside of all canopies. Approximate distance calculations can be used as similarity metrics to define and merge canopies and to assign data objects to canopies. In one implementation, a distance between a data object and a canopy is represented as the minimum of the distances between the data object and each center of a canopy (whether merged or unmerged), and the distance between two canopies is represented as the minimum of the distances for each pairing of the center(s) in one canopy and the center(s) in the other canopy.
摘要:
A light emitting device comprising a first semiconductor layer, a second semiconductor layer and a quantum well layer, wherein the first semiconductor layer and the second semiconductor layer are disposed on the opposite sides of the quantum well layer, the quantum well layer comprising a plurality of quantum well rods which are separated from each other, and each of the quantum well rods has only one quantum well.
摘要:
Architecture that automatically detects historical search contexts as well as behaviors related to a search query. Machine learning and hand-authored rules are employed to automatically identify search contexts. Historical information likely to be useful in the current context is surfaced. When a user enters a search query or executes another search behavior, past behaviors are exposed which are contextually related to the current behavior. The architecture also provides automatic discovery of historical contexts, features related to the contexts, and training or authoring of a system for classifying behavior into contexts, using some combination of the machine learning and/or hand-authored rules. A runtime system classifies the current user behavior into a context and surfaces contextual information to the user.
摘要:
An illuminating brick includes a block and at least one light-emitting element mounted in the brick. The brick has a top face, a bottom face and a plurality of lateral side surfaces interconnecting the top and bottom faces. The at least one light-emitting element is engaged in and optically coupled to at least one of the bottom face and lateral side surfaces. The lateral side surfaces and the bottom face are configured for reflecting and directing light emitted from the at least a light-emitting element to exit through the top face.
摘要:
The application illustrates a light-emitting device including a contact layer and a current spreading layer on the contact layer. A part of the contact layer is a rough structure and a part of the contact layer is a flat structure. A part of the current spreading layer is a rough structure and a part of the current spreading layer is a flat structure. The rough region of the contact layer and the rough region of the current spreading layer are substantially overlapped.
摘要:
A light emitting diode (LED) illumination device (10) includes an LED (11), a frustum-shaped light guide member (12) and a light diffusing plate (13). The frustum-shaped light guide member (12) has a light input surface (120) and a light output surface (122) opposite to the light input surface (120). The light guide member (12) tapers from the light output surface (122) to the light input surface (120). The light input surface (120) is optically coupled to the LED (11), and the light diffusing plate (13) is optically coupled to the light output surface (122) of the light guide member (12).
摘要:
A MapReduce architecture may be utilized for sequence alignment algorithm processing (such as BLAST or BLAST-like algorithms). In addition, a MapReduce architecture may be extended such that memory of the computing devices of a MapReduce-configured system may be shared between different jobs of sequence alignment and/or other bioinformatics algorithm processing, thereby reducing overhead associated with executing such jobs using the MapReduce-configured system.
摘要:
Techniques for identifying influential users of a social networking service are provided. Influential users may be identified via an algorithm in which an influence score is assigned to each user based at least in part on other members of the community users having taken an affirmative step with respect to the user's communications. Iterative processing may be performed, with each user's influence score being determined by contributions from other users, and each contribution being determined by the contributor's influence score as of a prior iteration. A map-reduce framework may be employed, with data representing the community being partitioned into a plurality of discrete shards, a map process corresponding to each shard calculating an influence score for users represented in the shard, and reduce processes ranking users according to influence score across all shards.
摘要:
A canopy clustering process merges at least one set of multiple single-center canopies together into a merged multi-center canopy. Multi-center canopies, as well as the single-center canopies, can then be used to partition data objects in a dataset. The multi-center canopies allow a canopy assignment condition constraint to be relaxed without risk of leaving any data objects in a dataset outside of all canopies. Approximate distance calculations can be used as similarity metrics to define and merge canopies and to assign data objects to canopies. In one implementation, a distance between a data object and a canopy is represented as the minimum of the distances between the data object and each center of a canopy (whether merged or unmerged), and the distance between two canopies is represented as the minimum of the distances for each pairing of the center(s) in one canopy and the center(s) in the other canopy.