摘要:
A Multi-Petascale Highly Efficient Parallel Supercomputer of 100 petaOPS-scale computing, at decreased cost, power and footprint, and that allows for a maximum packaging density of processing nodes from an interconnect point of view. The Supercomputer exploits technological advances in VLSI that enables a computing model where many processors can be integrated into a single Application Specific Integrated Circuit (ASIC). Each ASIC computing node comprises a system-on-chip ASIC utilizing four or more processors integrated into one die, with each having full access to all system resources and enabling adaptive partitioning of the processors to functions such as compute or messaging I/O on an application by application basis, and preferably, enable adaptive partitioning of functions in accordance with various algorithmic phases within an application, or if I/O or other processors are underutilized, then can participate in computation or communication nodes are interconnected by a five dimensional torus network with DMA that optimally maximize the throughput of packet communications between nodes and minimize latency.
摘要翻译:具有100 petaOPS规模计算的多Petascale高效并行超级计算机,其成本,功耗和占地面积都在降低,并且允许从互连角度来看处理节点的最大封装密度。 超级计算机利用了VLSI的技术进步,实现了许多处理器可以集成到单个专用集成电路(ASIC)中的计算模型。 每个ASIC计算节点包括利用集成到一个管芯中的四个或更多个处理器的片上系统ASIC,每个处理器具有对所有系统资源的完全访问,并且使得处理器能够对诸如计算或消息传递I / O 并且优选地,根据应用内的各种算法阶段实现功能的自适应分割,或者如果I / O或其他处理器未被充分利用,则可以参与计算或通信节点通过五维环面网络互连 使用DMA来最大限度地最大化节点之间的分组通信的吞吐量并最小化等待时间。
摘要:
Methods, systems and computer program products are disclosed for measuring a performance of a program running on a processing unit of a processing system. In one embodiment, the method comprises informing a logic unit of each instruction in the program that is executed by the processing unit, assigning a weight to each instruction, assigning the instructions to a plurality of groups, and analyzing the plurality of groups to measure one or more metrics. In one embodiment, each instruction includes an operating code portion, and the assigning includes assigning the instructions to the groups based on the operating code portions of the instructions. In an embodiment, each type of instruction is assigned to a respective one of the plurality of groups. These groups may be combined into a plurality of sets of the groups.
摘要:
A Multi-Petascale Highly Efficient Parallel Supercomputer of 100 petaOPS-scale computing, at decreased cost, power and footprint, and that allows for a maximum packaging density of processing nodes from an interconnect point of view. The Supercomputer exploits technological advances in VLSI that enables a computing model where many processors can be integrated into a single Application Specific Integrated Circuit (ASIC). Each ASIC computing node comprises a system-on-chip ASIC utilizing four or more processors integrated into one die, with each having full access to all system resources and enabling adaptive partitioning of the processors to functions such as compute or messaging I/O on an application by application basis, and preferably, enable adaptive partitioning of functions in accordance with various algorithmic phases within an application, or if I/O or other processors are underutilized, then can participate in computation or communication nodes are interconnected by a five dimensional torus network with DMA that optimally maximize the throughput of packet communications between nodes and minimize latency.
摘要翻译:具有100 petaOPS规模计算的多Petascale高效并行超级计算机,其成本,功耗和占地面积都在降低,并且允许从互连角度来看处理节点的最大封装密度。 超级计算机利用了VLSI的技术进步,实现了许多处理器可以集成到单个专用集成电路(ASIC)中的计算模型。 每个ASIC计算节点包括利用集成到一个管芯中的四个或更多个处理器的片上系统ASIC,每个处理器具有对所有系统资源的完全访问,并且使得处理器能够对诸如计算或消息传递I / O 并且优选地,根据应用内的各种算法阶段实现功能的自适应分割,或者如果I / O或其他处理器未被充分利用,则可以参与计算或通信节点通过五维环面网络互连 使用DMA来最大限度地最大化节点之间的分组通信的吞吐量并最小化等待时间。
摘要:
Methods, systems and computer program products are disclosed for measuring a performance of a program running on a processing unit of a processing system. In one embodiment, the method comprises informing a logic unit of each instruction in the program that is executed by the processing unit, assigning a weight to each instruction, assigning the instructions to a plurality of groups, and analyzing the plurality of groups to measure one or more metrics. In one embodiment, each instruction includes an operating code portion, and the assigning includes assigning the instructions to the groups based on the operating code portions of the instructions. In an embodiment, each type of instruction is assigned to a respective one of the plurality of groups. These groups may be combined into a plurality of sets of the groups.
摘要:
A general computer-implement method and apparatus to optimize problem layout on a massively parallel supercomputer is described. The method takes as input the communication matrix of an arbitrary problem in the form of an array whose entries C(i, j) are the amount to data communicated from domain i to domain j. Given C(i, j), first implement a heuristic map is implemented which attempts sequentially to map a domain and its communications neighbors either to the same supercomputer node or to near-neighbor nodes on the supercomputer torus while keeping the number of domains mapped to a supercomputer node constant (as much as possible). Next a Markov Chain of maps is generated from the initial map using Monte Carlo simulation with Free Energy (cost function) F=Σi,jC(i,j)H(i,j)− where H(i,j) is the smallest number of hops on the supercomputer torus between domain i and domain j. On the cases tested, found was that the method produces good mappings and has the potential to be used as a general layout optimization tool for parallel codes. At the moment, the serial code implemented to test the method is un-optimized so that computation time to find the optimum map can be several hours on a typical PC. For production implementation, good parallel code for our algorithm would be required which could itself be implemented on supercomputer.
摘要:
A process which comprises extracting 1,3-diacyloxy-1,1,3,3-tetra(hydrocarbyl) distannoxane from a mixture containing 1,3-diacyloxy-1,1,3,3-tetra(hydrocarbyl)distannoxane, a sucrose-6-ester, and polar aprotic solvent, which process comprises the steps of:(a) contacting said mixture, in the presence of a small amount of water, with an organic solvent that is substantially immiscible with water to form thereby an extraction mixture, wherein the amount of water employed is sufficient to cause efficient partitioning of said 1,3-diacyloxy-1,1,3,3-tetra(hydrocarbyl)distannoxane from a first phase comprising said polar aprotic solvent into second phase comprising said organic solvent;(b) agitating the extraction mixture for a period of time and at a temperature sufficient to form thereby a two-phase mixture wherein the preponderance of the 1,3-diacyloxy-1,1,3,3-tetra(hydrocabyl)distannoxane in the extraction mixture is contained in said second phase and essentially all of the sucrose-6-ester in the extraction mixture is contained in said first phase; and(c) separating said first phase from said second phase.
摘要:
A process for the chlorination of sucrose-6-esters to produce 6',4,1'-trichlorosucrose-6-esters which comprises the steps of:(a) adding at least seven molar equivalents of an acid chloride to a reaction mixture containing a sucrose-6-ester and a tertiary amide to form initially a chloroformiminium chloride salt which subsequently forms a complex with the hydroxyl groups of the sucrose-6-ester;(b) subjecting the reaction mixture product of step (a) to an elevated temperature not higher than about 85.degree. C. for a period of time sufficient to produce a mixture of chlorinated sucrose-6-ester products consisting essentially of 6'-chlorosucrose-6-ester, 4,6'-dichlorosucrose-6-ester, and 1',6'-dichlorosucrose-6-ester; and(c) subjecting the reaction mixture product of step (b) to an elevated temperature not higher than about 125.degree. C. for a period of time sufficient to produce a chlorinated product consisting essentially of 1',4,6'-trichlorosucrose-6-ester.
摘要:
There is disclosed a process for recovering sucralose-6-ester from a feed mixture of 6-O-acyl-4,1',6'-trichloro-4,1',6'-trideoxygalactosucrose in a reaction medium comprising a tertiary amide (such as N,N-dimethylformamide), wherein said process comprises removing a major proportion of said tertiary amide by steam distillation. In preferred aspects of the invention, the steam distillation is followed by extraction and then purification by crystallization or crystal aging to recover sucralose-6-ester in good yield.
摘要:
A process for preparing functional sugar polymers comprising transferring a monosaccharide or oligosaccharide to an acceptor, removing by-products, separating polymers which have not achieved the desired chain length and recycling these underdeveloped polymers, and an apparatus for producing same.
摘要:
There is disclosed a process for producing sucralose from sucralose-6-ester whereby the sucralose-6-ester is deacylated directly either prior to or after removal of the tertiary amide reaction vehicle from the neutralized chlorination reaction mixture, to produce an aqueous solution of sucralose plus salts and impurities, from which sucralose is recovered by extraction and is then preferably purified by crystallization.