DISTRIBUTED MEANS OF ORGANIZING AN ARBITRARILY LARGE NUMBER OF COMPUTERS

    公开(公告)号:US20120005375A1

    公开(公告)日:2012-01-05

    申请号:US12703220

    申请日:2010-03-16

    申请人: Russell Fish

    发明人: Russell Fish

    IPC分类号: G06F15/16

    CPC分类号: H04L41/0893 H04L67/10

    摘要: A technique for organizing a plurality of computers such that message broadcast, content searching, and computer identification of the entire collection or a subset of the entire collection may be performed quickly without the use of a controlling computer. The technique describes the creation, operation, and maintenance of a connection scheme by which each computer in the collection appears to be the top level of a hierarchical array. The maintenance of this hierarchical connection scheme allows one to many communications throughout the collection of computers to scale geometrically rather than linearly.

    Thread optimized multiprocessor architecture
    2.
    发明授权
    Thread optimized multiprocessor architecture 有权
    线程优化的多处理器架构

    公开(公告)号:US08984256B2

    公开(公告)日:2015-03-17

    申请号:US12147332

    申请日:2008-06-26

    申请人: Russell Fish

    发明人: Russell Fish

    摘要: In one aspect, the invention comprises a system comprising: (a) a plurality of parallel processors on a single chip; and (b) computer memory located on the chip and accessible by each of the processors; wherein each of the processors is operable to process a de minimis instruction set, and wherein each of the processors comprises local caches dedicated to each of at least three specific registers in the processor. In another aspect, the invention comprises a system comprising: (a) a plurality of parallel processors on a single chip; and (b) computer memory located on the chip and accessible by each of the processors, wherein each of the processors is operable to process an instruction set optimized for thread-level parallel processing and wherein each processor accesses the internal data bus of the computer memory on the chip and the internal data bus is the width of one row of the memory.

    摘要翻译: 一方面,本发明包括一种系统,包括:(a)单个芯片上的多个并行处理器; 和(b)位于芯片上并由每个处理器访问的计算机存储器; 其中每个处理器可操作以处理最小指令集,并且其中每个处理器包括专用于处理器中的至少三个特定寄存器中的每一个的本地高速缓存。 在另一方面,本发明包括一种系统,包括:(a)在单个芯片上的多个并行处理器; 和(b)位于芯片上并由每个处理器访问的计算机存储器,其中每个处理器可操作以处理针对线程级并行处理优化的指令集,并且其中每个处理器访问计算机存储器的内部数据总线 在芯片上和内部数据总线是一行内存的宽度。

    Floating point exception handling in a risc microprocessor architecture

    公开(公告)号:US20080072021A1

    公开(公告)日:2008-03-20

    申请号:US11981453

    申请日:2007-10-31

    IPC分类号: G06F9/302

    摘要: A microprocessor executes at 100 native MIPS peak performance with a 100-MHz internal clock frequency. Central processing unit (CPU) instruction sets are hardwired, allowing most instructions to execute in a single cycle. A “flow-through” design allows the next instruction to start before the prior instruction completes, thus increasing performance. A microprocessing unit (MPU) contains 52 general-purpose registers, including 16 global data registers, an index register, a count register, a 16-deep addressable register/return stack, and an 18-deep operand stack. Both stacks contain an index register in the top elements, are cached on chip, and when required, automatically spill to and refill from external memory. The stacks minimize the data movement and also minimize memory access during procedure calls, parameter passing, and variable assignments. Additionally, the MPU contains a mode/status register and 41 locally addressed registers for I/O, control, configuration, and status. The CPU contains both a high-performance, zero-operand, dual-stack architecture MPU, and an input-output processor (IOP) that executes instructions to transfer data, count events, measure time, and perform other timing-dependent functions. A zero-operand stack architecture eliminates operand bits. Stacks also minimize register saves and loads within and across procedures, thus allowing shorter instruction sequences and faster-running code. Instructions are simple to decode and execute, allowing the MPU and IOP to issue and complete instructions in a single clock cycle—each at 100 native MIPS peak execution. Using 8-bit opcodes, the CPU obtains up to four instructions from memory each time an instruction fetch or pre-fetch is performed. These instructions can be repeated without rereading them from memory. This maintains high performance when connected directly to DRAM, without a cache.

    DISTRIBUTED MEANS OF ORGANIZING AN ARBITRARILY LARGE NUMBER OF COMPUTERS
    4.
    发明申请
    DISTRIBUTED MEANS OF ORGANIZING AN ARBITRARILY LARGE NUMBER OF COMPUTERS 失效
    组织大量计算机的分布式手段

    公开(公告)号:US20120014250A1

    公开(公告)日:2012-01-19

    申请号:US12703218

    申请日:2010-03-16

    申请人: Russell Fish

    发明人: Russell Fish

    IPC分类号: H04L12/28 H04L12/56 H04L12/26

    CPC分类号: H04L41/0893 H04L67/10

    摘要: A technique for organizing a plurality of computers such that message broadcast, content searching, and computer identification of the entire collection or a subset of the entire collection may be performed quickly without the use of a controlling computer. The technique describes the creation, operation, and maintenance of a connection scheme by which each computer in the collection appears to be the top level of a hierarchical array. The maintenance of this hierarchical connection scheme allows one to many communications throughout the collection of computers to scale geometrically rather than linearly.

    摘要翻译: 一种用于组织多个计算机的技术,使得可以快速执行整个集合的整个集合的消息广播,内容搜索和计算机标识,而不需要使用控制计算机。 该技术描述了连接方案的创建,操作和维护,通过该方案,收集中的每台计算机似乎是分层数组的顶层。 这种分层连接方案的维护允许在整个计算机集合中进行多次通信以在几何上进行比例而不是线性化。

    DISTRIBUTED MEANS OF ORGANIZING AN ARBITRARILY LARGE NUMBER OF COMPUTERS
    5.
    发明申请
    DISTRIBUTED MEANS OF ORGANIZING AN ARBITRARILY LARGE NUMBER OF COMPUTERS 审中-公开
    组织大量计算机的分布式手段

    公开(公告)号:US20110202681A1

    公开(公告)日:2011-08-18

    申请号:US13087916

    申请日:2011-04-15

    申请人: Russell Fish

    发明人: Russell Fish

    IPC分类号: G06F15/173

    CPC分类号: H04L41/0893 H04L67/10

    摘要: A technique for organizing a plurality of computers such that message broadcast, content searching, and computer identification of the entire collection or a subset of the entire collection may be performed quickly without the use of a controlling computer. The technique describes the creation, operation, and maintenance of a connection scheme by which each computer in the collection appears to be the top level of a hierarchical array. The maintenance of this hierarchical connection scheme allows one to many communications throughout the collection of computers to scale geometrically rather than linearly.

    摘要翻译: 一种用于组织多个计算机的技术,使得可以快速执行整个集合的整个集合的消息广播,内容搜索和计算机标识,而不需要使用控制计算机。 该技术描述了连接方案的创建,操作和维护,通过该方案,收集中的每台计算机似乎是分层数组的顶层。 这种分层连接方案的维护允许在整个计算机集合中进行多次通信以在几何上进行比例而不是线性化。

    Using trap routines in a RISC microprocessor architecture
    6.
    发明申请
    Using trap routines in a RISC microprocessor architecture 审中-公开
    在RISC微处理器架构中使用陷阱程序

    公开(公告)号:US20080071991A1

    公开(公告)日:2008-03-20

    申请号:US11981482

    申请日:2007-10-31

    IPC分类号: G06F12/08

    摘要: A microprocessor executes at 100 native MIPS peak performance with a 100-MHz internal clock frequency. Central processing unit (CPU) instruction sets are hardwired, allowing most instructions to execute in a single cycle. A “flow-through” design allows the next instruction to start before the prior instruction completes, thus increasing performance. A microprocessing unit (MPU) contains 52 general-purpose registers, including 16 global data registers, an index register, a count register, a 16-deep addressable register/return stack, and an 18-deep operand stack. Both stacks contain an index register in the top elements, are cached on chip, and when required, automatically spill to and refill from external memory. The stacks minimize the data movement and also minimize memory access during procedure calls, parameter passing, and variable assignments. Additionally, the MPU contains a mode/status register and 41 locally addressed registers for I/O, control, configuration, and status. The CPU contains both a high-performance, zero-operand, dual-stack architecture MPU, and an input-output processor (IOP) that executes instructions to transfer data, count events, measure time, and perform other timing-dependent functions. A zero-operand stack architecture eliminates operand bits. Stacks also minimize register saves and loads within and across procedures, thus allowing shorter instruction sequences and faster-running code. Instructions are simple to decode and execute, allowing the MPU and IOP to issue and complete instructions in a single clock cycle—each at 100 native MIPS peak execution. Using 8-bit opcodes, the CPU obtains up to four instructions from memory each time an instruction fetch or pre-fetch is performed. These instructions can be repeated without rereading them from memory. This maintains high performance when connected directly to DRAM, without a cache.

    摘要翻译: 微处理器以100 MHz内部时钟频率执行100个本机MIPS峰值性能。 中央处理单元(CPU)指令集是硬连线的,允许大多数指令在一个周期内执行。 “流通”设计允许下一条指令在先前指令完成之前启动,从而提高性能。 微处理单元(MPU)包含52个通用寄存器,包括16个全局数据寄存器,一个索引寄存器,一个计数寄存器,一个16深可寻址寄存器/返回堆栈以及一个18深操作数堆栈。 两个堆栈都包含顶部元素中的索引寄存器,缓存在芯片上,并在需要时自动溢出并从外部存储器中重新填充。 堆栈最小化数据移动,并在过程调用,参数传递和变量赋值期间最小化存储器访问。 此外,MPU还包含一个模式/状态寄存器和41个用于I / O,控制,配置和状态的本地寻址寄存器。 CPU包含高性能零操作数双堆栈架构MPU和执行指令传输数据,计数事件,测量时间和执行其他与时序相关的功能的输入输出处理器(IOP)。 零操作数堆栈架构消除了操作数位。 堆栈还可以在过程内和跨过程中最小化寄存器保存和加载,从而允许较短的指令序列和更快的运行代码。 指令简单易于解码和执行,允许MPU和IOP在单个时钟周期内发出和完成指令,每个时钟周期为100个本机MIPS峰值执行。 每次执行指令提取或预取时,CPU使用8位操作码,最多可从内存中获取四条指令。 这些指令可以重复,而不会从内存重新读取。 当直接连接到DRAM而没有高速缓存时,这将保持高性能。

    Transferring data between registers in a RISC microprocessor architecture

    公开(公告)号:US20080091920A1

    公开(公告)日:2008-04-17

    申请号:US11981237

    申请日:2007-10-31

    IPC分类号: G06F9/00 G06F9/30

    摘要: A microprocessor executes at 100 native MIPS peak performance with a 100-MHz internal clock frequency. Central processing unit (CPU) instruction sets are hardwired, allowing most instructions to execute in a single cycle. A “flow-through” design allows the next instruction to start before the prior instruction completes, thus increasing performance. A microprocessing unit (MPU) contains 52 general-purpose registers, including 16 global data registers, an index register, a count register, a 16-deep addressable register/return stack, and an 18-deep operand stack. Both stacks contain an index register in the top elements, are cached on chip, and when required, automatically spill to and refill from external memory. The stacks minimize the data movement and also minimize memory access during procedure calls, parameter passing, and variable assignments. Additionally, the MPU contains a mode/status register and 41 locally addressed registers for I/O, control, configuration, and status. The CPU contains both a high-performance, zero-operand, dual-stack architecture MPU, and an input-output processor (IOP) that executes instructions to transfer data, count events, measure time, and perform other timing-dependent functions. A zero-operand stack architecture eliminates operand bits. Stacks also minimize register saves and loads within and across procedures, thus allowing shorter instruction sequences and faster-running code. Instructions are simple to decode and execute, allowing the MPU and IOP to issue and complete instructions in a single clock cycle—each at 100 native MIPS peak execution. Using 8-bit opcodes, the CPU obtains up to four instructions from memory each time an instruction fetch or pre-fetch is performed. These instructions can be repeated without rereading them from memory. This maintains high performance when connected directly to DRAM, without a cache.

    Using breakpoints for debugging in a RISC microprocessor architecture

    公开(公告)号:US20080077911A1

    公开(公告)日:2008-03-27

    申请号:US11981278

    申请日:2007-10-31

    IPC分类号: G06F9/44

    摘要: A microprocessor executes at 100 native MIPS peak performance with a 100-MHz internal clock frequency. Central processing unit (CPU) instruction sets are hardwired, allowing most instructions to execute in a single cycle. A “flow-through” design allows the next instruction to start before the prior instruction completes, thus increasing performance. A microprocessing unit (MPU) contains 52 general-purpose registers, including 16 global data registers, an index register, a count register, a 16-deep addressable register/return stack, and an 18-deep operand stack. Both stacks contain an index register in the top elements, are cached on chip, and when required, automatically spill to and refill from external memory. The stacks minimize the data movement and also minimize memory access during procedure calls, parameter passing, and variable assignments. Additionally, the MPU contains a mode/status register and 41 locally addressed registers for I/O, control, configuration, and status. The CPU contains both a high-performance, zero-operand, dual-stack architecture MPU, and an input-output processor (IOP) that executes instructions to transfer data, count events, measure time, and perform other timing-dependent functions. A zero-operand stack architecture eliminates operand bits. Stacks also minimize register saves and loads within and across procedures, thus allowing shorter instruction sequences and faster-running code. Instructions are simple to decode and execute, allowing the MPU and IOP to issue and complete instructions in a single clock cycle—each at 100 native MIPS peak execution. Using 8-bit opcodes, the CPU obtains up to four instructions from memory each time an instruction fetch or pre-fetch is performed. These instructions can be repeated without rereading them from memory. This maintains high performance when connected directly to DRAM, without a cache.

    Detecting the boundaries of memory in a RISC microprocessor architecture

    公开(公告)号:US20070271442A1

    公开(公告)日:2007-11-22

    申请号:US11881284

    申请日:2007-07-26

    IPC分类号: G06F9/30

    摘要: A microprocessor executes at 100 native MIPS peak performance with a 100-MHz internal clock frequency. Central processing unit (CPU) instruction sets are hardwired, allowing most instructions to execute in a single cycle. A “flow-through” design allows the next instruction to start before the prior instruction completes, thus increasing performance. A microprocessing unit (MPU) contains 52 general-purpose registers, including 16 global data registers, an index register, a count register, a 16-deep addressable register/return stack, and an 18-deep operand stack. Both stacks contain an index register in the top elements, are cached on chip, and when required, automatically spill to and refill from external memory. The stacks minimize the data movement and also minimize memory access during procedure calls, parameter passing, and variable assignments. Additionally, the MPU contains a mode/status register and 41 locally addressed registers for I/O, control, configuration, and status. The CPU contains both a high-performance, zero-operand, dual-stack architecture MPU, and an input-output processor (IOP) that executes instructions to transfer data, count events, measure time, and perform other timing-dependent functions. A zero-operand stack architecture eliminates operand bits. Stacks also minimize register saves and loads within and across procedures, thus allowing shorter instruction sequences and faster-running code. Instructions are simple to decode and execute, allowing the MPU and IOP to issue and complete instructions in a single clock cycle—each at 100 native MIPS peak execution. Using 8-bit opcodes, the CPU obtains up to four instructions from memory each time an instruction fetch or pre-fetch is performed. These instructions can be repeated without rereading them from memory. This maintains high performance when connected directly to DRAM, without a cache.

    System and method for securely establishing a direct connection between two firewalled computers
    10.
    发明申请
    System and method for securely establishing a direct connection between two firewalled computers 审中-公开
    用于安全地建立两台防火墙计算机之间的直接连接的系统和方法

    公开(公告)号:US20060230163A1

    公开(公告)日:2006-10-12

    申请号:US11386173

    申请日:2006-03-22

    申请人: Russell Fish

    发明人: Russell Fish

    IPC分类号: G06F15/16

    摘要: The disclosed system describes a means for internetworked computers protected behind blocking firewalls to communicate directly with other internetworked computers protected behind blocking firewalls. A trusted computer helps establish a connection between the two protected computers, but all subsequent communications takes place directly between the two protected computers.

    摘要翻译: 所公开的系统描述了一种用于互联网络计算机的装置,其在防火墙之后被保护以直接与阻塞防火墙后保护的其他互联网络计算机通信。 可信计算机有助于建立两台受保护计算机之间的连接,但所有后续通信都直接在两台受保护的计算机之间进行。