Device discovery and topology reporting in a combined CPU/GPU architecture system
    1.
    发明授权
    Device discovery and topology reporting in a combined CPU/GPU architecture system 有权
    组合CPU / GPU架构系统中的设备发现和拓扑报告

    公开(公告)号:US08797332B2

    公开(公告)日:2014-08-05

    申请号:US13325824

    申请日:2011-12-14

    CPC分类号: G06T1/20 G06F9/30003

    摘要: Methods and apparatus are provided, as an aspect of a combined CPU/APD architecture system, for discovering and reporting properties of devices and system topology that are relevant to efficiently scheduling and distributing computational tasks to the various computational resources of a combined CPU/APD architecture system. The combined CPU/APD architecture unifies CPUs and APDs in a flexible computing environment. In some embodiments, the combined CPU/APD architecture capabilities are implemented in a single integrated circuit, elements of which can include one or more CPU cores and one or more APD cores. The combined CPU/APD architecture creates a foundation upon which existing and new programming frameworks, languages, and tools can be constructed.

    摘要翻译: 提供了作为组合的CPU / APD架构系统的一个方面的方法和装置,用于发现和报告与有效地调度和分发计算任务到组合的CPU / APD架构的各种计算资源相关的设备和系统拓扑的属性 系统。 组合的CPU / APD架构将CPU和APD统一在灵活的计算环境中。 在一些实施例中,组合的CPU / APD架构能力在单个集成电路中实现,其单元可以包括一个或多个CPU核心和一个或多个APD核心。 组合的CPU / APD架构创建了可以构建现有和新的编程框架,语言和工具的基础。

    Device Discovery and Topology Reporting in a Combined CPU/GPU Architecture System
    2.
    发明申请
    Device Discovery and Topology Reporting in a Combined CPU/GPU Architecture System 有权
    组合CPU / GPU架构系统中的设备发现和拓扑报告

    公开(公告)号:US20120162234A1

    公开(公告)日:2012-06-28

    申请号:US13325824

    申请日:2011-12-14

    IPC分类号: G06T1/20 G06T1/60

    CPC分类号: G06T1/20 G06F9/30003

    摘要: Methods and apparatus are provided, as an aspect of a combined CPU/APD architecture system, for discovering and reporting properties of devices and system topology that are relevant to efficiently scheduling and distributing computational tasks to the various computational resources of a combined CPU/APD architecture system. The combined CPU/APD architecture unifies CPUs and APDs in a flexible computing environment. In some embodiments, the combined CPU/APD architecture capabilities are implemented in a single integrated circuit, elements of which can include one or more CPU cores and one or more APD cores. The combined CPU/APD architecture creates a foundation upon which existing and new programming frameworks, languages, and tools can be constructed.

    摘要翻译: 提供了作为组合的CPU / APD架构系统的一个方面的方法和装置,用于发现和报告与有效地调度和分发计算任务到组合的CPU / APD架构的各种计算资源相关的设备和系统拓扑的属性 系统。 组合的CPU / APD架构将CPU和APD统一在灵活的计算环境中。 在一些实施例中,组合的CPU / APD架构能力在单个集成电路中实现,其单元可以包括一个或多个CPU核心和一个或多个APD核心。 组合的CPU / APD架构创建了可以构建现有和新的编程框架,语言和工具的基础。

    Infrastructure support for accelerated processing device memory paging without operating system integration
    3.
    发明授权
    Infrastructure support for accelerated processing device memory paging without operating system integration 有权
    基础架构支持加速处理设备内存分页,无需操作系统集成

    公开(公告)号:US08578129B2

    公开(公告)日:2013-11-05

    申请号:US13325282

    申请日:2011-12-14

    IPC分类号: G06F12/00

    摘要: In a CPU, the CPU having multiple CPU cores, each core having a first machine specific register, a second machine specific register, and microcode which when executed causes a write notification to be issued to the physical address contained in the second machine specific register; receiving in the first machine specific register of a CPU core, a physical page table/page directory base address, receiving in the second machine specific register of the CPU core, a physical address pointing to a location controlled by the IOMMUv2, determining that a control register of the CPU core has been updated, and responsive to the determination that the control register has been updated, executing microcode in the CPU core that causes a write notification to be issued to the physical address contained in the second machine specific register, wherein the physical address is able to receive writes that affect IOMMUv2 page table invalidations.

    摘要翻译: 在CPU中,具有多个CPU核心的CPU,每个核心具有第一机器特定寄存器,第二机器特定寄存器和微代码,当被执行时,将对包含在第二机器特定寄存器中的物理地址发出写入通知; 在CPU核心的第一机器特定寄存器中接收物理页表/页目录基地址,在CPU核心的第二机器特定寄存器中接收指向由IOMMUv2控制的位置的物理地址,确定控制 已经更新了CPU核心的寄存器,并且响应于控制寄存器被更新的确定,执行CPU核心中的微代码,使得向第二机器特定寄存器中包含的物理地址发出写入通知,其中, 物理地址能够接收影响IOMMUv2页表无效的写入。

    GRAPHICS PROCESSING DISPATCH FROM USER MODE
    4.
    发明申请
    GRAPHICS PROCESSING DISPATCH FROM USER MODE 有权
    图形处理从用户模式进行分配

    公开(公告)号:US20120188258A1

    公开(公告)日:2012-07-26

    申请号:US13289304

    申请日:2011-11-04

    IPC分类号: G06F15/16

    CPC分类号: G06F9/545 G06F9/544

    摘要: A method, system, and computer program product are disclosed for providing improved access to accelerated processing device compute resources to user mode applications. The functionality disclosed allows user mode applications to provide commands to an accelerated processing device without the need for kernel mode transitions in order to access a unified ring buffer. Instead, applications are each provided with their own buffers, which the accelerated processing device hardware can access to process commands. With full operating system support, user mode applications are able to utilize the accelerated processing device in much the same way as a CPU.

    摘要翻译: 公开了一种方法,系统和计算机程序产品,用于提供对用户模式应用的加速处理设备计算资源的改进访问。 所公开的功能允许用户模式应用程序向加速处理设备提供命令,而不需要内核模式转换以便访问统一的环形缓冲区。 相反,应用程序各自提供有自己的缓冲区,加速处理设备硬件可以访问进程命令。 通过完整的操作系统支持,用户模式应用程序能够以与CPU相同的方式利用加速处理设备。

    Graphics processing dispatch from user mode
    5.
    发明授权
    Graphics processing dispatch from user mode 有权
    图形处理从用户模式调度

    公开(公告)号:US09176795B2

    公开(公告)日:2015-11-03

    申请号:US13289304

    申请日:2011-11-04

    CPC分类号: G06F9/545 G06F9/544

    摘要: A method, system, and computer program product are disclosed for providing improved access to accelerated processing device compute resources to user mode applications. The functionality disclosed allows user mode applications to provide commands to an accelerated processing device without the need for kernel mode transitions in order to access a unified ring buffer. Instead, applications are each provided with their own buffers, which the accelerated processing device hardware can access to process commands. With full operating system support, user mode applications are able to utilize the accelerated processing device in much the same way as a CPU.

    摘要翻译: 公开了一种方法,系统和计算机程序产品,用于提供对用户模式应用的加速处理设备计算资源的改进访问。 所公开的功能允许用户模式应用程序向加速处理设备提供命令,而不需要内核模式转换以便访问统一的环形缓冲区。 相反,应用程序各自提供有自己的缓冲区,加速处理设备硬件可以访问进程命令。 通过完整的操作系统支持,用户模式应用程序能够以与CPU相同的方式利用加速处理设备。

    Graphics compute process scheduling
    7.
    发明授权
    Graphics compute process scheduling 有权
    图形计算过程调度

    公开(公告)号:US09176794B2

    公开(公告)日:2015-11-03

    申请号:US13289260

    申请日:2011-11-04

    CPC分类号: G06F9/545 G06F2209/509

    摘要: A method, system, and computer program product are disclosed for providing improved access to accelerated processing device compute resources to user mode applications. The functionality disclosed allows user mode applications to provide commands to an accelerated processing device without the need for kernel mode transitions in order to access a unified ring buffer. Instead, applications are each provided with their own buffers, which the accelerated processing device hardware can access to process commands. With full operating system support, user mode applications are able to utilize the accelerated processing device in much the same way as a CPU.

    摘要翻译: 公开了一种方法,系统和计算机程序产品,用于提供对用户模式应用的加速处理设备计算资源的改进访问。 所公开的功能允许用户模式应用程序向加速处理设备提供命令,而不需要内核模式转换以便访问统一的环形缓冲区。 相反,应用程序各自提供有自己的缓冲区,加速处理设备硬件可以访问进程命令。 通过完整的操作系统支持,用户模式应用程序能够以与CPU相同的方式利用加速处理设备。

    DRIVER ARCHITECTURE FOR COMPUTER DEVICE HAVING MULTIPLE GRAPHICS SUBSYSTEMS, REDUCED POWER CONSUMPTION MODES, SOFTWARE AND METHODS
    9.
    发明申请
    DRIVER ARCHITECTURE FOR COMPUTER DEVICE HAVING MULTIPLE GRAPHICS SUBSYSTEMS, REDUCED POWER CONSUMPTION MODES, SOFTWARE AND METHODS 有权
    具有多个图形子系统的计算机设备的驱动器架构,降低功耗模式,软件和方法

    公开(公告)号:US20090153540A1

    公开(公告)日:2009-06-18

    申请号:US12335258

    申请日:2008-12-15

    IPC分类号: G06F3/038 G06F15/16

    摘要: Many computing device may now include two or more graphics subsystems. The multiple graphics subsystems may have different abilities, and may, for example, consume differing amount of electrical power, with one subsystem consuming more average power than the others. The higher power consuming graphics subsystem may be coupled to the device and used instead of, or in addition to, the lower power consuming graphics subsystem, resulting in higher performance or additional capabilities, but increased overall power consumption. By transitioning from the use of the higher power consuming graphics subsystem to the lower power consuming graphics subsystem, while placing the higher power consuming graphics subsystem in a lower power consumption mode, overall power consumption is reduced. A processor executes application software and driver software. The driver software includes first and second driver components for respectively controlling operation of the first and second graphics subsystems. A further proxy driver component routes calls (e.g. API/DDI calls) to one of said first and second driver components, in dependence on which of the first and second graphics system is in use.

    摘要翻译: 许多计算设备现在可以包括两个或更多个图形子系统。 多个图形子系统可能具有不同的能力,并且可以例如消耗不同的电力量,一个子系统比其他子系统消耗更多的平均功率。 较高功耗的图形子系统可以耦合到该设备,并且被代替或者除了低功耗的图形子系统之外使用,导致更高的性能或附加的能力,但增加了整体的功耗。 通过从使用较高功耗的图形子系统转换到较低功耗的图形子系统,在将较高功率消耗的图形子系统置于较低功耗模式的同时,总体功耗降低。 处理器执行应用软件和驱动程序软件。 驱动器软件包括用于分别控制第一和第二图形子系统的操作的第一和第二驱动器组件。 根据第一和第二图形系统中的哪一个在使用中,另外的代理驱动器组件将呼叫(例如,API / DDI呼叫)​​路由到所述第一和第二驱动器组件之一。