INTELLIGENT DATA PROPAGATION IN A HIGHLY DISTRIBUTED ENVIRONMENT
    1.
    发明申请
    INTELLIGENT DATA PROPAGATION IN A HIGHLY DISTRIBUTED ENVIRONMENT 有权
    智能数据传播在高分布环境中

    公开(公告)号:US20150134606A1

    公开(公告)日:2015-05-14

    申请号:US14080710

    申请日:2013-11-14

    Applicant: VMware, Inc.

    CPC classification number: G06F17/30575 G06F17/30194 H04L67/1095 H04L67/1097

    Abstract: Exemplary methods, apparatuses, and systems that can intelligently copy data to a plurality of datastores are described. In one embodiment, a distance value of a path between each datastore is determined. Based on the distance values, a graph cluster analysis creates clusters of the datastores within close proximity to one another. Also, a shortest path tree determines the most efficient paths available for copying data from a source datastore to one or more destination datastores. The source datastore is designated as the root of the shortest path tree, and the one or more destination datastores are designated as the vertices of the tree. After each child vertex of the source datastore is ordered in descending order according to a number of unique clusters to which descendants of the child vertex belong, the data is copied from the source datastore to the one or more destination datastores in the descending order.

    Abstract translation: 描述了可以智能地将数据复制到多个数据存储区的示例性方法,装置和系统。 在一个实施例中,确定每个数据存储之间的路径的距离值。 基于距离值,图形聚类分析创建彼此靠近的数据存储区域。 此外,最短路径树确定可用于将数据从源数据存储复制到一个或多个目标数据存储的最有效的路径。 源数据存储区被指定为最短路径树的根,并且一个或多个目标数据存储区被指定为树的顶点。 在源数据存储的每个子顶点按照子顶点的后代所属的唯一集群的数量的降序排序后,数据将从源数据存储复制到一个或多个目标数据存储中。

    MULTI-TENANT PRODUCTION AND TEST DEPLOYMENTS OF HADOOP
    2.
    发明申请
    MULTI-TENANT PRODUCTION AND TEST DEPLOYMENTS OF HADOOP 审中-公开
    多样性生产和测试部署HADOOP

    公开(公告)号:US20150120791A1

    公开(公告)日:2015-04-30

    申请号:US14062723

    申请日:2013-10-24

    Applicant: VMWARE, INC.

    CPC classification number: G06F17/30194 G06F9/45558 G06F2009/45575

    Abstract: A distributed computing application is described that provides a highly elastic and multi-tenant platform for Hadoop applications and other workloads running in a virtualized environment. Production, test, and development deployments of a Hadoop application may be executed using multiple compute clusters and a shared instance of a distributed filesystem, or in other cases, multiple instances of the distributed filesystem. Data nodes executing as virtual machines (VMs) for test and development deployments can be linked clones of data nodes executing as VMs for a production deployment to reduce duplicated data and provide a shared storage space.

    Abstract translation: 描述了一种分布式计算应用程序,为Hadoop应用程序和在虚拟化环境中运行的其他工作负载提供了高弹性和多租户平台。 Hadoop应用程序的生产,测试和开发部署可以使用多个计算集群和分布式文件系统的共享实例,或者其他情况下分布式文件系统的多个实例来执行。 作为用于测试和开发部署的虚拟机(VM))执行的数据节点可以是作为生产部署的VM执行的数据节点的链接克隆,以减少重复的数据并提供共享存储空间。

    ELASTIC TEMPORARY FILESYSTEM
    3.
    发明申请
    ELASTIC TEMPORARY FILESYSTEM 审中-公开
    弹性临时文件系统

    公开(公告)号:US20150160884A1

    公开(公告)日:2015-06-11

    申请号:US14517301

    申请日:2014-10-17

    Applicant: VMWARE, INC.

    Abstract: An elastic filesystem for temporary data provides storage space for virtual machines (VMs) in a distributed computing system. The filesystem redirects accesses to virtual disks in VMs to a common pool file. The system provides performance and storage efficiency at least on par with local, direct attached virtual disks, while providing a single pool of shared storage that is provisioned and managed independently of the VMs. The system provides storage isolation between VMs storing temporary data in that shared pool. Also, storage space for temporary data may be allocated on demand and reclaimed when no longer needed, thereby supporting a wide variety of temporary space requirements for different Hadoop jobs.

    Abstract translation: 用于临时数据的弹性文件系统为分布式计算系统中的虚拟机(VM)提供了存储空间。 文件系统将对VM中的虚拟磁盘的访问重定向到公共池文件。 该系统至少与本地直接连接的虚拟磁盘相提并论,提供性能和存储效率,同时提供独立于虚拟机配置和管理的单个共享存储池。 该系统提供在该共享池中存储临时数据的虚拟机之间的存储隔离。 此外,临时数据的存储空间可以根据需要分配,并在不再需要时进行回收,从而支持不同Hadoop作业的各种临时空间要求。

    INTELLIGENT DATA PROPAGATION USING PERFORMANCE MONITORING
    4.
    发明申请
    INTELLIGENT DATA PROPAGATION USING PERFORMANCE MONITORING 有权
    智能数据传播使用性能监控

    公开(公告)号:US20150134607A1

    公开(公告)日:2015-05-14

    申请号:US14080718

    申请日:2013-11-14

    Applicant: VMware, Inc.

    Abstract: Exemplary methods, apparatuses, and systems that can intelligently copy data to a plurality of datastores using performance monitoring are described. In one embodiment, a shortest path tree determines the most efficient paths available for copying data from a source datastore to one or more destination datastores. During the copying of the data between a source datastore and the one or more destination datastores, a performance value of each of the datastores involved in the copying process is compared to a threshold. In response to determining that the performance value of a given source or destination datastore involved in the copying exceeds the threshold, the copying of the data to the corresponding destination datastore is suspended. An updated shortest path tree is determined to locate a more efficient path for copying data to the suspended destination datastore. Copying is resumed to the suspended destination datastore using the updated shortest path tree.

    Abstract translation: 描述了可以使用性能监视将数据智能复制到多个数据存储区的示例性方法,装置和系统。 在一个实施例中,最短路径树确定可用于将数据从源数据存储复制到一个或多个目的地数据存储的最有效的路径。 在源数据存储和一个或多个目标数据存储之间的数据复制期间,复制过程中涉及的每个数据存储区的性能值与阈值进行比较。 响应于确定复制中涉及的给定源或目的地数据存储的性能值超过阈值,将暂停将数据复制到相应的目的地数据存储。 确定更新的最短路径树以定位用于将数据复制到已暂停的目的地数据存储的更有效的路径。 使用更新的最短路径树将复制恢复到已暂停的目标数据存储区。

    CONTAINER VIRTUAL MACHINES FOR HADOOP
    5.
    发明申请
    CONTAINER VIRTUAL MACHINES FOR HADOOP 审中-公开
    集装箱的虚拟机

    公开(公告)号:US20150120928A1

    公开(公告)日:2015-04-30

    申请号:US14062660

    申请日:2013-10-24

    Applicant: VMware, Inc.

    CPC classification number: H04L67/1008 G06F9/5072 G06F17/30194 G06F2209/5011

    Abstract: A distributed computing application is described that provides a highly elastic and multi-tenant platform for Hadoop applications and other workloads running in a virtualized environment. Data and compute nodes are separated into different virtual machines (VM). Compute VMs are used to launch containers from different tenants. Compute VMs are organized in pools of hot spare VMs that are immediately available for launching a container and executing a task, and pools of cold spare VMs. Each compute VM may include a mounted network filesystem provided by a node manager to share intermediate outputs across VMs executing on the same host.

    Abstract translation: 描述了一种分布式计算应用程序,为Hadoop应用程序和在虚拟化环境中运行的其他工作负载提供了高弹性和多租户平台。 数据和计算节点分为不同的虚拟机(VM)。 计算虚拟机用于从不同的租户启动容器。 计算虚拟机组织在可以立即可用于启动容器和执行任务的热备用虚拟机池以及冷备用虚拟机池中。 每个计算VM可以包括由节点管理器提供的安装的网络文件系统,以跨越在同一主机上执行的VM共享中间输出。

Patent Agency Ranking