DATA PROCESSING METHOD AND APPARATUS
    2.
    发明申请
    DATA PROCESSING METHOD AND APPARATUS 有权
    数据处理方法和装置

    公开(公告)号:US20140189237A1

    公开(公告)日:2014-07-03

    申请号:US14140945

    申请日:2013-12-26

    Abstract: Embodiments of the present invention provide a data processing method and apparatus. According to the embodiments of the present invention, when it is found that a data hash value in a currently received data stream exceeds a preset first threshold, a part or all of data in the data stream is not deduplicated, and is directly stored, so as to prevent the data in the data stream from being dispersedly stored into a plurality of storage areas; instead, the part or all of the data is stored into a storage area in a centralized manner, so that a deduplication rate is effectively improved on the whole, particularly in a scenario of large data storage amount.

    Abstract translation: 本发明的实施例提供一种数据处理方法和装置。 根据本发明的实施例,当发现当前接收的数据流中的数据散列值超过预设的第一阈值时,数据流中的部分或全部数据不被重复数据删除,并被直接存储,所以 以防止数据流中的数据被分散地存储到多个存储区域中; 相反,部分或全部数据以集中的方式存储在存储区域中,从而整体上有效地提高了重复数据删除率,特别是在大数据存储量的情况下。

Patent Agency Ranking