A Method for High-Throughput Deduplication for Primary File Server by Using Prefetch Cache

Hitoshi Kamei, Takaki Nakamura

    Research output: Article

    1 Citation (Scopus)

    Abstract

    We propose a high-throughput file-level deduplication method for primary file servers, called partial data background prefetch (PDBP). To achieve high deduplication throughput, the method reduces the number of disk I/Os issued during the deduplication process. Before the deduplication process runs, the method prefetches part of the data of the shared files referenced by deduplicated files. It then processes the files that are larger than a file-size threshold defined by administrators. In this paper, we evaluate the deduplication processing time by using a simulation model of PDBP. The results confirm that PDBP reduces the processing time by about 50% compared to a conventional file deduplication method when the threshold is set to 4 KB.
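    The abstract does not spell out how the prefetched data is used, so the following is only a minimal sketch under stated assumptions, not the authors' algorithm or their simulation model. It assumes that the prefetched part is a fixed-size prefix of each shared file held in memory, that a candidate is compared against that prefix first, and that a full shared file is read from disk only when the prefix matches. The names `SharedStore`, `deduplicate`, and `PREFIX_BYTES` are hypothetical, and "disk reads" are simply counted rather than performed.

```python
from dataclasses import dataclass, field

PREFIX_BYTES = 4096  # hypothetical size of the prefetched part of each shared file


@dataclass
class SharedStore:
    """Toy model of shared (already deduplicated) files, grouped by file size."""
    by_size: dict = field(default_factory=dict)       # size -> list of file contents
    prefix_cache: dict = field(default_factory=dict)  # size -> list of cached prefixes
    disk_reads: int = 0                               # full-file reads during deduplication

    def add(self, data: bytes) -> None:
        self.by_size.setdefault(len(data), []).append(data)

    def prefetch(self) -> None:
        # Assumption: this runs in the background before deduplication,
        # so it is not counted in disk_reads.
        for size, files in self.by_size.items():
            self.prefix_cache[size] = [f[:PREFIX_BYTES] for f in files]

    def read_full(self, size: int, idx: int) -> bytes:
        self.disk_reads += 1  # stands in for a disk I/O
        return self.by_size[size][idx]


def deduplicate(store: SharedStore, candidates, threshold: int, use_prefetch: bool) -> int:
    """Return how many candidates are byte-identical to some shared file."""
    matched = 0
    for data in candidates:
        size = len(data)
        if size <= threshold:
            continue  # only files larger than the threshold are processed
        for idx in range(len(store.by_size.get(size, []))):
            if use_prefetch and store.prefix_cache[size][idx] != data[:PREFIX_BYTES]:
                continue  # ruled out from the in-memory prefix, no disk read
            if store.read_full(size, idx) == data:
                matched += 1
                break
    return matched
```

    With two 8 KB shared files and a 4 KB threshold, calling `prefetch()` and then `deduplicate(..., use_prefetch=True)` rules out mismatching shared files from memory, so fewer full-file reads are counted than with `use_prefetch=False`. The size of that gap in this toy model depends entirely on the assumed inputs and says nothing about the 50% figure reported in the paper.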

    Original language: English
    Pages (from-to): 54-64
    Number of pages: 11
    Journal: Electronics and Communications in Japan
    Volume: 99
    Issue number: 12
    DOI
    Publication status: Published - Dec 1, 2016

    ASJC Scopus subject areas

    • Signal Processing
    • Physics and Astronomy(all)
    • Computer Networks and Communications
    • Electrical and Electronic Engineering
    • Applied Mathematics
