SDFS: A File-System With Inline De-Duplication

Posted by Michael Larabel on August 17, 2011

ZFS is known for its de-duplication support and there are other file-systems (such as Dragonfly's HAMMER, plus work-in-progress support for Btrfs) that support this data compression feature of eliminating duplicate data. There's also a new project that we have just learned about which is SDFS, a file-system that offers inline de-duplication support.

Opendedup SDFS is a file-system that supports in-line and batch mode de-duplication on both Linux and Windows systems, along with VMware virtualized environments. This file-system claims it can reduce storage utilization by up to 90~95%, can de-duplicate more than a Petabyte of data, can de-dupe/re-dupe at a speed of more than 1GB/s, and can do this de-duplication process either locally, on the network, or in the cloud (including Amazon S3). In fact, SDFS is particularly suited for the cloud with focusing on VMware, Xen, and KVM. SDFS also supports file and folder snapshots. These claims are rather impressive, especially from an unheard of open-source project (they only have 18 Twitter followers).

Earlier this month they put out the SDFS file-system 1.0.8 feature for Windows and Linux. While the file-system is portable to Windows, to the dismay of some, this file-system is built atop FUSE, which Linus Torvalds argues is for toys and misguided people. This file-system requires Linux x86_64, FUSE 2.8+, at least 2GB of RAM, and even Java 7.

For those not familiar with data de-duplication, they have a page about it and more information on opendedup.org, including an SDFS architecture presentation. This user-space file-system is hosted at Google Code and is developed under the GNU GPLv2 license.

Discuss this article in our forums, IRC channel, or email the author. You can also follow our content via RSS and on social networks like Facebook, Identi.ca, and Twitter (@Phoronix and @MichaelLarabel). Subscribe to Phoronix Premium to view our content without advertisements, view entire articles on a single page, and experience other benefits.
Latest Hardware Reviews
  1. Sumo Lounge Emperor
  2. Gallium3D Continues Improving OpenGL For Older Radeon GPUs
  3. 15-Way Open vs. Closed Source NVIDIA/AMD Linux GPU Comparison
  4. Nouveau vs. NVIDIA Linux Comparison Shows Shortcomings
Latest Software Articles
  1. The Cost Of Ubuntu Disk Encryption
  2. Btrfs vs. EXT4 vs. XFS vs. F2FS On Linux 3.10
  3. AMD Radeon R600 GPU LLVM 3.3 Back-End Testing
  4. F2FS File-System Shows Regressions On Linux 3.10
Latest Linux News
  1. QEMU 1.5 Supports VGA Passthrough, Better USB 3.0
  2. Handbrake 0.9.9 Supports OpenCL Offloading
  3. Freedreno Gallium3D Now Banging The Adreno A3XX
  4. Jolla Announces Their First Phone
  5. Mageia 3 Released, Still Using Legacy GRUB
  6. NetBSD 6.1 Brings In More Features
  7. Using Six Monitors With AMD's Open-Source Linux Driver
  8. Benchmarking The Intel P-State, CPUfreq Changes
  9. FreeBSD Still Working On Next-Gen Package Manager
  10. DNF Still Advancing As Experimental Yum For Fedora
  11. Logitech Begins Supporting Linux Users
Latest Forum Talk
  1. The Cost Of Ubuntu Disk Encryption
  2. Freedreno Gallium3D Now Banging The Adreno A3XX
  3. QEMU 1.5 Supports VGA Passthrough, Better USB 3.0
  4. Sumo Lounge Emperor
  5. FreeBSD Still Working On Next-Gen Package Manager
  6. Plymouth Planned For Ubuntu 9.10 Integration
  1. Computers
  2. Display Drivers
  3. Graphics Cards
  4. Motherboards
  5. Peripherals
  6. Processors
  7. Software
  8. Operating Systems
  9. All Articles
  1. Linux Benchmarking
  2. OpenBenchmarking.org
  3. Phoronix Test Suite