Single Instance Storage (Block level deduplication)

Please post all that you want from a ReadyNAS here. Nothing guaranteed, but we'll certainly do our best if you make a good case for it.


Postby omlette brothers » Mon Oct 10, 2005 8:23 am

First post, and a new user :D Chuffed to bits with my X6 so far...

I've migrated all my data from my W2K3 Server. There I had the benefit of Single Instance Storage, as the server also hosted RIS to build my PCs. I have a lot of development I386 directories which contain a lot of repeated files. It would be great to have the same transparent support for removing data redundancy.
Last edited by omlette brothers on Fri Oct 31, 2008 12:27 am, edited 1 time in total.
ReadyNas X6 Rev B - RAIDiator 4.1.7 [1.00a147] - Crucial CT12864Z335 (1024MB 2.0-3-3-7) - 4xSeagate ST31500341AS CC1H - Popcornhour NMT-A100 [01-17-090201-15-POP-402] - 3COM 8 Port 1Gbps Switch (3CGSU08-UK) - Wintel Clients RTL8169 @ 1Gbps - No jumbo frames - Netgear HDX101 http://forum1.netgear.com/showthread.php?t=19467
omlette brothers
ReadyNAS User
 
Posts: 74
Joined: Fri Sep 30, 2005 3:30 am
Location: Motown UK
ReadyNAS: X6

Re: Single Instance Storage

Postby bhoar » Mon Oct 10, 2005 9:50 am

Hi - um, care to give more details for those of us who aren't really sure what you're talking about so we can follow along when Infrant responds? :)

-brendan
bhoar
ReadyNAS Junkie
 
Posts: 3541
Joined: Sun Jun 26, 2005 12:51 pm
Location: Arlington, VA / Washington, D.C.
ReadyNAS: Pro

Postby Renegade59 » Mon Oct 10, 2005 5:35 pm

Single Instance Storage (SIS) allows the server to maintain one copy of a file for multiple "people."

The easiest example is in MS Exchange. If you send an attachment to 30 people, the server used to need to keep 30 different copies of the same file. With SIS, it can keep just the one and use it for everyone.
Renegade59
ReadyNAS Newbie
 
Posts: 14
Joined: Tue Jul 26, 2005 5:59 pm

Postby yoh-dah » Mon Oct 10, 2005 6:27 pm

Renegade59 wrote: Single Instance Storage (SIS) allows the server to maintain one copy of a file for multiple "people."

The easiest example is in MS Exchange. If you send an attachment to 30 people, the server used to need to keep 30 different copies of the same file. With SIS, it can keep just the one and use it for everyone.

How would this be applicable in a NAS environment? I would think this is more application dependent as the app would need to create appropriate "symlinks" to the data.
yoh-dah
Jedi Council Alumni
 
Posts: 13688
Joined: Fri Nov 19, 2004 1:21 am
Location: Borah-Borah
ReadyNAS: Pro

Postby omlette brothers » Tue Oct 11, 2005 8:01 am

In my environment I have a lot of repeated files from developing unattended Windows 2K3 and XP installs. If you've ever looked in the I386 directory of a Windows install CD you'll see thousands of files.

If you take it a stage further and compare the I386 directory of Windows 2003 Server, Windows 2003 Advanced Server, Windows 2003 Web Server and then multiply that by the different licence types like Volume, Retail and OEM you'll find very few files differ and that thousands are identical.

Single Instance Storage is a server service. When used with Microsoft's RIS, it replaces the physical, discrete data (i.e. the files) with (I guess) a Unix-like symbolic link. This cuts down on the amount of *real* disk space being used.

My other major uses are DJ'ing and in the future VJ'ing. I see both uses requiring repeated data when editing music / video.
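
To make the "Unix-like symbolic entry" idea concrete, here is a minimal Python sketch that replaces a known duplicate with a hard link to the original. It is only an illustration of the general idea, not how Microsoft's SIS or any ReadyNAS firmware actually works. The helper name `link_duplicate`, the `.sis-tmp` suffix and the demo file names are all made up for this example, and it assumes both files live on the same filesystem and are already known to be identical.

```python
import os
import tempfile


def link_duplicate(original, duplicate):
    """Replace `duplicate` with a hard link to `original`.

    Assumes both paths are on the same filesystem and that the caller
    has already verified the two files are byte-identical.
    """
    tmp = duplicate + ".sis-tmp"  # hypothetical temporary name
    os.link(original, tmp)        # second directory entry for the same data
    os.replace(tmp, duplicate)    # swap it in place of the duplicate


def demo():
    """Create two identical files, link them, and report the result."""
    with tempfile.TemporaryDirectory() as d:
        a = os.path.join(d, "a.dll")
        b = os.path.join(d, "b.dll")
        for path in (a, b):
            with open(path, "wb") as f:
                f.write(b"x" * 4096)
        link_duplicate(a, b)
        # Both names should now refer to one file with a link count of 2.
        return os.path.samefile(a, b), os.stat(a).st_nlink
```

One caveat with plain hard links: a write through either name changes the data seen through both, so a transparent service would also need some form of copy-on-write to keep the files independent.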

Postby Xipper » Tue Oct 11, 2005 9:00 am

I'm not sure how you would even go about creating such a service on a storage device. The problem is: how do you determine when files are non-unique? md5 isn't accurate enough, file name isn't accurate enough, size is not applicable. It starts to become something far more resource intensive; just try running noclone, dedupe or any other software on a PC that is supposed to find duplicate files for you. It's a lot to expect of a system that is optimized to read and write files, not file contents.

I don't believe I have ever come across an enterprise storage system that supports single instance storage, and I'm not aware of many applications that support it outside of Exchange.
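
For readers wondering how duplicate detection is commonly approached, here is a hedged Python sketch of one widely used pattern: group files by size, then by a content hash, then confirm with a byte-for-byte comparison so that a hash collision cannot cause a false match. The function names `sha256_of` and `find_duplicates` are invented for this example, SHA-256 is my choice rather than anything named in the thread, and this is not a description of any tool mentioned above.

```python
import filecmp
import hashlib
import os
from collections import defaultdict


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large files are never read into memory at once."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Return sorted lists of paths under `root` with byte-identical contents."""
    # Step 1: files of different sizes cannot be identical, so group by size.
    by_size = defaultdict(list)
    for dirpath, _dirs, files in os.walk(root):
        for name in files:
            path = os.path.join(dirpath, name)
            if os.path.isfile(path) and not os.path.islink(path):
                by_size[os.path.getsize(path)].append(path)

    groups = []
    for paths in by_size.values():
        if len(paths) < 2:
            continue
        # Step 2: only same-size files get hashed.
        by_hash = defaultdict(list)
        for path in paths:
            by_hash[sha256_of(path)].append(path)
        # Step 3: confirm byte for byte against the first candidate,
        # so the result never depends on the hash alone.
        for candidates in by_hash.values():
            first = candidates[0]
            same = [first] + [
                p for p in candidates[1:]
                if filecmp.cmp(first, p, shallow=False)
            ]
            if len(same) > 1:
                groups.append(sorted(same))
    return groups
```

The size check is cheap and skips most files before any hashing happens. The cost that remains is reading every candidate file in full at least once, which is the resource concern raised in the post above.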
Xipper
ReadyNAS User
 
Posts: 52
Joined: Thu Sep 08, 2005 1:02 pm

Re: Single Instance Storage

Postby omlette brothers » Fri Oct 31, 2008 12:26 am

A revival of an old topic, but it now seems to be a hot topic in the world of enterprise NAS, where it goes by the name of block-level deduplication. Is the ReadyNAS powerful enough to run a service like this? It would be an absolute boon to the device, since only the deltas between file versions would need to be stored.
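
As a rough illustration of why block-level deduplication stores only the deltas, here is a toy in-memory Python sketch. It splits data into fixed-size blocks, keys each block by its SHA-256 digest, and keeps one copy per distinct digest. The `BlockStore` class, the 4096-byte block size and the method names are all assumptions made for this example; real implementations differ in many ways (on-disk indexes, variable-size chunks, optional byte verification).

```python
import hashlib

BLOCK_SIZE = 4096  # assumed fixed block size, for illustration only


class BlockStore:
    """Toy content-addressed store: each distinct block is kept once."""

    def __init__(self):
        self.blocks = {}  # digest -> block bytes
        self.files = {}   # file name -> ordered list of digests

    def write(self, name, data):
        recipe = []
        for offset in range(0, len(data), BLOCK_SIZE):
            block = data[offset:offset + BLOCK_SIZE]
            digest = hashlib.sha256(block).digest()
            # Store the block only if this digest has not been seen before.
            self.blocks.setdefault(digest, block)
            recipe.append(digest)
        self.files[name] = recipe

    def read(self, name):
        """Reassemble a file from its list of block digests."""
        return b"".join(self.blocks[d] for d in self.files[name])

    def stored_bytes(self):
        """Bytes actually held, as opposed to bytes logically written."""
        return sum(len(b) for b in self.blocks.values())
```

With this sketch, writing two three-block files that differ in one block stores four blocks rather than six. Note that the sketch trusts the digest alone to decide that two blocks are equal.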

Re: Single Instance Storage (Block level deduplication)

Postby BobRoss » Thu Nov 13, 2008 8:42 am

I agree, especially for VM shops!!! We have lots of dup'd data in our Win2k3 server installs.
BobRoss
Advanced ReadyNAS User
 
Posts: 108
Joined: Wed Oct 08, 2008 1:01 pm
ReadyNAS: Pro

Re: Single Instance Storage (Block level deduplication)

Postby TeknoJnky » Thu Nov 13, 2008 9:50 am

SIS support would be awesome.

For some more info, Wikipedia has a decent explanation:

http://en.wikipedia.org/wiki/Single_instance_store
nv+ ~ 1gb ram ~ 4x WDC WD20EARS-00S8B1 ~ 5555 GB
ultra4 ~ 4 gb ram ~ 2x ST31500341AS ~ 2x ST4000DX000-1C5160 ~ 6471 GB
pro business ~ 4gb ram ~ dual redundancy ~ 4x Hitachi HDS724040ALE640 ~ 2x SAMSUNG HD204UI ~ 9130 GB
A/V streaming ---> Subsonic ---> EVO 3D
TeknoJnky
ReadyNAS Addict
 
Posts: 2910
Joined: Mon Oct 13, 2008 1:34 pm
Location: MO
ReadyNAS: Pro

Re: Single Instance Storage (Block level deduplication)

Postby omlette brothers » Mon Nov 02, 2009 2:33 pm

One day, sometime in the future, the ReadyNAS will have dedupe... :o

Now ZFS has block-level dedupe: http://blogs.sun.com/bonwick/en_US/entry/zfs_dedup

Re: Single Instance Storage (Block level deduplication)

Postby TeknoJnky » Tue Feb 23, 2010 1:34 pm

Has anyone thought of or tried running OpenSolaris and ZFS with dedupe on a Pro/3200 via VirtualBox?

I realize it would be slower than using native, but for data that is dedupe friendly it might be worth it.

Re: Single Instance Storage (Block level deduplication)

Postby omlette brothers » Wed Nov 24, 2010 1:10 pm

Block-level dedupe looks to be on the Btrfs roadmap? :D

Re: Single Instance Storage (Block level deduplication)

Postby lurium » Wed Jan 12, 2011 10:37 am

Or NETGEAR could add support for lessfs, which does inline dedup on Linux.

http://www.lessfs.com/wordpress
lurium
ReadyNAS Newbie
 
Posts: 43
Joined: Tue Oct 14, 2008 11:58 am
ReadyNAS: Pro

Re: Single Instance Storage (Block level deduplication)

Postby jeremyotten » Tue Apr 12, 2011 3:49 am

+1
jeremyotten
Advanced ReadyNAS Expert
 
Posts: 666
Joined: Wed Aug 02, 2006 2:08 am

Re: Single Instance Storage (Block level deduplication)

Postby andrewz » Tue May 24, 2011 12:10 am

TeknoJnky wrote: Has anyone thought of or tried running OpenSolaris and ZFS with dedupe on a Pro/3200 via VirtualBox?
I realize it would be slower than using native, but for data that is dedupe friendly it might be worth it.


Would be slower.
andrewz
ReadyNAS Newbie
 
Posts: 1
Joined: Tue May 24, 2011 12:07 am
ReadyNAS: 3200
