<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>RayOnStorage Blog &#187; Storage Backup</title>
	<atom:link href="http://silvertonconsulting.com/blog/index.php/category/storage/storage-backup/feed/" rel="self" type="application/rss+xml" />
	<link>http://silvertonconsulting.com/blog</link>
	<description>Storage, Strategy &#38; Systems</description>
	<lastBuildDate>Thu, 09 Feb 2012 23:03:13 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>A day and a half with HP Storage</title>
		<link>http://silvertonconsulting.com/blog/2012/01/30/a-day-and-a-half-with-hp-storage/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=a-day-and-a-half-with-hp-storage</link>
		<comments>http://silvertonconsulting.com/blog/2012/01/30/a-day-and-a-half-with-hp-storage/#comments</comments>
		<pubDate>Mon, 30 Jan 2012 21:10:38 +0000</pubDate>
		<dc:creator>Ray</dc:creator>
				<category><![CDATA[Block Storage]]></category>
		<category><![CDATA[Data reduction]]></category>
		<category><![CDATA[Disk storage]]></category>
		<category><![CDATA[File Storage]]></category>
		<category><![CDATA[Storage architecture]]></category>
		<category><![CDATA[Storage Backup]]></category>
		<category><![CDATA[System effectiveness]]></category>
		<category><![CDATA[autonomic storage]]></category>
		<category><![CDATA[converged storage]]></category>
		<category><![CDATA[HP 3PAR]]></category>
		<category><![CDATA[HP B6200]]></category>
		<category><![CDATA[HP data center cooling technology]]></category>
		<category><![CDATA[HP Ibrix]]></category>
		<category><![CDATA[HP StoreOnce]]></category>
		<category><![CDATA[HP Tech Days]]></category>
		<category><![CDATA[HP X5000]]></category>
		<category><![CDATA[HP X9000]]></category>
		<category><![CDATA[Multi-tenancy]]></category>

		<guid isPermaLink="false">http://silvertonconsulting.com/blog/?p=3634</guid>
		<description><![CDATA[[long post 945 wds] HP held their (annual?) HP Tech Days in Fort Collins, Colorado this last week. We had presentations from a number of HP product managers and got to meet a number of new and old bloggers there. &#8230; <a href="http://silvertonconsulting.com/blog/2012/01/30/a-day-and-a-half-with-hp-storage/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<div id="attachment_3643" class="wp-caption alignright" style="width: 330px"><a href="http://silvertonconsulting.com/blog/wp-content/uploads/2012/01/HPTechDays2012-3RZ1.jpg"><img class="size-full wp-image-3643" title="Bloggers and HP people waiting to tour lab" src="http://silvertonconsulting.com/blog/wp-content/uploads/2012/01/HPTechDays2012-3RZ1.jpg" alt="A photo of bloggers and HP personnel waiting to go on the lab tour" width="320" height="239" /></a><p class="wp-caption-text">Bloggers and HP people waiting to tour lab</p></div>
<p>[long post 945 wds] HP held their (annual?) HP Tech Days in Fort Collins, Colorado this last week. We had presentations from a number of HP product managers and got to meet a number of new and old bloggers there.</p>
<p>In attendance from the blogosphere were: <a href="http://www.demitasse.co.nz/" target="_blank">Alastair Cooke </a>(@DemitasseNZ), <a href="http://www.knudt.net/vblog/" target="_blank">Brian Knudtson</a> (@bknudtson), <a href="http://deepstorage.net/WP-Save/" target="_blank">Howard Marks</a> (@DeepStorageNet), <a href="http://absolutelywindows.com/" target="_blank">John Obeto</a> (@JohnObeto), <a href="http://www.geekazine.com/" target="_blank">Jeff Powers</a> (@Geekazine), <a href="http://vmbulletin.com/" target="_blank">Rich Schandler</a> (@recklessop), <a href="http://techhelp.cybercreations.net/" target="_blank">Derek Schauland</a> (@webjunkie), <a href="http://3cvguy.blog.com/" target="_blank">Justin Vashisht</a> (@3cVGuy), and <a href="http://mattvogt.posterous.com/" target="_blank">Matt Vogt</a> (@MattVogt).</p>
<p><strong>Craig Nunes</strong> VP of Marketing, HP Storage got up and led off the day&#8217;s discussion talking about recent results. HP disk storage is up 11% for the quarter, 3par is growing by triple digit growth (<del>QoQ</del> maybe YoY?) and channel sales are growing by 10%.  HP storage is gaining market share, grew 3% for the quarter.  Also, HP is #2 is shipped backup appliances (1H11).  The current focus for HP storage is in three areas:</p>
<ul>
<li>Invest in <strong>established platforms</strong>, MSA and EVA (with a 100K customers)</li>
<li>Invest in <strong>converged storage</strong> aimed at new data centers, 3PAR, Lefthand, IBRIX and StoreOnce.</li>
<li>Invest in <strong>converged systems</strong> knocking down barriers between servers, storage and networking with Virtual Systems.</li>
</ul>
<p>Craig spent most of his time talking about converged storage. HP’s converged storage includes:</p>
<ul>
<li><strong>built in autonomic storage </strong>automating operations with one pain of glass and an orchestration layer on top to oversee everything.</li>
<li><strong>scale out storage</strong> providing simpler ways to grow storage.</li>
<li><strong>built on standardized platforms</strong> using off the shelf server platform technology</li>
</ul>
<p>Craig ended up discussing HP’s Virtual System, their response to VCE’s Vblock, NetApp’s FlexPod and Dell’s vStart Bundle.   HP&#8217;s Virtual System was announced earlier last year and has been doing well in the market.</p>
<p><strong>Brad Katz, </strong>Product Manager got up next and talked about Lefthand storage solutions.  Lefthand&#8217;s portfolio now ranges from the Virtual Storage Appliance (VSA) all the way up to a P4800 SAN storage blade with P4300 and P4500 rackmountable storage systems between those two.   Lefthand systems provide a clustered, scale-out IP/SAN and NAS storage.   Cluster data is striped across all disks in all storage nodes.</p>
<p>The VSA runs as a virtual machine and utilizes any ESX  (direct or SAN attached) storage.  The P4800 operates as a storage blade in an HP blade server and uses storage in the blade system.  The two rackmount systems P4300 and P4500 connect to SAS attached, external disk shelves.</p>
<div id="attachment_3636" class="wp-caption alignleft" style="width: 330px"><a href="http://silvertonconsulting.com/blog/wp-content/uploads/2012/01/HPTechDays2012-1RZ.jpg"><img class="size-full wp-image-3636" title="Steve Johnson on StoreOnce" src="http://silvertonconsulting.com/blog/wp-content/uploads/2012/01/HPTechDays2012-1RZ.jpg" alt="HP's Steve Johnson, at the front of the room discussing slide on StoreOnce" width="320" height="239" /></a><p class="wp-caption-text">Steve Johnson on StoreOnce</p></div>
<p><strong>Steve Johnson and Mat Jacoby</strong> talked next about the StoreOnce deduplicating backup appliance product line.  StoreOnce is an HP R&amp;D Labs home grown, deduplication technology which provides balanced ingest-restore rates and memory efficient deduplication.  The current product line spans D2D25xx, D2D41xx, D2D43xx and the recently announced, B6200 backup storage blade.</p>
<p>StoreOnce use a variable block, 4K chunksize and a sparse index which saves on server memory size which both lead to great deduplication rates.   Most deduplication functionality is memory intensive making it hard to scale without increasing memory or using different dedupe engines across a product line.  StoreOnce&#8217;s sparse indexing fixed that issue and as such, can use the same deduplication engine across their entire product line.</p>
<div id="attachment_3637" class="wp-caption alignright" style="width: 330px"><a href="http://silvertonconsulting.com/blog/wp-content/uploads/2012/01/HPTechDays2012-2RZ.jpg"><img class="size-full wp-image-3637" title="JR talking about 3PAR advantages" src="http://silvertonconsulting.com/blog/wp-content/uploads/2012/01/HPTechDays2012-2RZ.jpg" alt="HP's JR (Jim Richardson) at the front of the room discussing 3PAR's advantages" width="320" height="239" /></a><p class="wp-caption-text">JR talking about 3PAR advantages</p></div>
<p><strong>Jim Richardson</strong> or JR, a 3PAR SE from the start, got up and discussed 3PAR.  Early on, 3PAR brought to the market three characteristics that differentiated it from other enterprise storage products:</p>
<ul>
<li><strong>Multi-tennancy – </strong>today’s cloud service providers and just about anyone running enterprise storage needs to support mixed workloads on shared storage. 3PAR&#8217;s ASIC allows data to be placed on any storage node and be serviced at direct access speeds to better support these multi-application environments. <strong></strong></li>
<li><strong>Thin provisioning – </strong>although certainly not the first to support thin provisioning (Iceberg was the first), 3PAR did much to popularize it.  Once again the ASIC provides automated support for thin provisioning.  <strong></strong></li>
<li><strong>Autonomic functionality – </strong>optimization of storage performance across nodes and tiers of storage was also helped by their ASIC&#8217;s ability to transfer data without involving processor interaction.  Also 3PAR, tried to take the drudgery out of administration by automatically wide striping and making provisioning easier.</li>
</ul>
<p><strong>Jim Hankins and Chris Duffy</strong> came up next and talked about the X9000 IBRIX storage system.  Ibrix has intrinsic scale out NAS support and provides automatic failover across dual processing nodes called couplets. The B6200 backup system (see above) is based on Ibrix technology.  Ibrix supports a 15PB single name space that is segmented across cluster couplets.  Ibrix also comes in a gateway configuration using shared SAN storage behind it.</p>
<div id="attachment_3638" class="wp-caption alignleft" style="width: 330px"><a href="http://silvertonconsulting.com/blog/wp-content/uploads/2012/01/HPTechDays2012-5RZ.jpg"><img class="size-full wp-image-3638" title="HP X5000 NAS system" src="http://silvertonconsulting.com/blog/wp-content/uploads/2012/01/HPTechDays2012-5RZ.jpg" alt="A picture of a X5000 without skins, and a couple of CRUs taken out" width="320" height="239" /></a><p class="wp-caption-text">HP X5000 NAS system</p></div>
<p><strong>Robert Thompson</strong> got up and talked about the X5000 Windows Server WSS based NAS product.  It is the industry’s first two node file system with active/active clustering in a box.  As the product runs Windows Server, one can run Anti-Virus or other server applications directly on the storage and is customer maintainable. Robert pulled out every replaceable unit in the system.  Apparently the E5000, HP Storage’s Exchange Appliance is also based on the same hardware.   The two servers in the storage system are clustered together using MSCS.</p>
<div id="attachment_3642" class="wp-caption alignright" style="width: 250px"><a href="http://silvertonconsulting.com/blog/wp-content/uploads/2012/01/HPTechDays2012-4RZ2.jpg"><img class="size-full wp-image-3642" title="HPer showing off intelligent floor tiles" src="http://silvertonconsulting.com/blog/wp-content/uploads/2012/01/HPTechDays2012-4RZ2.jpg" alt="A photo of an intelligent data center floor tile with remotely controlled mechanical louvres to control air flow." width="240" height="321" /></a><p class="wp-caption-text">HPer showing off intelligent floor tiles</p></div>
<p>In the afternoon we went on a<strong> lab tour</strong> and got to see some of HP&#8217;s storage and data center cooling technology on display.</p>
<p>On the second day, <strong>Mike Koponen</strong> got up and discussed HP&#8217;s Virtual System (or Vblock competitor) and<strong> Aboubacar Diare</strong> gave some of his opinions on VMware VAAI &amp; VASA integration from his testing perspective.  Finally, Calvin Zito wrapped up the two day event and everyone (except me and a few others) went on a brewery tour.</p>
<p>~~~~</p>
<p>All in all, we had a good time with HP.  Too bad, I didn&#8217;t get to go on the New Belgium Brewery tour, perhaps next time.</p>
<p>Comments?</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<img src="http://silvertonconsulting.com/blog/?ak_action=api_record_view&id=3634&type=feed" alt="" /><hr />
<p><small>© RayOnStorage.com. for <a href="http://silvertonconsulting.com/blog">RayOnStorage Blog</a>, 2012. |
<a href="http://silvertonconsulting.com/blog/2012/01/30/a-day-and-a-half-with-hp-storage/">Permalink</a> |
<a href="http://silvertonconsulting.com/blog/2012/01/30/a-day-and-a-half-with-hp-storage/#comments">One comment</a> |
Add to
<a href="http://del.icio.us/post?url=http://silvertonconsulting.com/blog/2012/01/30/a-day-and-a-half-with-hp-storage/&title=A day and a half with HP Storage">del.icio.us</a>
<br/>
Post tags: <a href="http://silvertonconsulting.com/blog/tag/autonomic-storage/" rel="tag">autonomic storage</a>, <a href="http://silvertonconsulting.com/blog/tag/converged-storage/" rel="tag">converged storage</a>, <a href="http://silvertonconsulting.com/blog/tag/hp-3par/" rel="tag">HP 3PAR</a>, <a href="http://silvertonconsulting.com/blog/tag/hp-b6200/" rel="tag">HP B6200</a>, <a href="http://silvertonconsulting.com/blog/tag/hp-data-center-cooling-technology/" rel="tag">HP data center cooling technology</a>, <a href="http://silvertonconsulting.com/blog/tag/hp-ibrix/" rel="tag">HP Ibrix</a>, <a href="http://silvertonconsulting.com/blog/tag/hp-storeonce/" rel="tag">HP StoreOnce</a>, <a href="http://silvertonconsulting.com/blog/tag/hp-tech-days/" rel="tag">HP Tech Days</a>, <a href="http://silvertonconsulting.com/blog/tag/hp-x5000/" rel="tag">HP X5000</a>, <a href="http://silvertonconsulting.com/blog/tag/hp-x9000/" rel="tag">HP X9000</a>, <a href="http://silvertonconsulting.com/blog/tag/multi-tenancy/" rel="tag">Multi-tenancy</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://silvertonconsulting.com/blog/2012/01/30/a-day-and-a-half-with-hp-storage/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Latest Microsoft ESRP v3 (Exchange 2010) 1K to 5K mailbox performance results &#8211; chart of the month</title>
		<link>http://silvertonconsulting.com/blog/2011/08/25/latest-microsoft-esrp-v3-exchange-2010-1k-to-5k-mailbox-performance-results-chart-of-the-month/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=latest-microsoft-esrp-v3-exchange-2010-1k-to-5k-mailbox-performance-results-chart-of-the-month</link>
		<comments>http://silvertonconsulting.com/blog/2011/08/25/latest-microsoft-esrp-v3-exchange-2010-1k-to-5k-mailbox-performance-results-chart-of-the-month/#comments</comments>
		<pubDate>Thu, 25 Aug 2011 21:46:17 +0000</pubDate>
		<dc:creator>Ray</dc:creator>
				<category><![CDATA[Block Storage]]></category>
		<category><![CDATA[Disk storage]]></category>
		<category><![CDATA[Ethernet]]></category>
		<category><![CDATA[FC]]></category>
		<category><![CDATA[Storage Backup]]></category>
		<category><![CDATA[Storage performance]]></category>
		<category><![CDATA[System effectiveness]]></category>
		<category><![CDATA[1K-to-5Kmbx]]></category>
		<category><![CDATA[Database backup performance]]></category>
		<category><![CDATA[Dell PowerEdge R510]]></category>
		<category><![CDATA[Email performance]]></category>
		<category><![CDATA[ESRP]]></category>
		<category><![CDATA[Exchange 2010]]></category>
		<category><![CDATA[Exchange performance]]></category>
		<category><![CDATA[Exchange Solution Reviewed Program]]></category>
		<category><![CDATA[Fujitsu ETERNUS JX40]]></category>
		<category><![CDATA[HP E5700]]></category>

		<guid isPermaLink="false">http://silvertonconsulting.com/blog/?p=3203</guid>
		<description><![CDATA[Microsoft specifies two different metrics on sequential read rates for database backup activity in their Exchange Solution Reviewed Program (ESRP) reports MB read/sec per database MB read/sec total per server Our problem with these metrics is that they don&#8217;t say &#8230; <a href="http://silvertonconsulting.com/blog/2011/08/25/latest-microsoft-esrp-v3-exchange-2010-1k-to-5k-mailbox-performance-results-chart-of-the-month/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<div id="attachment_3204" class="wp-caption aligncenter" style="width: 911px"><a href="http://silvertonconsulting.com/blog/wp-content/uploads/2011/08/SCIESRP110726-004.jpg"><img class="size-full wp-image-3204" title="SCIESRP110726-004 (c) 2011 Silverton Consulting, All Rights Reserved" src="http://silvertonconsulting.com/blog/wp-content/uploads/2011/08/SCIESRP110726-004.jpg" alt="SCIESRP110726-004 (c) 2011 Silverton Consulting, All Rights Reserved" width="901" height="613" /></a><p class="wp-caption-text">SCIESRP110726-004 (c) 2011 Silverton Consulting, All Rights Reserved</p></div>
<p>Microsoft specifies two different metrics on sequential read rates for database backup activity in their <a href="http://technet.microsoft.com/en-us/exchange/ff182054.aspx" target="_blank">Exchange Solution Reviewed Program (ESRP) reports</a></p>
<ul>
<li>MB read/sec per <strong>database</strong></li>
<li>MB read/sec total per <strong>server</strong></li>
</ul>
<p>Our problem with these metrics is that they don&#8217;t say much about the storage systems performance.  Some ESRP submissions could have a single database while others can have 100s of databases.  And the same thing applies to servers, although 20 servers seems to be about the max we have seen.  So as one can see the MB/s/DB or MB/s/server can vary all over the place depending on the Exchange configuration that one uses, even for the same exact storage system.</p>
<p>In the above chart, we  have attempted to move beyond some of these problems and use the information supplied in the ESRP reports to aggregate DB backups across all databases.  As such, we have derived a new metric called &#8220;total database backup&#8221;.  (Pretty simple actually just multiply the MB/s/DB times the number of databases in the Exchange configuration).</p>
<p>A couple of problems with our approach.</p>
<ul>
<li>Current ESRP reports typically utilize a shadow storage system and shadow Exchange servers which host 50% of the databases and email activity. So what I am showing for those ESRP reports is what two storage systems can accomplish not one.</li>
<li>Another potential way to get the same result would be to use the number of servers times the MB/sec/server metric. (But try as I might these two approaches didn&#8217;t work to get the same answer so I am using the computation above &#8211; must be the way I am recording the number of [shadow] servers).</li>
<li>Although ESRP reports the average MB/sec/database to backup a single database it&#8217;s not clear that these measurements were taken while backing up all active databases at the same time, especially for those submissions with 100s of databases.</li>
</ul>
<p>Probably the last is the most problematic critique to our new measure but may not be that harmful for smaller configurations. Nonetheless, we produced the above chart and published it in our last months review of ESRP results for the 1001 to 5000 mailbox category.</p>
<p>One item we discussed in our report was that numbers of disk drives didn&#8217;t seem to correlate well with high positions on this chart.  The number ten position (Fujitsu ETERNUS JX40) used over 140 disks, the number two position (Dell PowerEdge R510) had only 12 disk drives, and the number one solution (HP E5700) consisted of 56 drives, close to the average for this category.</p>
<p>One striking finding using this measure is that performance varies considerably from the top providing over 1600 MB/sec of database backup to the lowest of the group providing only ~800 MB/sec of backup performance. What with Exchange 2010 and lagged DAGs, some people feel that backup activity is no longer needed but we would disagree. We continue to believe that taking backups of Exchange data still makes a whole lot of sense and shouldn&#8217;t go away, ever.</p>
<p>It&#8217;s our hope that this or some similar follow-on metric will remove some of the Exchange configuration parameters from confounding ESRP reported storage system performance results.  We realize that this quixotic quest may never be entirely successful nevertheless we perform this duty in the hope that it will benefit today and future storage performance analysts everywhere.</p>
<p>Comments?</p>
<p>&#8212;&#8211;</p>
<p>The full ESRP report went out to our newsletter subscribers last month.  A copy of the full report will be up on the <a href="http://silvertonconsulting.com/cms1/dispatches/" target="_blank">dispatches page</a> of our website later next month. However, you can get this information now and subscribe to future newsletters to receive these reports even earlier by just emailing us at <a href="mailto:SubscribeNews@SilvertonConsulting.com?Subject=Subscribe_to_Newsletter">SubscribeNews@SilvertonConsulting.com?Subject=Subscribe_to_NewsletterR</a> or using the signup form above and to the right.</p>
<p>As always, we welcome any suggestions on how to improve our analysis of ESRP or any of our other storage system performance discussions.</p>
<img src="http://silvertonconsulting.com/blog/?ak_action=api_record_view&id=3203&type=feed" alt="" /><hr />
<p><small>© RayOnStorage.com. for <a href="http://silvertonconsulting.com/blog">RayOnStorage Blog</a>, 2011. |
<a href="http://silvertonconsulting.com/blog/2011/08/25/latest-microsoft-esrp-v3-exchange-2010-1k-to-5k-mailbox-performance-results-chart-of-the-month/">Permalink</a> |
<a href="http://silvertonconsulting.com/blog/2011/08/25/latest-microsoft-esrp-v3-exchange-2010-1k-to-5k-mailbox-performance-results-chart-of-the-month/#comments">No comment</a> |
Add to
<a href="http://del.icio.us/post?url=http://silvertonconsulting.com/blog/2011/08/25/latest-microsoft-esrp-v3-exchange-2010-1k-to-5k-mailbox-performance-results-chart-of-the-month/&title=Latest Microsoft ESRP v3 (Exchange 2010) 1K to 5K mailbox performance results &#8211; chart of the month">del.icio.us</a>
<br/>
Post tags: <a href="http://silvertonconsulting.com/blog/tag/1k-to-5kmbx/" rel="tag">1K-to-5Kmbx</a>, <a href="http://silvertonconsulting.com/blog/tag/database-backup-performance/" rel="tag">Database backup performance</a>, <a href="http://silvertonconsulting.com/blog/tag/dell-poweredge-r510/" rel="tag">Dell PowerEdge R510</a>, <a href="http://silvertonconsulting.com/blog/tag/email-performance/" rel="tag">Email performance</a>, <a href="http://silvertonconsulting.com/blog/tag/esrp/" rel="tag">ESRP</a>, <a href="http://silvertonconsulting.com/blog/tag/exchange-2010/" rel="tag">Exchange 2010</a>, <a href="http://silvertonconsulting.com/blog/tag/exchange-performance/" rel="tag">Exchange performance</a>, <a href="http://silvertonconsulting.com/blog/tag/exchange-solution-reviewed-program/" rel="tag">Exchange Solution Reviewed Program</a>, <a href="http://silvertonconsulting.com/blog/tag/fujitsu-eternus-jx40/" rel="tag">Fujitsu ETERNUS JX40</a>, <a href="http://silvertonconsulting.com/blog/tag/hp-e5700/" rel="tag">HP E5700</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://silvertonconsulting.com/blog/2011/08/25/latest-microsoft-esrp-v3-exchange-2010-1k-to-5k-mailbox-performance-results-chart-of-the-month/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>EMCWorld day 3 &#8230;</title>
		<link>http://silvertonconsulting.com/blog/2011/05/11/emc-world-day-3/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=emc-world-day-3</link>
		<comments>http://silvertonconsulting.com/blog/2011/05/11/emc-world-day-3/#comments</comments>
		<pubDate>Wed, 11 May 2011 20:01:47 +0000</pubDate>
		<dc:creator>Ray</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[Data grid]]></category>
		<category><![CDATA[Data retention]]></category>
		<category><![CDATA[Distributed computing]]></category>
		<category><![CDATA[File Storage]]></category>
		<category><![CDATA[Storage architecture]]></category>
		<category><![CDATA[Storage Backup]]></category>
		<category><![CDATA[Systems]]></category>
		<category><![CDATA[backup appliances]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[EMCWorld]]></category>
		<category><![CDATA[grid computing]]></category>
		<category><![CDATA[healthcare cloud]]></category>
		<category><![CDATA[Isilon]]></category>
		<category><![CDATA[sensor mesh]]></category>

		<guid isPermaLink="false">http://silvertonconsulting.com/blog/?p=2940</guid>
		<description><![CDATA[Sometime this week EMC announced a new generation of Isilon NearLine storage which now includes HGST 3TB SATA disk drives.  With the new capacity the multi-node (144) Isilon cluster using the 108NL nodes can support 15PB of file data in &#8230; <a href="http://silvertonconsulting.com/blog/2011/05/11/emc-world-day-3/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<p>Sometime this week EMC announced a new generation of Isilon NearLine storage which now includes HGST 3TB SATA disk drives.  With the new capacity the multi-node (144) Isilon cluster using the 108NL nodes can support 15PB of file data in a single file system.</p>
<p>Some of the booths along the walk to the solutions pavilion highlight EMC innovation winners. Two that caught my interest included:</p>
<ul>
<li><strong>Constellation computing</strong> &#8211; not quite sure how to define this but it&#8217;s distributed computing along with distributed data creation.  The intent is to move the data processing to the source of the data creation and keep the data there.  This might be very useful for applications that have many data sources and where data processing capabilities can be moved out to the nodes where the data was created. Seems highly scaleable but may depend on the ability to carve up the processing to work on the local data. I can see where compression, encryption, indexing and some statistical summarization can be done at the data creation site before it&#8217;s sent elsewhere. Sort of like both a sensor mesh with a processing nodes attached to the sensors configured as a sensor-proccessing grid.  Only one thing concerned me, there didn&#8217;t seem to be any central repository or control to this computing environment.  Probably what they intended, as the distributed solution is more adaptable and more scaleable than a centrally controlled environment.</li>
<li><strong>Developing world healthcare cloud</strong> &#8211; seemed to be all about delivering healthcare to the bottom of the pyramid.  They won EMC&#8217;s social innovation award and are working with a group in Rwanda to try to provide better healthcare to remote villages.  It&#8217;s built around <a href="http://openmrs.org/" target="_blank">OpenMRS</a> as a backend medical record archive hosted on EMC DC powered Iomega NAS storage and uses Google&#8217;s <a href="http://opendatakit.org/" target="_blank">OpenDataKit</a> to work with the data on mobile and laptop devices.  They showed a mobile phone which could be used to create, record and retrieve healthcare information (OpenMRS records) remotely and upload it sometime later when in range of a cell tower.  The solution also supports the download of a portion of the medical center&#8217;s health record database (e.g., a &#8220;cohort&#8221; slice, think a village&#8217;s healthcare records) onto a laptop, usable offline by a healthcare provider to update and record  patient health changes onsite and remotely.  Pulling all the technology together and delivering this as an application stack usable on mobile and laptop devices with minimal IT sophistication, storage and remote/mobile access are where the challenges lie.</li>
</ul>
<p>Went to Sanjay&#8217;s (EMC&#8217;s CIO) keynote on EMC IT&#8217;s journey to IT-as-a-Service. As you can imagine it makes extensive use of VMware&#8217;s vSphere, vCloud, and vShield capabilities primarily in a private cloud infrastructure but they seem agnostic to a build-it or buy-it approach. EMC is about 75% virtualized today, and are starting to see significant and tangible OpEx and energy savings. They designed their North Carolina data center around the vCloud architecture and now are offering business users self service portals to provision VMs and business services&#8230;</p>
<p>Only caught the first section of BJ&#8217;s (President of BRS) keynote but he said recent analyst data (think IDC?) said that EMC was the overall leader (&gt;64% market share) in purpose built backup appliances (Data Domain, Disk Library, Avamar data stores, etc.).  Too bad I had to step out but he looked like he was on a roll.</p>
<p>Comments?</p>
<img src="http://silvertonconsulting.com/blog/?ak_action=api_record_view&id=2940&type=feed" alt="" /><hr />
<p><small>© RayOnStorage.com. for <a href="http://silvertonconsulting.com/blog">RayOnStorage Blog</a>, 2011. |
<a href="http://silvertonconsulting.com/blog/2011/05/11/emc-world-day-3/">Permalink</a> |
<a href="http://silvertonconsulting.com/blog/2011/05/11/emc-world-day-3/#comments">No comment</a> |
Add to
<a href="http://del.icio.us/post?url=http://silvertonconsulting.com/blog/2011/05/11/emc-world-day-3/&title=EMCWorld day 3 &#8230;">del.icio.us</a>
<br/>
Post tags: <a href="http://silvertonconsulting.com/blog/tag/backup-appliances/" rel="tag">backup appliances</a>, <a href="http://silvertonconsulting.com/blog/tag/emc/" rel="tag">EMC</a>, <a href="http://silvertonconsulting.com/blog/tag/emcworld/" rel="tag">EMCWorld</a>, <a href="http://silvertonconsulting.com/blog/tag/grid-computing/" rel="tag">grid computing</a>, <a href="http://silvertonconsulting.com/blog/tag/healthcare-cloud/" rel="tag">healthcare cloud</a>, <a href="http://silvertonconsulting.com/blog/tag/isilon/" rel="tag">Isilon</a>, <a href="http://silvertonconsulting.com/blog/tag/sensor-mesh/" rel="tag">sensor mesh</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://silvertonconsulting.com/blog/2011/05/11/emc-world-day-3/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>EMC Data Domain products enter the archive market</title>
		<link>http://silvertonconsulting.com/blog/2011/01/18/emc-data-domain-products-enter-the-archive-market/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=emc-data-domain-products-enter-the-archive-market</link>
		<comments>http://silvertonconsulting.com/blog/2011/01/18/emc-data-domain-products-enter-the-archive-market/#comments</comments>
		<pubDate>Tue, 18 Jan 2011 15:18:53 +0000</pubDate>
		<dc:creator>Ray</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[data protection]]></category>
		<category><![CDATA[Data retention]]></category>
		<category><![CDATA[Storage Backup]]></category>
		<category><![CDATA[Storage longevity]]></category>
		<category><![CDATA[Archive]]></category>
		<category><![CDATA[backup data retention]]></category>
		<category><![CDATA[Data archive]]></category>
		<category><![CDATA[Data Domain]]></category>
		<category><![CDATA[Data Domain 860 Archiver]]></category>
		<category><![CDATA[DD Boost]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[file backup data]]></category>
		<category><![CDATA[Long term archive]]></category>

		<guid isPermaLink="false">http://silvertonconsulting.com/blog/?p=2565</guid>
		<description><![CDATA[In another assault on the tape market, EMC announced today a new Data Domain 860 Archiver appliance. This new system supports both short-term and long-term retention of backup data. This attacks one of the last bastions of significant tape use &#8230; <a href="http://silvertonconsulting.com/blog/2011/01/18/emc-data-domain-products-enter-the-archive-market/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<div id="attachment_2593" class="wp-caption alignleft" style="width: 234px"><a href="http://silvertonconsulting.com/blog/wp-content/uploads/2011/01/IMG_0268.jpg"><img class="size-medium wp-image-2593" title="(c) 2011 Silverton Consulting, Inc., All Rights Reserved" src="http://silvertonconsulting.com/blog/wp-content/uploads/2011/01/IMG_0268-224x300.jpg" alt="(c) 2011 Silverton Consulting, Inc., All Rights Reserved" width="224" height="300" /></a><p class="wp-caption-text">(c) 2011 Silverton Consulting, Inc., All Rights Reserved</p></div>
<p>In another assault on the tape market, EMC announced today a new Data Domain <del datetime="2011-01-24T19:06:12+00:00">860</del> Archiver <del datetime="2011-01-24T19:06:12+00:00">appliance</del>.  This new system supports both short-term and long-term retention of backup data.  This attacks one of the last bastions of significant tape use &#8211; long-term data archives.</p>
<p>Historically, a cheap version of archives had been the long-term retention of full backup tapes. As such, if one needed to keep data around for 5 years, one would keep all their full backup tape sets offsite, in a vault somewhere for 5 years.  They could then rotate the tapes (bring them back into scratch use) after the 5 years elapsed.  One problem with this &#8211; tape technology is advancing to a new generation of technology more like every 2-3 years and as such, a 5-year old tape cartridge would be at least one generation back before it could be re-used.  But current tape technology always reads 2 generations and writes at least one generation back so this use would still be feasible. I would say that many tape users did something like this to create a &#8220;<del datetime="2011-01-24T19:06:12+00:00">psuedo</del>pseudo-archive&#8221;.</p>
<p>On the other hand, there exists many specific archive point products that focused on one or a few application arenas such as email, records, or database archives which would extract specific data items and place them into archive.  These did not generally apply outside one or a few application domains but were used to support stringent compliance requirements.  The advantage of these application based archive systems is that the data was actually removed from primary storage, out of any data protection activities and placed permanently in only &#8220;archive storage&#8221;.  Such data would be subject to strict retention policies and as such, would be inviolate (couldn&#8217;t be modified) and could not be deleted until formally expired.</p>
<p>Enter the Data Domain 860 Archiver, this system supports up to 24 disk shelves, each one of which could either be dedicated to short- or long-term data retention.  Backup file data is moved within the appliance by automated policy from short- to long-term storage.  Up to 4-disk shelves can be dedicated to short-term storage with the remainder considered long-term archive units.</p>
<p>When a long-term archive unit (disk shelf) fills up with backup data it is &#8220;sealed&#8221;, i.e., it is given all the metadata required to reconstruct its file system and deduplication domain and thus, would not require the use of other disk shelves to access its data.  In this way one creates a standalone unit that contains everything needed to recover the data.  Not unlike a full backup tape set which can be used in a standalone fashion to restore data.</p>
<p>Today, the Data Domain 860 Archiver only supports file access and DD boost data access.  By doing so, the backup software is responsible for deleting data that has expired.   Such data will then be <del datetime="2011-01-24T19:06:12+00:00">absent</del> deleted from any backups taken and as policy automation copies the backups to long-term archive units it will be <del datetime="2011-01-24T19:06:12+00:00">missing</del> gone from there as well.</p>
<p>While Data Domain&#8217;s Archiver lacks removing the data from ongoing backup streams that application based archive products can achieve, it does look exactly like what could be achieved from tape based archives today.</p>
<p>One can also replicate base Data Domain or Archiver appliances to an Archiver unit to achieve offsite data archives.</p>
<p>&#8212;-</p>
<p><strong>Full disclosure:</strong> I currently work with EMC on projects specific to other products but am not currently working on anything associated with this product.</p>
<p>Tape, your move&#8230;</p>
<img src="http://silvertonconsulting.com/blog/?ak_action=api_record_view&id=2565&type=feed" alt="" /><hr />
<p><small>© RayOnStorage.com. for <a href="http://silvertonconsulting.com/blog">RayOnStorage Blog</a>, 2011. |
<a href="http://silvertonconsulting.com/blog/2011/01/18/emc-data-domain-products-enter-the-archive-market/">Permalink</a> |
<a href="http://silvertonconsulting.com/blog/2011/01/18/emc-data-domain-products-enter-the-archive-market/#comments">No comment</a> |
Add to
<a href="http://del.icio.us/post?url=http://silvertonconsulting.com/blog/2011/01/18/emc-data-domain-products-enter-the-archive-market/&title=EMC Data Domain products enter the archive market">del.icio.us</a>
<br/>
Post tags: <a href="http://silvertonconsulting.com/blog/tag/archive/" rel="tag">Archive</a>, <a href="http://silvertonconsulting.com/blog/tag/backup-data-retention/" rel="tag">backup data retention</a>, <a href="http://silvertonconsulting.com/blog/tag/data-archive/" rel="tag">Data archive</a>, <a href="http://silvertonconsulting.com/blog/tag/data-domain/" rel="tag">Data Domain</a>, <a href="http://silvertonconsulting.com/blog/tag/data-domain-860-archiver/" rel="tag">Data Domain 860 Archiver</a>, <a href="http://silvertonconsulting.com/blog/tag/dd-boost/" rel="tag">DD Boost</a>, <a href="http://silvertonconsulting.com/blog/tag/emc/" rel="tag">EMC</a>, <a href="http://silvertonconsulting.com/blog/tag/file-backup-data/" rel="tag">file backup data</a>, <a href="http://silvertonconsulting.com/blog/tag/long-term-archive/" rel="tag">Long term archive</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://silvertonconsulting.com/blog/2011/01/18/emc-data-domain-products-enter-the-archive-market/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Top 10 storage technologies over the last decade</title>
		<link>http://silvertonconsulting.com/blog/2010/12/30/top-10-storage-technologies-over-the-last-decade/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=top-10-storage-technologies-over-the-last-decade</link>
		<comments>http://silvertonconsulting.com/blog/2010/12/30/top-10-storage-technologies-over-the-last-decade/#comments</comments>
		<pubDate>Thu, 30 Dec 2010 16:45:44 +0000</pubDate>
		<dc:creator>Ray</dc:creator>
				<category><![CDATA[Block Storage]]></category>
		<category><![CDATA[Cloud services]]></category>
		<category><![CDATA[Data]]></category>
		<category><![CDATA[Data compression]]></category>
		<category><![CDATA[Disk storage]]></category>
		<category><![CDATA[Ethernet]]></category>
		<category><![CDATA[FC]]></category>
		<category><![CDATA[File Storage]]></category>
		<category><![CDATA[Networking]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Storage Backup]]></category>
		<category><![CDATA[Storage density]]></category>
		<category><![CDATA[Storage Features]]></category>
		<category><![CDATA[Storage performance]]></category>
		<category><![CDATA[Strategic Inflection Points]]></category>
		<category><![CDATA[System effectiveness]]></category>
		<category><![CDATA[Systems]]></category>
		<category><![CDATA[Tape storage]]></category>
		<category><![CDATA[Cloud Storage]]></category>
		<category><![CDATA[Deduplication]]></category>
		<category><![CDATA[FCoE]]></category>
		<category><![CDATA[GMR head]]></category>
		<category><![CDATA[iSCSI]]></category>
		<category><![CDATA[LTO tape]]></category>
		<category><![CDATA[NAND]]></category>
		<category><![CDATA[Scale-out storage]]></category>
		<category><![CDATA[SSD]]></category>
		<category><![CDATA[Storage virtualization]]></category>
		<category><![CDATA[Thin provisioning]]></category>

		<guid isPermaLink="false">http://silvertonconsulting.com/blog/?p=2533</guid>
		<description><![CDATA[Some of these technologies were in development prior to 2000, some were available in other domains but not in storage, and some were in a few subsystems but had yet to become popular as they are today.  In no particular &#8230; <a href="http://silvertonconsulting.com/blog/2010/12/30/top-10-storage-technologies-over-the-last-decade/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<div id="attachment_2536" class="wp-caption alignleft" style="width: 403px"><a href="http://silvertonconsulting.com/blog/wp-content/uploads/2010/12/2391180204_61b27a9616_b.jpg"><img class="size-full wp-image-2536   " title="Aurora's Perception or I Schrive When I See Technology by Wonderlane (cc) (from Flickr)" src="http://silvertonconsulting.com/blog/wp-content/uploads/2010/12/2391180204_61b27a9616_b.jpg" alt="Aurora's Perception or I Schrive When I See Technology by Wonderlane (cc) (from Flickr)" width="393" height="248" /></a><p class="wp-caption-text">Aurora&#39;s Perception or I Schrive When I See Technology by Wonderlane (cc) (from Flickr)</p></div>
<p>Some of these technologies were in development prior to 2000, some were available in other domains but not in storage, and some were in a few subsystems but had yet to become popular as they are today.  In no particular order here are my top 10 storage technologies for the decade:</p>
<ol>
<li><strong><a href="http://en.wikipedia.org/wiki/Solid-state_drive" target="_blank">NAND based SSDs</a></strong><strong> </strong>- DRAM and other technology solid state drives (SSDs) were available last century but over the last decade NAND Flash based devices have dominated SSD technology and have altered the storage industry forever more.  Today, it&#8217;s nigh impossible to find enterprise class storage that doesn&#8217;t support NAND SSDs.</li>
<li><strong><a href="http://www.research.ibm.com/research/gmr.html" target="_blank">GMR head</a></strong><a href="http://www.research.ibm.com/research/gmr.html" target="_blank"> </a>- Giant Magneto Resistance disk heads have become common place over the last decade and have allowed disk drive manufacturers to double data density every 18-24 months.  Now GMR heads are starting to transition over to tape storage and will enable that technology to increase data density dramatically</li>
<li><strong><a href="http://silvertonconsulting.com/cms1/wp-content/uploads/filebase/archive/article/Demyst_Dedupe.pdf" target="_blank">Data Deduplication</a></strong><strong> &#8211; </strong>Deduplication technologies emerged over the last decade as a complement to higher density disk drives as a means to more efficiently backup data.  Deduplication technology can be found in many different forms today, ranging from file and block storage systems, backup storage systems, to backup software only solutions.</li>
<li><strong><a href="http://en.wikipedia.org/wiki/Thin_provisioning" target="_blank">Thin provisioning</a></strong> &#8211; No one would argue that thin provisioning emerged last century but it took the last decade to really find its place in the storage pantheon.  One almost cannot find a data center class storage device that does not support thin provisioning today.</li>
<li><strong><a href="http://www.enterprisestrategygroup.com/2010/06/scale-out-storage/" target="_blank">Scale-out storage</a></strong> &#8211; Last century if you wanted to get higher IOPS from a storage subsystem you could add cache or disk drives but at some point you hit a subsystem performance wall.  With scale-out storage, one can now add more processing elements to a storage system cluster without having to replace the controller to obtain more IO processing power.  The link reference talks about the use of commodity hardware to provide added performance but scale-out storage can also be done with non-commodity hardware (see <a href="http://silvertonconsulting.com/blog/2010/09/27/hitachis-vsp-vs-vmax/" target="_blank">Hitachi&#8217;s VSP vs. VMAX</a>).</li>
<li><strong><a href="http://www.snia.org/education/storage_networking_primer/stor_virt/" target="_blank">Storage virtualization</a></strong><strong> &#8211; </strong>server virtualization has taken off as the dominant data center paradigm over the last decade but a counterpart to this in storage has also become more viable as well.  Storage virtualization was originally used to migrate data from old subsystems to new storage but today can be used to manage and migrate data over PBs of physical storage dynamically optimizing data placement for cost and/or performance.</li>
<li><strong><a href="http://www.ultrium.com/technology/index.html" target="_blank">LTO tape</a></strong><strong> -</strong> When IBM dominated IT in the mid to late last century, the tape format dejour always matched IBM&#8217;s tape technology.  As the decade dawned, IBM was no longer the dominant player and tape technology was starting to diverge into a babble of differing formats.  As a result, IBM, Quantum, and HP put their technology together and created a standard tape format, called LTO, which has become the new dominant tape format for the data center.</li>
<li><strong><a href="http://en.wikipedia.org/wiki/Cloud_storage" target="_blank">Cloud storage</a></strong><strong> -</strong> Unclear just when over the last decade cloud storage emerged but it seemed to be a supplement to cloud computing that also appeared this past decade.  Storage service providers had existed earlier but due to bandwidth limitations and storage costs didn&#8217;t survive the dotcom bubble. But over this past decade both bandwidth and storage costs have come down considerably and cloud storage has now become a viable technological solution to many data center issues.</li>
<li><strong>i<a href="http://en.wikipedia.org/wiki/ISCSI" target="_blank">SCSI</a></strong><strong> -</strong> SCSI has taken on many forms over the last couple of decades but iSCSI has the altered the dominant block storage paradigm from a single, pure FC based SAN to a plurality of technologies.  Nowadays, SMB shops can have block storage without the cost and complexity of FC SANs over the LAN networking technology they already use.</li>
<li><strong><a href="http://www.fcoe.com/" target="_blank">FCoE</a></strong><strong> &#8211; </strong>One could argue that this technology is still maturing today but once again SCSI has taken opened up another way to access storage. FCoE has the potential to offer all the robustness and performance of FC SANs over data center Ethernet hardware simplifying and unifying data center networking onto one technology.</li>
</ol>
<p>No doubt others would differ on their top 10 storage technologies over the last decade but I strived to find technologies that significantly changed data storage that existed in 2000 vs. today.  These 10 seemed to me to fit the bill better than most.</p>
<p>Comments?</p>
<img src="http://silvertonconsulting.com/blog/?ak_action=api_record_view&id=2533&type=feed" alt="" /><hr />
<p><small>© RayOnStorage.com. for <a href="http://silvertonconsulting.com/blog">RayOnStorage Blog</a>, 2010. |
<a href="http://silvertonconsulting.com/blog/2010/12/30/top-10-storage-technologies-over-the-last-decade/">Permalink</a> |
<a href="http://silvertonconsulting.com/blog/2010/12/30/top-10-storage-technologies-over-the-last-decade/#comments">4 comments</a> |
Add to
<a href="http://del.icio.us/post?url=http://silvertonconsulting.com/blog/2010/12/30/top-10-storage-technologies-over-the-last-decade/&title=Top 10 storage technologies over the last decade">del.icio.us</a>
<br/>
Post tags: <a href="http://silvertonconsulting.com/blog/tag/cloud-storage/" rel="tag">Cloud Storage</a>, <a href="http://silvertonconsulting.com/blog/tag/deduplication/" rel="tag">Deduplication</a>, <a href="http://silvertonconsulting.com/blog/tag/fcoe/" rel="tag">FCoE</a>, <a href="http://silvertonconsulting.com/blog/tag/gmr-head/" rel="tag">GMR head</a>, <a href="http://silvertonconsulting.com/blog/tag/iscsi/" rel="tag">iSCSI</a>, <a href="http://silvertonconsulting.com/blog/tag/lto-tape/" rel="tag">LTO tape</a>, <a href="http://silvertonconsulting.com/blog/tag/nand/" rel="tag">NAND</a>, <a href="http://silvertonconsulting.com/blog/tag/scale-out-storage/" rel="tag">Scale-out storage</a>, <a href="http://silvertonconsulting.com/blog/tag/ssd/" rel="tag">SSD</a>, <a href="http://silvertonconsulting.com/blog/tag/storage-virtualization/" rel="tag">Storage virtualization</a>, <a href="http://silvertonconsulting.com/blog/tag/thin-provisioning/" rel="tag">Thin provisioning</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://silvertonconsulting.com/blog/2010/12/30/top-10-storage-technologies-over-the-last-decade/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>EMC to buy Isilon Systems</title>
		<link>http://silvertonconsulting.com/blog/2010/11/16/emc-to-buy-isilon-systems/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=emc-to-buy-isilon-systems</link>
		<comments>http://silvertonconsulting.com/blog/2010/11/16/emc-to-buy-isilon-systems/#comments</comments>
		<pubDate>Tue, 16 Nov 2010 22:52:11 +0000</pubDate>
		<dc:creator>Ray</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[data protection]]></category>
		<category><![CDATA[File Storage]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Storage architecture]]></category>
		<category><![CDATA[Storage Backup]]></category>
		<category><![CDATA[backup systems]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[Isilon Systems]]></category>
		<category><![CDATA[Scale out NAS]]></category>
		<category><![CDATA[SO/CFS]]></category>
		<category><![CDATA[tape elimination]]></category>

		<guid isPermaLink="false">http://silvertonconsulting.com/blog/?p=2433</guid>
		<description><![CDATA[I understand the rationale behind EMC&#8217;s purchase of Isilon scale out NAS technology for big data applications.  More and more data is being created every day and most of that unstructured.  How can one begin to support multiple PBs of &#8230; <a href="http://silvertonconsulting.com/blog/2010/11/16/emc-to-buy-isilon-systems/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<div id="attachment_2434" class="wp-caption alignleft" style="width: 190px"><a href="http://silvertonconsulting.com/blog/wp-content/uploads/2010/11/x-series-fadewhite-2.jpg"><img class="size-full wp-image-2434" title="Isilon X series nodes (c) 2010 Isilon from Isilon.com" src="http://silvertonconsulting.com/blog/wp-content/uploads/2010/11/x-series-fadewhite-2.jpg" alt="Isilon X series nodes (c) 2010 Isilon from Isilon's website" width="180" height="123" /></a><p class="wp-caption-text">Isilon X series nodes (c) 2010 Isilon from Isilon&#39;s website</p></div>
<p>I understand the rationale behind EMC&#8217;s purchase of Isilon scale out NAS technology for big data applications.  More and more data is being created every day and most of that unstructured.  How can one begin to support multiple PBs of file data that&#8217;s coming online in the next couple of years without scale out NAS.  Scale out NAS has the advantage that within the same architecture one can scale from TBs to PBs of file storage by just adding storage and/or accessor nodes.  Sounds great.</p>
<h3>Isilon for backup storage?</h3>
<p>But what&#8217;s surprising to me is the use of Isilon NL-Series storage in more mundane applications like Database backup.  A couple of weeks ago I wrote a post on how <a href="http://silvertonconsulting.com/blog/2010/10/20/poor-deduplication-with-oracle-rman-compressed-backups/" target="_blank">Oracle RMAN compressed backups don&#8217;t dedupe very well</a>.  The impetus for that post was that a very large enterprise customer I was talking with had just started deploying Isilon NAS systems in their backup environment to handle non-dedupable data.  The customer was backing up PB of storage, a good portion of which was non-dedupable, and as such, they planed to use Isilon Systems to store this data.</p>
<p>I had never seen scale out NAS systems used for backup storage so I was intrigued to find out why.  Essentially, this customer was in the throws of replacing tape and between deduplication appliances and Isilon storage they believed they had the solutions to eliminate tape forever from their backup systems.</p>
<p>All this begs the question where does EMC put Isilon &#8211;  with Celerra and other storage platforms, with Atmos and other cloud services, or with Data Domain and other backup systems?  It seems one could almost break out the three Isilon storage systems and split them into these three business groups but given Isilon&#8217;s flexibility it probably belongs in storage platforms.</p>
<p>However, I would think that BRS would have an immediate market requirement for Isilon&#8217;s NL-Series storage to complement it&#8217;s other backup systems.  I guess we will know shortly where EMC puts it &#8211; until then it&#8217;s anyone&#8217;s guess.</p>
<img src="http://silvertonconsulting.com/blog/?ak_action=api_record_view&id=2433&type=feed" alt="" /><hr />
<p><small>© RayOnStorage.com. for <a href="http://silvertonconsulting.com/blog">RayOnStorage Blog</a>, 2010. |
<a href="http://silvertonconsulting.com/blog/2010/11/16/emc-to-buy-isilon-systems/">Permalink</a> |
<a href="http://silvertonconsulting.com/blog/2010/11/16/emc-to-buy-isilon-systems/#comments">One comment</a> |
Add to
<a href="http://del.icio.us/post?url=http://silvertonconsulting.com/blog/2010/11/16/emc-to-buy-isilon-systems/&title=EMC to buy Isilon Systems">del.icio.us</a>
<br/>
Post tags: <a href="http://silvertonconsulting.com/blog/tag/backup-systems/" rel="tag">backup systems</a>, <a href="http://silvertonconsulting.com/blog/tag/emc/" rel="tag">EMC</a>, <a href="http://silvertonconsulting.com/blog/tag/isilon-systems/" rel="tag">Isilon Systems</a>, <a href="http://silvertonconsulting.com/blog/tag/scale-out-nas/" rel="tag">Scale out NAS</a>, <a href="http://silvertonconsulting.com/blog/tag/socfs/" rel="tag">SO/CFS</a>, <a href="http://silvertonconsulting.com/blog/tag/tape-elimination/" rel="tag">tape elimination</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://silvertonconsulting.com/blog/2010/11/16/emc-to-buy-isilon-systems/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Cloud storage replication does not suffice for backups &#8211; revisited</title>
		<link>http://silvertonconsulting.com/blog/2010/11/02/cloud-storage-replication-does-not-suffice-for-backups-revisited/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=cloud-storage-replication-does-not-suffice-for-backups-revisited</link>
		<comments>http://silvertonconsulting.com/blog/2010/11/02/cloud-storage-replication-does-not-suffice-for-backups-revisited/#comments</comments>
		<pubDate>Tue, 02 Nov 2010 21:16:53 +0000</pubDate>
		<dc:creator>Ray</dc:creator>
				<category><![CDATA[Cloud services]]></category>
		<category><![CDATA[Data]]></category>
		<category><![CDATA[data protection]]></category>
		<category><![CDATA[File Storage]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Storage Backup]]></category>
		<category><![CDATA[System effectiveness]]></category>
		<category><![CDATA[Systems]]></category>
		<category><![CDATA[Cloud gateway]]></category>
		<category><![CDATA[Cloud storage backup]]></category>
		<category><![CDATA[Cloud storage gateways]]></category>

		<guid isPermaLink="false">http://silvertonconsulting.com/blog/?p=2373</guid>
		<description><![CDATA[I was talking with another cloud storage gateway provider today and I asked them if they do any sort of backup for data sent to the cloud.  His answer disturbed me &#8211; they said they depend on backend cloud storage &#8230; <a href="http://silvertonconsulting.com/blog/2010/11/02/cloud-storage-replication-does-not-suffice-for-backups-revisited/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<div id="attachment_2376" class="wp-caption alignright" style="width: 210px"><a href="http://silvertonconsulting.com/blog/wp-content/uploads/2010/11/2902068353_47c58bc8b1_b.jpg"><img class="size-medium wp-image-2376" title="Free Whipped Cream Clouds on True Blue Sky Creative Commons by Pink Sherbet Photography (cc) (from Flickr)" src="http://silvertonconsulting.com/blog/wp-content/uploads/2010/11/2902068353_47c58bc8b1_b-200x300.jpg" alt="Free Whipped Cream Clouds on True Blue Sky Creative Commons by Pink Sherbet Photography (cc) (from Flickr)" width="200" height="300" /></a><p class="wp-caption-text">Free Whipped Cream Clouds on True Blue Sky Creative Commons by Pink Sherbet Photography (cc) (from Flickr)</p></div>
<p>I was talking with another cloud storage gateway provider today and I asked them if they do any sort of backup for data sent to the cloud.  His answer disturbed me &#8211; they said they depend on backend cloud storage providers replication services to provide data protection &#8211; sigh. Curtis and I have written about this before (see my <a href="http://silvertonconsulting.com/blog/2009/07/28/does-cloud-storage-need-backup/" target="_blank">Does Cloud Storage need Backup?</a> post and <a href="http://www.backupcentral.com/mr-backup-blog-mainmenu-47/13-mr-backup-blog/294-cloud-replication-backup.html" target="_blank">Replication is not backup</a> by W. Curtis Preston).</p>
<h3>Cloud replication is not backup</h3>
<div>Cloud replication is not data protection for anything but hardware failures!   Much more common than hardware failures is mistakes by end-users who inadvertently delete files, overwrite files, corrupt files, or systems that corrupt files any of which would just be replicated in error throughout the cloud storage multi-verse.  (In fact, cloud storage itself can lead to corruption see <a href="http://silvertonconsulting.com/blog/2010/06/10/eventual-data-consistency-and-cloud-storage/" target="_blank">Eventual data consistency and cloud storage</a>).</div>
<p>Replication does a nice job of covering a data center or hardware failure which leaves data at one site inaccessible but allows access to a replica of the data from another site.  As far as I am concerned there&#8217;s nothing better than replication for these sorts of DR purposes but it does nothing for someone deleting the wrong file. (I one time did a &#8220;rm * *&#8221; command on a shared Unix directory &#8211; it wasn&#8217;t pretty).</p>
<p>Some cloud storage (backend) vendors delay the deletion of blobs/containers until sometime later  as one solution to this problem.  By doing this, the data &#8220;stays around&#8221; for &#8220;sometime&#8221; after being deleted and can be restored via special request to the cloud storage vendor. The only problem with this is that &#8220;sometime&#8221; is an ill-defined, nebulous concept which is not guaranteed/specified in any way.  Also, depending on the &#8220;fullness&#8221; of the cloud storage, this time frame may be much shorter or longer.  End-user data protection cannot depend on such a wishy-washy arrangement.</p>
<h3>Other solutions to data protection for cloud storage</h3>
<p>One way is to have a local backup of any data located in cloud storage.  But this kind of defeats the purpose of cloud storage and has the cloud data being stored both locally (as backups) and remotely.  I suppose the backup data could be sent to another cloud storage provider but someone/somewhere would need to support some sort of versioning to be able to keep multiple iterations of the data around, e.g., 90 days worth of backups.  Sounds like a backup package front-ending cloud storage to me&#8230;</p>
<p>Another approach is to have the gateway provider supply some sort of backup internally using the very same cloud storage to hold various versions of data.  As long as the user can specify how many days or versions of backups can be held this works great, as cloud replication supports availability in the face of hardware failures and multiple versions support availability in the face of finger checks/logical corruptions.</p>
<p>This problem can be solved in many ways, but just using cloud replication is not one of them.</p>
<p>&#8212;</p>
<p><strong>Listen up folks</strong>, whenever you think about putting data in the cloud, you need to ask about backups <a href="http://silvertonconsulting.com/blog/2009/12/04/securing-data-in-the-cloud/" target="_blank">among other things</a>.  If they say we only offer data replication provided by the cloud storage backend &#8211; <strong>go somewhere else</strong>. Trust me, there are solutions out there that really backup cloud data.</p>
<img src="http://silvertonconsulting.com/blog/?ak_action=api_record_view&id=2373&type=feed" alt="" /><hr />
<p><small>© RayOnStorage.com. for <a href="http://silvertonconsulting.com/blog">RayOnStorage Blog</a>, 2010. |
<a href="http://silvertonconsulting.com/blog/2010/11/02/cloud-storage-replication-does-not-suffice-for-backups-revisited/">Permalink</a> |
<a href="http://silvertonconsulting.com/blog/2010/11/02/cloud-storage-replication-does-not-suffice-for-backups-revisited/#comments">2 comments</a> |
Add to
<a href="http://del.icio.us/post?url=http://silvertonconsulting.com/blog/2010/11/02/cloud-storage-replication-does-not-suffice-for-backups-revisited/&title=Cloud storage replication does not suffice for backups &#8211; revisited">del.icio.us</a>
<br/>
Post tags: <a href="http://silvertonconsulting.com/blog/tag/cloud-gateway/" rel="tag">Cloud gateway</a>, <a href="http://silvertonconsulting.com/blog/tag/cloud-storage-backup/" rel="tag">Cloud storage backup</a>, <a href="http://silvertonconsulting.com/blog/tag/cloud-storage-gateways/" rel="tag">Cloud storage gateways</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://silvertonconsulting.com/blog/2010/11/02/cloud-storage-replication-does-not-suffice-for-backups-revisited/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Poor deduplication with Oracle RMAN compressed backups</title>
		<link>http://silvertonconsulting.com/blog/2010/10/20/poor-deduplication-with-oracle-rman-compressed-backups/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=poor-deduplication-with-oracle-rman-compressed-backups</link>
		<comments>http://silvertonconsulting.com/blog/2010/10/20/poor-deduplication-with-oracle-rman-compressed-backups/#comments</comments>
		<pubDate>Wed, 20 Oct 2010 23:06:50 +0000</pubDate>
		<dc:creator>Ray</dc:creator>
				<category><![CDATA[Storage Backup]]></category>
		<category><![CDATA[Storage Features]]></category>
		<category><![CDATA[System effectiveness]]></category>
		<category><![CDATA[bzip2]]></category>
		<category><![CDATA[Deduplication]]></category>
		<category><![CDATA[huffman coding]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Oracle RMAN]]></category>
		<category><![CDATA[RMAN backupsets compression]]></category>
		<category><![CDATA[Symantec OST]]></category>
		<category><![CDATA[zlib]]></category>

		<guid isPermaLink="false">http://silvertonconsulting.com/blog/?p=2328</guid>
		<description><![CDATA[I was talking with one large enterprise customer today and he was lamenting how poorly Oracle RMAN compressed backupsets dedupe. Apparently, non-compressed RMAN backup sets generate anywhere from 20 to 40:1 deduplication ratios but when they use RMAN backupset compression, &#8230; <a href="http://silvertonconsulting.com/blog/2010/10/20/poor-deduplication-with-oracle-rman-compressed-backups/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<div id="attachment_2330" class="wp-caption alignright" style="width: 310px"><a href="http://silvertonconsulting.com/blog/wp-content/uploads/2010/10/173568416_751edafb75_b.jpg"><img class="size-medium wp-image-2330" title="Oracle offices by Steve Parker (cc) (from Flickr)" src="http://silvertonconsulting.com/blog/wp-content/uploads/2010/10/173568416_751edafb75_b-300x225.jpg" alt="Oracle offices by Steve Parker (cc) (from Flickr)" width="300" height="225" /></a><p class="wp-caption-text">Oracle offices by Steve Parker (cc) (from Flickr)</p></div>
<p>I was talking with one large enterprise customer today and he was lamenting how poorly Oracle RMAN compressed backupsets dedupe. Apparently, non-compressed RMAN backup sets generate anywhere from 20 to 40:1 deduplication ratios but when they use RMAN backupset compression, their deduplication ratios drop down to 2:1.  Given that RMAN compression probably only adds another 2:1 compression ratio then the overall data reduction becomes something ~4:1.</p>
<h3>RMAN compression</h3>
<p>It turns out <a href="http://download.oracle.com/docs/cd/B28359_01/backup.111/b28270/rcmconfa.htm#CHDEHCEB" target="_blank">Oracle RMAN supports two different compression algorithms</a> that can be used <a href="http://en.wikipedia.org/wiki/Zlib" target="_blank">zlib</a> (or gzip) and <a href="http://en.wikipedia.org/wiki/Bzip2" target="_blank">bzip2</a>.  I assume the default is zlib and if you want to one can specify bzip2 for even higher compression rates with the commensurate slower or more processor intensive compression activity.</p>
<ul>
<li><strong>Zlib</strong> is pretty standard repeating strings elimination followed by <a href="http://en.wikipedia.org/wiki/Huffman_coding" target="_blank">Huffman coding</a> which uses shorter bit strings to represent more frequent characters and longer bit strings to represent less frequent characters.</li>
<li><strong>Bzip2</strong> also uses Huffman coding but only after a number of other transforms such as r<strong>un length encoding</strong> (changing duplicated characters to a count:character sequence), <a href="http://en.wikipedia.org/wiki/Burrows–Wheeler_transform" target="_blank">Burrows–Wheeler transform</a> (changes data stream so that repeating characters come together), <strong>move-to-front transform</strong> (changes data stream so that all repeating character strings are moved to the front), another run length encoding step, <strong>huffman encoding</strong>, followed by another couple of steps to decrease the data length even more&#8230;</li>
</ul>
<p>The net of all this is that a block of data that is bzip2 encoded may look significantly different if even one character is changed.  Similarly, even zlib compressed data will look different with a single character insertion, but perhaps not as much.  This will depend on the character and where it&#8217;s inserted but even if the new character doesn&#8217;t change the huffman encoding tree, adding a few bits to a data stream will necessarily alter its byte groupings significantly downstream from that insertion. (See huffman coding to learn more).</p>
<h3>Deduplicating RMAN compressed backupsets</h3>
<p>Sub-block level deduplication often depends on seeing the same sequence of data that may be skewed or shifted by one to N bytes between two data blocks.  But as discussed above, with bzip2 or zlib (or any huffman encoded) compression algorithm the sequence of bytes looks distinctly different downstream from any character insertion.</p>
<p>One way to obtain decent deduplication rates from RMAN compressed backupsets would be to decompress the data at the dedupe appliance and then run the deduplication algorithm on it &#8211; dedupe appliance ingestion rates would suffer accordingly.  Another approach is to not use RMAN compressed backupsets but the advantages of compression are very appealing such as less network bandwidth, faster backups (because they are not transferring as much data), and quicker restores.</p>
<h3>Oracle RMAN OST</h3>
<p>On the other hand, what might work is some form of <a href="http://silvertonconsulting.com/blog/2010/05/14/fast-cache-boost-day2emcworld-2010/" target="_blank">Data Domain OST/Boost like support</a> from Oracle RMAN which would partially deduplicate the data at the RMAN server and then send the deduplicated stream to the dedupe appliance.  This would provide less network bandwidth and faster backups but may not do anything for restores.  Perhaps a tradeoff worth investigating.</p>
<p>As for the likelihood that Oracle would make such services available to deduplicatione vendors, I would have said this was unlikely but ultimately the customers have a say here.   It&#8217;s unclear why Symantec created OST but it turned out to be a money maker for them and something similar could be supported by Oracle.  Once an Oracle RMAN OST-like capability was in place, it shouldn&#8217;t take much to provide Boost functionality on top of it.  (Although EMC Data Domain is the only dedupe vendor that has Boost yet for OST or their own <a href="http://silvertonconsulting.com/blog/2010/10/08/emc-networker-7-6-sp1-surfaces/" target="_blank">Networker Boost version</a>.)</p>
<p>&#8212;-</p>
<p>When I first started this post I thought that if the dedupe vendors just understood the format of the RMAN compressed backupsets they would be able to have the same dedupe ratios as seen for normal RMAN backupsets.  As I investigated the compression algorithms being used I became convinced that it&#8217;s a computationally &#8220;hard&#8221; problem to extract duplicate data from RMAN compressed backupsets and ultimately would probably not be worth it.</p>
<p>So, if you use RMAN backupset compression, probably ought to avoid deduplicating this data for now.</p>
<p>Anything I missed here?</p>
<img src="http://silvertonconsulting.com/blog/?ak_action=api_record_view&id=2328&type=feed" alt="" /><hr />
<p><small>© RayOnStorage.com. for <a href="http://silvertonconsulting.com/blog">RayOnStorage Blog</a>, 2010. |
<a href="http://silvertonconsulting.com/blog/2010/10/20/poor-deduplication-with-oracle-rman-compressed-backups/">Permalink</a> |
<a href="http://silvertonconsulting.com/blog/2010/10/20/poor-deduplication-with-oracle-rman-compressed-backups/#comments">2 comments</a> |
Add to
<a href="http://del.icio.us/post?url=http://silvertonconsulting.com/blog/2010/10/20/poor-deduplication-with-oracle-rman-compressed-backups/&title=Poor deduplication with Oracle RMAN compressed backups">del.icio.us</a>
<br/>
Post tags: <a href="http://silvertonconsulting.com/blog/tag/bzip2/" rel="tag">bzip2</a>, <a href="http://silvertonconsulting.com/blog/tag/deduplication/" rel="tag">Deduplication</a>, <a href="http://silvertonconsulting.com/blog/tag/huffman-coding/" rel="tag">huffman coding</a>, <a href="http://silvertonconsulting.com/blog/tag/oracle/" rel="tag">Oracle</a>, <a href="http://silvertonconsulting.com/blog/tag/oracle-rman/" rel="tag">Oracle RMAN</a>, <a href="http://silvertonconsulting.com/blog/tag/rman-backupsets-compression/" rel="tag">RMAN backupsets compression</a>, <a href="http://silvertonconsulting.com/blog/tag/symantec-ost/" rel="tag">Symantec OST</a>, <a href="http://silvertonconsulting.com/blog/tag/zlib/" rel="tag">zlib</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://silvertonconsulting.com/blog/2010/10/20/poor-deduplication-with-oracle-rman-compressed-backups/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using disk: basic
Page Caching using disk: enhanced (Requested URI is rejected)
Database Caching 20/32 queries in 0.147 seconds using disk: basic

Served from: www.silvertonconsulting.com @ 2012-02-11 13:18:11 -->
