<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>RayOnStorage Blog &#187; Storage reliability</title>
	<atom:link href="http://silvertonconsulting.com/blog/index.php/category/storage/storage-reliability/feed/" rel="self" type="application/rss+xml" />
	<link>http://silvertonconsulting.com/blog</link>
	<description>Storage, Strategy &#38; Systems</description>
	<lastBuildDate>Thu, 09 Feb 2012 23:03:13 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Graphene Flash Memory</title>
		<link>http://silvertonconsulting.com/blog/2011/09/06/graphene-flash-memory/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=graphene-flash-memory</link>
		<comments>http://silvertonconsulting.com/blog/2011/09/06/graphene-flash-memory/#comments</comments>
		<pubDate>Tue, 06 Sep 2011 19:55:34 +0000</pubDate>
		<dc:creator>Ray</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[Data density]]></category>
		<category><![CDATA[Data integrity]]></category>
		<category><![CDATA[Data retention]]></category>
		<category><![CDATA[SSD storage]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[storage economics]]></category>
		<category><![CDATA[Storage energy use]]></category>
		<category><![CDATA[Storage longevity]]></category>
		<category><![CDATA[Storage reliability]]></category>
		<category><![CDATA[Strategic Inflection Points]]></category>
		<category><![CDATA[System effectiveness]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[cell-to-cell interaction]]></category>
		<category><![CDATA[circuit size]]></category>
		<category><![CDATA[feram]]></category>
		<category><![CDATA[Ferro-electric RAM]]></category>
		<category><![CDATA[Flash at 11nm]]></category>
		<category><![CDATA[Graphene Flash Memory]]></category>
		<category><![CDATA[IBM Almaden lab]]></category>
		<category><![CDATA[IBM Research]]></category>
		<category><![CDATA[Magnetic RAM]]></category>
		<category><![CDATA[MRAM]]></category>
		<category><![CDATA[NAND Flash problems]]></category>
		<category><![CDATA[pcm]]></category>
		<category><![CDATA[Phase Change Memory]]></category>
		<category><![CDATA[racetrack memory]]></category>
		<category><![CDATA[SSD]]></category>
		<category><![CDATA[write endurance]]></category>

		<guid isPermaLink="false">http://silvertonconsulting.com/blog/?p=3222</guid>
		<description><![CDATA[I have been thinking about writing a post on &#8220;Is Flash Dead?&#8221; for a while now.  Well at least since talking with IBM research a couple of weeks ago on their new memory technologies that they have been working on. &#8230; <a href="http://silvertonconsulting.com/blog/2011/09/06/graphene-flash-memory/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<div id="attachment_3303" class="wp-caption alignright" style="width: 378px"><a href="http://silvertonconsulting.com/blog/wp-content/uploads/2011/09/5057399792_b88ae5b06b_bRZ.jpg"><img class="size-full wp-image-3303" title="Model of graphene structure by CORE-Materials (cc) (from Flickr)" src="http://silvertonconsulting.com/blog/wp-content/uploads/2011/09/5057399792_b88ae5b06b_bRZ.jpg" alt="Model of graphene structure by CORE-Materials (cc) (from Flickr)" width="368" height="294" /></a><p class="wp-caption-text">Model of graphene structure by CORE-Materials (cc) (from Flickr)</p></div>
<p>I have been thinking about writing a post on &#8220;Is Flash Dead?&#8221; for a while now.  Well at least since talking with IBM research a couple of weeks ago on their new memory technologies that they have been working on.</p>
<p>But then this new <a href="http://www.technologyreview.com/computing/38471/page1/" target="_blank">Technology Review article</a> came out  discussing <a href="http://pubs.acs.org/doi/pdf/10.1021/nn201809k" target="_blank">recent research on Graphene Flash Memory.</a></p>
<h3>Problems with NAND Flash</h3>
<p>As we have discussed before, <a href="http://silvertonconsulting.com/blog/2011/03/08/when-will-disks-become-extinct/" target="_blank">NAND flash memory</a> has some serious limitations as it&#8217;s shrunk below 11nm or so. For instance, write endurance plummets, memory retention times are reduced and cell-to-cell interactions increase significantly.</p>
<p>These issues are not that much of a problem with <a href="http://www.itproportal.com/2011/04/15/intel-micron-reduces-flash-memory-size-by-a-third/" target="_blank">today&#8217;s flash at 20nm</a> or so. But to continue to follow Moore&#8217;s law and drop the price of NAND flash on a $/Gb basis, it will need to shrink below 16nm.  At that point or soon thereafter, current NAND flash technology will no longer be viable.</p>
<h3>Other non-NAND based non-volatile memories</h3>
<h3><span style="font-size: 16px; color: #444444; line-height: 24px;">That&#8217;s why IBM and others are working on different types of non-volatile storage such as <a href="http://en.wikipedia.org/wiki/Phase-change_memory" target="_blank">PCM (phase change memory)</a>, <a href="http://silvertonconsulting.com/blog/2010/11/30/the-future-of-data-storage-is-mram/" target="_blank">MRAM (magnetic RAM) </a>, <a href="http://en.wikipedia.org/wiki/FeRAM" target="_blank">FeRAM (Ferroelectric RAM)</a> and others.  All these have the potential to improve general reliability characteristics beyond where NAND Flash is today and where it will be tomorrow as chip geometries shrink even more.</span></h3>
<p>IBM seems to be betting on MRAM or <a href="http://www.almaden.ibm.com/spinaps/research/sd/?racetrack" target="_blank">racetrack memory technology</a> because it has near DRAM performance, extremely low power and can store far more data in the same amount of space. It sort of reminds me of <a href="http://en.wikipedia.org/wiki/Delay_line_memory" target="_blank">delay line memory</a> where bits were stored on a wire line and read out as they passed across a read/write circuit. Only in the case of racetrack memory, the delay line is etched in a silicon circuit indentation with the read/write head implemented at the bottom of the cleft.</p>
<h3>Graphene as the solution</h3>
<p>Then along comes Graphene based Flash Memory.  Graphene can apparently be used as a substitute for the storage layer in a flash memory cell.  According to the report, the graphene stores data using less power and with better stability over time.  Both crucial problems with NAND flash memory as it&#8217;s shrunk below today&#8217;s geometries.  The research is being done at UCLA and is supported by Samsung, a significant manufacturer of NAND flash memory today.</p>
<p>Current demonstration chips are much larger than would be useful.  However, given graphene&#8217;s material characteristics, the researchers believe there should be no problem scaling it down below where NAND Flash would start exhibiting problems.  The next iteration of research will be to see if their scaling assumptions can hold when device geometry is shrunk.</p>
<p>The other problem is getting graphene, a new material, into current chip production.  Current materials used in chip manufacturing lines are very tightly controlled and  building hybrid graphene devices to the same level of manufacturing tolerances and control will take some effort.</p>
<p>So don&#8217;t look for Graphene Flash Memory to show up anytime soon. But given that 16nm chip geometries are only a couple of years out and 11nm, a couple of years beyond that, it wouldn&#8217;t surprise me to see Graphene based Flash Memory introduced in about 4 years or so.  Then again, I am no materials expert, so don&#8217;t hold me to this timeline.</p>
<p>&nbsp;</p>
<p>&#8212;-</p>
<p>Comments?</p>
<img src="http://silvertonconsulting.com/blog/?ak_action=api_record_view&id=3222&type=feed" alt="" /><hr />
<p><small>© RayOnStorage.com. for <a href="http://silvertonconsulting.com/blog">RayOnStorage Blog</a>, 2011. |
<a href="http://silvertonconsulting.com/blog/2011/09/06/graphene-flash-memory/">Permalink</a> |
<a href="http://silvertonconsulting.com/blog/2011/09/06/graphene-flash-memory/#comments">No comment</a> |
Add to
<a href="http://del.icio.us/post?url=http://silvertonconsulting.com/blog/2011/09/06/graphene-flash-memory/&title=Graphene Flash Memory">del.icio.us</a>
<br/>
Post tags: <a href="http://silvertonconsulting.com/blog/tag/cell-to-cell-interaction/" rel="tag">cell-to-cell interaction</a>, <a href="http://silvertonconsulting.com/blog/tag/circuit-size/" rel="tag">circuit size</a>, <a href="http://silvertonconsulting.com/blog/tag/feram/" rel="tag">feram</a>, <a href="http://silvertonconsulting.com/blog/tag/ferro-electric-ram/" rel="tag">Ferro-electric RAM</a>, <a href="http://silvertonconsulting.com/blog/tag/flash-at-11nm/" rel="tag">Flash at 11nm</a>, <a href="http://silvertonconsulting.com/blog/tag/graphene-flash-memory/" rel="tag">Graphene Flash Memory</a>, <a href="http://silvertonconsulting.com/blog/tag/ibm-almaden-lab/" rel="tag">IBM Almaden lab</a>, <a href="http://silvertonconsulting.com/blog/tag/ibm-research/" rel="tag">IBM Research</a>, <a href="http://silvertonconsulting.com/blog/tag/magnetic-ram/" rel="tag">Magnetic RAM</a>, <a href="http://silvertonconsulting.com/blog/tag/mram/" rel="tag">MRAM</a>, <a href="http://silvertonconsulting.com/blog/tag/nand-flash-problems/" rel="tag">NAND Flash problems</a>, <a href="http://silvertonconsulting.com/blog/tag/pcm/" rel="tag">pcm</a>, <a href="http://silvertonconsulting.com/blog/tag/phase-change-memory/" rel="tag">Phase Change Memory</a>, <a href="http://silvertonconsulting.com/blog/tag/racetrack-memory/" rel="tag">racetrack memory</a>, <a href="http://silvertonconsulting.com/blog/tag/ssd/" rel="tag">SSD</a>, <a href="http://silvertonconsulting.com/blog/tag/write-endurance/" rel="tag">write endurance</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://silvertonconsulting.com/blog/2011/09/06/graphene-flash-memory/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>IBM&#8217;s 120PB storage system</title>
		<link>http://silvertonconsulting.com/blog/2011/09/02/ibms-120pb-storage-system/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=ibms-120pb-storage-system</link>
		<comments>http://silvertonconsulting.com/blog/2011/09/02/ibms-120pb-storage-system/#comments</comments>
		<pubDate>Sat, 03 Sep 2011 01:27:12 +0000</pubDate>
		<dc:creator>Ray</dc:creator>
				<category><![CDATA[Clustered storage]]></category>
		<category><![CDATA[Data]]></category>
		<category><![CDATA[Data integrity]]></category>
		<category><![CDATA[data logistics]]></category>
		<category><![CDATA[data protection]]></category>
		<category><![CDATA[Disk storage]]></category>
		<category><![CDATA[Distributed computing]]></category>
		<category><![CDATA[File Storage]]></category>
		<category><![CDATA[Infiniband]]></category>
		<category><![CDATA[Information economy]]></category>
		<category><![CDATA[Networking]]></category>
		<category><![CDATA[SAS]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Storage architecture]]></category>
		<category><![CDATA[Storage availability]]></category>
		<category><![CDATA[Storage density]]></category>
		<category><![CDATA[Storage energy use]]></category>
		<category><![CDATA[Storage Features]]></category>
		<category><![CDATA[Storage reliability]]></category>
		<category><![CDATA[Strategy]]></category>
		<category><![CDATA[System effectiveness]]></category>
		<category><![CDATA[Systems]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[GPFS]]></category>
		<category><![CDATA[GPFS nodes]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hadoop over GPFS]]></category>
		<category><![CDATA[IBM GPFS]]></category>
		<category><![CDATA[IBM SONAS]]></category>
		<category><![CDATA[ILM]]></category>
		<category><![CDATA[Isilon]]></category>
		<category><![CDATA[NSD nodes]]></category>
		<category><![CDATA[SAS Drives]]></category>
		<category><![CDATA[Scale out NAS]]></category>
		<category><![CDATA[SPECsfs2008]]></category>

		<guid isPermaLink="false">http://silvertonconsulting.com/blog/?p=3209</guid>
		<description><![CDATA[Talk about big data, Technology Review reported this week that IBM is building a 120PB storage system for some unnamed customer.  Details are sketchy and I cannot seem to find any announcement of this on IBM.com. Hardware It appears that &#8230; <a href="http://silvertonconsulting.com/blog/2011/09/02/ibms-120pb-storage-system/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<div id="attachment_3301" class="wp-caption alignleft" style="width: 378px"><a href="http://silvertonconsulting.com/blog/wp-content/uploads/2011/09/5099407299_e4377608e8_bRZ.jpg"><img class="size-full wp-image-3301" title="Susitna Glacier, Alaska by NASA Goddard Photo and Video (cc) (from Flickr)" src="http://silvertonconsulting.com/blog/wp-content/uploads/2011/09/5099407299_e4377608e8_bRZ.jpg" alt="Susitna Glacier, Alaska by NASA Goddard Photo and Video (cc) (from Flickr)" width="368" height="368" /></a><p class="wp-caption-text">Susitna Glacier, Alaska by NASA Goddard Photo and Video (cc) (from Flickr)</p></div>
<p>Talk about <a href="http://silvertonconsulting.com/blog/2011/06/24/big-data-part-3/" target="_blank">big data</a>, Technology Review reported this week that<a href="http://www.technologyreview.com/computing/38440/?ref=rss" target="_blank"> IBM is building a 120PB storage system</a> for some unnamed customer.  Details are sketchy and I cannot seem to find any announcement of this on IBM.com.</p>
<h3>Hardware</h3>
<p>It appears that the system uses 200K disk drives to support the 120PB of storage.  The disk drives are packed in a new wider rack and are water cooled.  According to the news report the new wider drive trays hold more drives than current drive trays available on the market.</p>
<p>For instance, HP has a hot pluggable, 100 SFF (small form factor 2.5&#8243;) disk enclosure that sits in 3U of standard rack space.  200K SFF disks would take up about 154 full racks, not counting the interconnect switching that would be required.  Unclear whether water cooling would increase the density much but I suppose a wider tray with special cooling might get you more drives per floor tile.</p>
<p>There was no mention of interconnect, but today&#8217;s drives use either SAS or SATA.  SAS interconnects for 200K drives would require many separate SAS busses. With an SAS expander addressing 255 drives or other expanders, one would need at least 4 SAS busses but this would have ~64K drives per bus and would not perform well.  Something more like 64-128 drives per bus would have much better performer and each drive would need dual pathing, and if we use 100 drives per SAS string, that&#8217;s 2000 SAS drive strings or at least 4000 SAS busses (dual port access to the drives).</p>
<p>The report mentioned GPFS as the underlying software which supports <a href="http://public.dhe.ibm.com/common/ssi/ecm/en/xbw03010usen/XBW03010USEN.PDF" target="_blank">three cluster types today:</a></p>
<ul>
<li><strong>Shared storage cluster </strong>- where GPFS front end nodes access shared storage across the backend. This is generally SAN storage system(s).  But the requirements for high density, it doesn&#8217;t seem likely that the 120PB storage system uses SAN storage in the backend.</li>
<li><strong>Networked based cluster </strong>- here the GPFS front end nodes talk over a LAN to a cluster of <strong>NSD (network storage director?) servers</strong> which can have access to all or some of the storage. My guess is this is what will be used in the 120PB storage system</li>
<li><strong>Shared Network based clusters </strong>- this looks just like a bunch of NSD servers but provides access across multiple NSD clusters.</li>
</ul>
<p>Given the above, with ~100 drives per NSD server means another 1U extra per 100 drives or (given HP drive density) 4U per 100 drives for 1000 drives and 10 IO servers per 40U rack, (not counting switching).  At this density it takes ~200 racks for 120PB of raw storage and NSD nodes or 2000 NSD nodes.</p>
<p>Unclear how many GPFS front end nodes would be needed on top of this but even if it were 1 GPFS frontend node for every 5 NSD nodes, we are talking another 400 GPFS frontend nodes and at 1U per server, another 10 racks or so (not counting switching).</p>
<p>If my calculations are correct we are talking over 210 racks with switching thrown in to support the storage.  According to IBM&#8217;s discussion on the <a href="http://www.dtc.umn.edu/disc/resources/KandlurISW5.pdf" target="_blank">Storage challenges for petascale systems</a>, it probably provides ~6TB/sec of data transfer which should be easy with 200K disks but may require even more SAS busses (maybe ~10K vs. the 2K discussed above).</p>
<h3>Software</h3>
<p>IBM GPFS is used behind the scenes in IBM&#8217;s commercial SONAS storage system but has been around as a cluster file system designed for HPC environments for over <a href="http://en.wikipedia.org/wiki/IBM_General_Parallel_File_System">15 years or more now</a>.</p>
<p>Given this many disk drives something needs to be done about protecting against drive failure.  IBM has been talking about <a href="http://www.pdl.cmu.edu/ftp/Declustering/ASPLOS.pdf" target="_blank">declustered RAID algorithms</a> for their next generation <a href="http://www.dtc.umn.edu/disc/resources/KandlurISW5.pdf" target="_blank">HPC storage system</a> which spreads the parity across more disks and as such, speeds up rebuild time at the cost of reducing effective capacity. There was no mention of effective capacity in the report but this would be a reasonable tradeoff.  A 200K drive storage system should have a drive failure every 10 hours, on average (assuming a 2 million hour MTBF).  Let&#8217;s hope they get drive rebuild time down much below that.</p>
<p>The system is expected to hold around a trillion files.  Not sure but even at 1024 bytes of metadata per file, this number of files would chew up ~1PB of metadata storage space.</p>
<p><a href="http://darwinproject.mit.edu/wiki/images/2/2e/Gpfs_overview.pdf" target="_blank">GPFS provides ILM</a> (information life cycle management, or data placement based on information attributes) using automated policies and supports external storage pools outside the GPFS cluster storage.  ILM within the GPFS cluster supports file placement across different tiers of storage.</p>
<p>All the discussion up to now revolved around homogeneous backend storage but it&#8217;s quite possible that multiple storage tiers could also be used.  For example, a high density but slower storage tier could be combined with a low density but faster storage tier to provide a more cost effective storage system.  Although, it&#8217;s unclear whether the application (real world modeling) could readily utilize this sort of storage architecture nor whether they would care about system cost.</p>
<p>Nonetheless, presumably an external storage pool would be a useful adjunct to any 120PB storage system for HPC applications.</p>
<h3>Can it be done?</h3>
<p>Let&#8217;s see, 400 GPFS nodes, 2000 NSD nodes, and 200K drives. Seems like the hardware would be readily doable (not sure why they needed watercooling but hopefully they obtained better drive density that way).</p>
<p>Luckily GPFS supports <a href="http://www.infinibandta.org/content/pages.php?pg=technology_faq" target="_blank">Infiniband which can support 10,000 nodes</a> within a single subnet.  Thus an Infiniband interconnect between the GPFS and NSD nodes could easily support a 2400 node cluster.</p>
<p>The only real question is can a GPFS software system handle 2000 NSD nodes and 400 GPFS nodes with trillions of files over 120PB of raw storage.</p>
<p>As a comparison here are some recent examples of scale out NAS systems:</p>
<ul>
<li>EMC Isilon demonstrated a <a href="http://www.spec.org/sfs2008/results/res2011q2/sfs2008-20110527-00186.html" target="_blank">140 node storage cluster in a SPECsfs2008</a> benchmark.</li>
<li> IBM has already announced support for a <a href="http://public.dhe.ibm.com/common/ssi/ecm/en/tsd03090usen/TSD03090USEN.PDF" target="_blank">14.4PB SONAS storage system</a> using high density drives</li>
</ul>
<p><span style="font-size: 16px; font-family: Georgia, 'Bitstream Charter', serif; line-height: 24px;">It would seem that a 20X multiplier times a current Isilon cluster or even a 10X multiple of a currently supported SONAS system would take some software effort to work together, but seems entirely within reason.</span></p>
<p>On the other hand, <a href="http://wiki.apache.org/hadoop/PoweredBy" target="_blank">Yahoo supports a 4000-node Hadoop cluster</a> and seems to work just fine.  So from a feasability perspective, a 2500 node GPFS-NSD node system seems just a walk in the park for Hadoop.</p>
<p>Of course, IBM Almaden is working on project to support <a href="http://www.almaden.ibm.com/storagesystems/projects/hadoop/" target="_blank">Hadoop over GPFS</a> which might not be optimum for real world modeling but would nonetheless support the node count being talked about here.</p>
<p>&#8212;&#8212;</p>
<p>I wish there was some real technical information on the project out on the web but I could not find any. Much of this is informed conjecture based on current GPFS system and storage hardware capabilities. But hopefully, I haven&#8217;t traveled to far astray.</p>
<p>Comments?</p>
<p>&nbsp;</p>
<img src="http://silvertonconsulting.com/blog/?ak_action=api_record_view&id=3209&type=feed" alt="" /><hr />
<p><small>© RayOnStorage.com. for <a href="http://silvertonconsulting.com/blog">RayOnStorage Blog</a>, 2011. |
<a href="http://silvertonconsulting.com/blog/2011/09/02/ibms-120pb-storage-system/">Permalink</a> |
<a href="http://silvertonconsulting.com/blog/2011/09/02/ibms-120pb-storage-system/#comments">No comment</a> |
Add to
<a href="http://del.icio.us/post?url=http://silvertonconsulting.com/blog/2011/09/02/ibms-120pb-storage-system/&title=IBM&#8217;s 120PB storage system">del.icio.us</a>
<br/>
Post tags: <a href="http://silvertonconsulting.com/blog/tag/big-data/" rel="tag">big data</a>, <a href="http://silvertonconsulting.com/blog/tag/gpfs/" rel="tag">GPFS</a>, <a href="http://silvertonconsulting.com/blog/tag/gpfs-nodes/" rel="tag">GPFS nodes</a>, <a href="http://silvertonconsulting.com/blog/tag/hadoop/" rel="tag">Hadoop</a>, <a href="http://silvertonconsulting.com/blog/tag/hadoop-over-gpfs/" rel="tag">Hadoop over GPFS</a>, <a href="http://silvertonconsulting.com/blog/tag/ibm-gpfs/" rel="tag">IBM GPFS</a>, <a href="http://silvertonconsulting.com/blog/tag/ibm-sonas/" rel="tag">IBM SONAS</a>, <a href="http://silvertonconsulting.com/blog/tag/ilm/" rel="tag">ILM</a>, <a href="http://silvertonconsulting.com/blog/tag/isilon/" rel="tag">Isilon</a>, <a href="http://silvertonconsulting.com/blog/tag/nsd-nodes/" rel="tag">NSD nodes</a>, <a href="http://silvertonconsulting.com/blog/tag/sas-drives/" rel="tag">SAS Drives</a>, <a href="http://silvertonconsulting.com/blog/tag/scale-out-nas/" rel="tag">Scale out NAS</a>, <a href="http://silvertonconsulting.com/blog/tag/specsfs2008/" rel="tag">SPECsfs2008</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://silvertonconsulting.com/blog/2011/09/02/ibms-120pb-storage-system/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Pure Storage surfaces</title>
		<link>http://silvertonconsulting.com/blog/2011/08/23/pure-storage-surfaces/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=pure-storage-surfaces</link>
		<comments>http://silvertonconsulting.com/blog/2011/08/23/pure-storage-surfaces/#comments</comments>
		<pubDate>Tue, 23 Aug 2011 19:41:47 +0000</pubDate>
		<dc:creator>Ray</dc:creator>
				<category><![CDATA[Block Storage]]></category>
		<category><![CDATA[Clustered storage]]></category>
		<category><![CDATA[Data reduction]]></category>
		<category><![CDATA[FC]]></category>
		<category><![CDATA[SSD storage]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Storage performance]]></category>
		<category><![CDATA[Storage reliability]]></category>
		<category><![CDATA[Strategic Inflection Points]]></category>
		<category><![CDATA[Strategy]]></category>
		<category><![CDATA[Systems]]></category>
		<category><![CDATA[All SSD storage]]></category>
		<category><![CDATA[data compression]]></category>
		<category><![CDATA[Data deduplication]]></category>
		<category><![CDATA[Deduplication]]></category>
		<category><![CDATA[MLC NAND]]></category>
		<category><![CDATA[SSD]]></category>
		<category><![CDATA[VMware]]></category>

		<guid isPermaLink="false">http://silvertonconsulting.com/blog/?p=3186</guid>
		<description><![CDATA[We were talking with Pure Storage last week, another SSD startup which just emerged out of stealth mode today.  Somewhat like SolidFire which we discussed a month or so ago, Pure Storage uses only SSDs to provide primary storage.  In &#8230; <a href="http://silvertonconsulting.com/blog/2011/08/23/pure-storage-surfaces/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<div id="attachment_3187" class="wp-caption alignleft" style="width: 223px"><a href="http://silvertonconsulting.com/blog/wp-content/uploads/2011/08/1x1.png"><img class="size-full wp-image-3187" title="1 controller X 1 storage shelf (c) 2011 Pure Storage (from their website)" src="http://silvertonconsulting.com/blog/wp-content/uploads/2011/08/1x1.png" alt="1 controller X 1 storage shelf (c) 2011 Pure Storage (from their website)" width="213" height="77" /></a><p class="wp-caption-text">1 controller X 1 storage shelf (c) 2011 Pure Storage (from their website)</p></div>
<p>We were talking with <a href="http://www.purestorage.com" target="_blank">Pure Storage</a> last week, another SSD startup which just emerged out of stealth mode today.  Somewhat like <a href="http://silvertonconsulting.com/blog/2011/06/21/solidfire-supplies-scale-out-ssd-storage-for-cloud-service-providers/" target="_blank">SolidFire which we discussed a month or so ago</a>, Pure Storage uses only SSDs to provide primary storage.  In this case, they are supporting a FC front end, with an all SSDs backend, and implementing internal data deduplication and compression, to try to address the needs of enterprise tier 1 storage.</p>
<p>Pure Storage is in final beta testing with their product and plan to GA sometime around the end of the year.</p>
<h3>Pure Storage hardware</h3>
<p>Their system is built around MLC SSDs which are available from many vendors but with a strategic investment from Samsung, currently use that vendor&#8217;s storage.  As we know, MLC has write endurance limitations but Pure Storage was built from the ground up knowing they were going to use this technology and have built their IP to counteract these issues.</p>
<p>The system is available in one or two controller configurations, with an Infiniband interconnect between the controllers, 6Gbps SAS backend, 48GB of DRAM per controller for caching purposes, and NV-RAM for power outages.  Each controller has 12-cores supplied by 2-Intel Xeon processor chips.</p>
<p>With the first release they are limiting the controllers to one or two (HA option) but their storage system is capable of clustering together many more, maybe even up to 8-controllers using the Infiniband back end.</p>
<p>Each storage shelf provides 5.5TB of raw storage using 2.5&#8243; 256GB MLC SSDs.  It looks like each controller can handle up to 2-storage shelfs with the HA (dual controller option) supporting 4 drive shelfs for up to 22TB of raw storage.</p>
<h3>Pure Storage Performance</h3>
<p>Although these numbers are not independently verified, the company says a single controller (with 1-storage shelf) they can do 200K sustained 4K random read IOPS, 2GB/sec bandwidth, 140K sustained write IOPS, or 500MB/s of write bandwidth.  A dual controller system (with 2-storage shelfs) can achieve 300K random read IOPS, 3GB/sec bandwidth, 180K write IOPS or 1GB/sec of write bandwidth.  They also claim that they can do all this IO with an under 1 msec. latency.</p>
<p>One of the things they pride themselves on is consistent performance.  They have built their storage such that they can deliver this consistent performance even under load conditions.</p>
<p>Given the amount of SSDs in their system this isn&#8217;t screaming performance but is certainly up there with many enterprise class systems sporting over 1000 disks.  The random write performance is not bad considering this is MLC.  On the other hand the sequential write bandwidth is probably their weakest spec and reflects their use of MLC flash.</p>
<p><span style="color: #000000; font-size: 18px; line-height: 27px;">Purity software</span></p>
<p>One key to Pure Storage (and SolidFire for that matter) is their use of inline data compression and deduplication. By using these techniques and basing their system storage on MLC, Pure Storage believes they can close the price gap between disk and SSD storage systems.</p>
<p>The problems with data reduction technologies is that not all environments can benefit from them and they both require lots of CPU power to perform well.  Pure Storage believes they have the horsepower (with 12 cores per controller) to support these services and are focusing their sales activities on those (VMware, Oracle, and SQL server) environments which have historically proven to be good candidates for data reduction.</p>
<p>In addition, they perform a lot of optimizations in their backend data layout to prolong the life of MLC storage. Specifically, they use a write chunk size that matches the underlying MLC SSDs page width so as not to waste endurance with partial data writes.  Also they migrate old data to new locations occasionally to maintain &#8220;data freshness&#8221; which can be a problem with MLC storage if the data is not touched often enough.  Probably other stuff as well, but essentially they are tuning their backend use to optimize endurance and performance of their SSD storage.</p>
<p>Furthermore, they have created a new RAID 3D scheme which provides an adaptive parity scheme based on the number of available drives that protects against any dual SSD failure.  They provide triple parity, dual parity for drive failures and another parity for unrecoverable bit errors within a data payload.  In most cases, a failed drive will not induce an immediate rebuild but rather a reconfiguration of data and parity to accommodate the failing drive and rebuild it onto new drives over time.</p>
<p>At the moment, they don&#8217;t have snapshots or data replication but they said these capabilities are on their roadmap for future delivery.</p>
<p>&#8212;-</p>
<p>In the mean time, all SSD storage systems seem to be coming out of the wood work. We mentioned SolidFire, but <a href="http://www.whiptailtech.com/" target="_blank">WhipTail</a> is another one and I am sure there are plenty more in stealth waiting for the right moment to emerge.</p>
<p>I was at a conference about two months ago where I predicted that all SSD systems would be coming out with little of the engineering development of storage systems of yore. Based on the performance available from a single SSD, one wouldn&#8217;t need 100s of SSDs to generate 100K IOPS or more.  Pure Storage is doing this level of IO with only 22 MLC SSDs and a high-end, but essentially off-the-shelf controller.</p>
<p>Just imagine what one could do if you threw some custom hardware at it&#8230;</p>
<p>Comments?</p>
<img src="http://silvertonconsulting.com/blog/?ak_action=api_record_view&id=3186&type=feed" alt="" /><hr />
<p><small>© RayOnStorage.com. for <a href="http://silvertonconsulting.com/blog">RayOnStorage Blog</a>, 2011. |
<a href="http://silvertonconsulting.com/blog/2011/08/23/pure-storage-surfaces/">Permalink</a> |
<a href="http://silvertonconsulting.com/blog/2011/08/23/pure-storage-surfaces/#comments">No comment</a> |
Add to
<a href="http://del.icio.us/post?url=http://silvertonconsulting.com/blog/2011/08/23/pure-storage-surfaces/&title=Pure Storage surfaces">del.icio.us</a>
<br/>
Post tags: <a href="http://silvertonconsulting.com/blog/tag/all-ssd-storage/" rel="tag">All SSD storage</a>, <a href="http://silvertonconsulting.com/blog/tag/data-compression/" rel="tag">data compression</a>, <a href="http://silvertonconsulting.com/blog/tag/data-deduplication/" rel="tag">Data deduplication</a>, <a href="http://silvertonconsulting.com/blog/tag/deduplication/" rel="tag">Deduplication</a>, <a href="http://silvertonconsulting.com/blog/tag/mlc-nand/" rel="tag">MLC NAND</a>, <a href="http://silvertonconsulting.com/blog/tag/ssd/" rel="tag">SSD</a>, <a href="http://silvertonconsulting.com/blog/tag/vmware/" rel="tag">VMware</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://silvertonconsulting.com/blog/2011/08/23/pure-storage-surfaces/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data storage features for virtual desktop infrastructure (VDI) deployments</title>
		<link>http://silvertonconsulting.com/blog/2010/09/10/data-storage-features-for-virtual-desktop-infrastructure-vdi-deployments/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=data-storage-features-for-virtual-desktop-infrastructure-vdi-deployments</link>
		<comments>http://silvertonconsulting.com/blog/2010/09/10/data-storage-features-for-virtual-desktop-infrastructure-vdi-deployments/#comments</comments>
		<pubDate>Fri, 10 Sep 2010 20:24:15 +0000</pubDate>
		<dc:creator>Ray</dc:creator>
				<category><![CDATA[desktop virtualization]]></category>
		<category><![CDATA[File Storage]]></category>
		<category><![CDATA[Storage Features]]></category>
		<category><![CDATA[Storage performance]]></category>
		<category><![CDATA[Storage reliability]]></category>
		<category><![CDATA[Anti-Virus scan]]></category>
		<category><![CDATA[boot storm]]></category>
		<category><![CDATA[Deduplication]]></category>
		<category><![CDATA[High availability]]></category>
		<category><![CDATA[High reliability]]></category>
		<category><![CDATA[Snapshot]]></category>
		<category><![CDATA[VDI]]></category>
		<category><![CDATA[Virtual Desktop Infrastructure]]></category>

		<guid isPermaLink="false">http://silvertonconsulting.com/blog/?p=2217</guid>
		<description><![CDATA[Was talking with someone yesterday about one of my favorite topics, data storage for virtual desktop infrastructure (VDI) deployments.  In my mind there are a few advanced storage features that help considerably with VDI implemetations: Deduplication &#8211; almost every one &#8230; <a href="http://silvertonconsulting.com/blog/2010/09/10/data-storage-features-for-virtual-desktop-infrastructure-vdi-deployments/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<div id="attachment_2221" class="wp-caption alignright" style="width: 210px"><a href="http://silvertonconsulting.com/blog/wp-content/uploads/2010/09/4878815609_c2ed45bbd6_b.jpg"><img class="size-medium wp-image-2221" title="The Planet Data Center by The Planet (cc) (from Flickr)" src="http://silvertonconsulting.com/blog/wp-content/uploads/2010/09/4878815609_c2ed45bbd6_b-200x300.jpg" alt="The Planet Data Center by The Planet (cc) (from Flickr)" width="200" height="300" /></a><p class="wp-caption-text">The Planet Data Center by The Planet (cc) (from Flickr)</p></div>
<p>Was talking with someone yesterday about one of my favorite topics, data storage for virtual desktop infrastructure (VDI) deployments.  In my mind there are a few advanced storage features that help considerably with VDI implemetations:</p>
<ul>
<li><strong>Deduplication &#8211; </strong>almost every one of your virtual desktops will share 75-90% of their O/S disk data with every other virtual desktop.  Having sub-file/sub-block deduplication can be a godsend for all this replicated data and reduce O/S storage requirements considerably.</li>
<li><strong>0 storage snapshots/clones &#8211; </strong>another solution to the duplication of O/S data is to use some sort of space conserving snapshots.  For example, one creates a master (gold) disk image and makes 100s if not 1000s of snapshots of it, taking almost no additional space.</li>
<li><strong>Highly available/highly reliable storage &#8211; </strong>when you have a lone desktop dependent on DAS for it&#8217;s O/S, it doesn&#8217;t impact a lot of users if that device fails. However, when you have 100s to 1000s of users dependent on DAS device(s) for their O/S software, any DAS failure could impact all of them at the same time.  As such, one needs to move off DAS and invest in highly reliable and available external storage of some kind to sustain reasonable uptime for your user community.</li>
</ul>
<p>Those seem to me to be the most important attributes for VDI storage but there are a couple more features/facilities which can also:</p>
<ul>
<li><strong>NAS systems with NFS</strong> &#8211; VDI deployments will generate lots of VMDKs for all the user desktop C: drives.  Although this can be managed with block level storage as separate LUNs or multi-VMDK LUNs, who want&#8217;s to configure a 100 to 1000 LUNs.  NFS files can perform just as well and are much easier to create on the fly and thus, for VDI it&#8217;s hard to beat NFS storage.</li>
<li><strong>Boot storm enhancements &#8211; </strong>Another problem with VDI is that everyone gets to work 8am Monday and proceeds to boot up their (virtual) machines, which drives an awful lot of IO to their virtual C: drives.  Deduplication and 0 storage snapshots can help manage the boot storm as long as these characteristics are retained throughout system cache, i.e, deduplication exists in cache as well as on backend disk.  But there are other approaches to the problem as well, available from various vendors to better manage boot storms.</li>
<li><strong>Anti-Virus scan enhancements &#8211; </strong>Similar to boot storms, A-V scans also typically happen around the same time for many desktop users and can be just as bad for virtual C: drive performance.  Again, deduplication or 0 storage snapshots can help (with above caveats) but some vendor storage can offload these activities from the desktop alltogether.  Also last weeks VMworld release of VMware&#8217;s vShield Edge (see <a href="http://silvertonconsulting.com/blog/2010/09/08/vmworld-2010-review/" target="_blank">VMworld 2010 review</a>) also supports some A-V scan enhancements.<strong> </strong>Any of these approaches should be able to help.</li>
</ul>
<p>Regular &#8220;dumb&#8221; block storage will always work but it will require a lot more raw storage, performance will suffer just when everybody gets back to work, and the administrative burden will be much higher.</p>
<p>I may seem biased but enterprise class reliability&amp;availability with some of the advanced storage features described above can help make your deployment of VDI that much better for you and all your knowledge workers.</p>
<p>Anything I missed?</p>
<img src="http://silvertonconsulting.com/blog/?ak_action=api_record_view&id=2217&type=feed" alt="" /><hr />
<p><small>© RayOnStorage.com. for <a href="http://silvertonconsulting.com/blog">RayOnStorage Blog</a>, 2010. |
<a href="http://silvertonconsulting.com/blog/2010/09/10/data-storage-features-for-virtual-desktop-infrastructure-vdi-deployments/">Permalink</a> |
<a href="http://silvertonconsulting.com/blog/2010/09/10/data-storage-features-for-virtual-desktop-infrastructure-vdi-deployments/#comments">One comment</a> |
Add to
<a href="http://del.icio.us/post?url=http://silvertonconsulting.com/blog/2010/09/10/data-storage-features-for-virtual-desktop-infrastructure-vdi-deployments/&title=Data storage features for virtual desktop infrastructure (VDI) deployments">del.icio.us</a>
<br/>
Post tags: <a href="http://silvertonconsulting.com/blog/tag/anti-virus-scan/" rel="tag">Anti-Virus scan</a>, <a href="http://silvertonconsulting.com/blog/tag/boot-storm/" rel="tag">boot storm</a>, <a href="http://silvertonconsulting.com/blog/tag/deduplication/" rel="tag">Deduplication</a>, <a href="http://silvertonconsulting.com/blog/tag/high-availability/" rel="tag">High availability</a>, <a href="http://silvertonconsulting.com/blog/tag/high-reliability/" rel="tag">High reliability</a>, <a href="http://silvertonconsulting.com/blog/tag/snapshot/" rel="tag">Snapshot</a>, <a href="http://silvertonconsulting.com/blog/tag/vdi/" rel="tag">VDI</a>, <a href="http://silvertonconsulting.com/blog/tag/virtual-desktop-infrastructure/" rel="tag">Virtual Desktop Infrastructure</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://silvertonconsulting.com/blog/2010/09/10/data-storage-features-for-virtual-desktop-infrastructure-vdi-deployments/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Enterprise data storage defined and why 3PAR?</title>
		<link>http://silvertonconsulting.com/blog/2010/08/25/enterprise-data-storage-defined-and-why-3par/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=enterprise-data-storage-defined-and-why-3par</link>
		<comments>http://silvertonconsulting.com/blog/2010/08/25/enterprise-data-storage-defined-and-why-3par/#comments</comments>
		<pubDate>Wed, 25 Aug 2010 15:12:46 +0000</pubDate>
		<dc:creator>Ray</dc:creator>
				<category><![CDATA[Block Storage]]></category>
		<category><![CDATA[Data growth]]></category>
		<category><![CDATA[File Storage]]></category>
		<category><![CDATA[Storage availability]]></category>
		<category><![CDATA[Storage Features]]></category>
		<category><![CDATA[Storage performance]]></category>
		<category><![CDATA[Storage reliability]]></category>
		<category><![CDATA[3PAR]]></category>
		<category><![CDATA[cloud computing services]]></category>
		<category><![CDATA[Cloud Storage]]></category>
		<category><![CDATA[Dell]]></category>
		<category><![CDATA[enterprise storage]]></category>
		<category><![CDATA[HP]]></category>
		<category><![CDATA[IDC]]></category>
		<category><![CDATA[Microsoft Hyper-V]]></category>
		<category><![CDATA[Server virtualization]]></category>
		<category><![CDATA[VMware]]></category>

		<guid isPermaLink="false">http://silvertonconsulting.com/blog/?p=2170</guid>
		<description><![CDATA[Recent press reports about a bidding war for 3PAR bring into focus the expanding need for enterprise class data storage subsystems.  What exactly is enterprise storage? Defining enterprise storage is frought with problems but I will take a shot.  Enterprise &#8230; <a href="http://silvertonconsulting.com/blog/2010/08/25/enterprise-data-storage-defined-and-why-3par/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<div id="attachment_1707" class="wp-caption alignleft" style="width: 370px"><a href="http://silvertonconsulting.com/blog/wp-content/uploads/2010/04/photo3.jpg"><img class="size-full wp-image-1707 " title="More SNW hall servers and storage " src="http://silvertonconsulting.com/blog/wp-content/uploads/2010/04/photo3.jpg" alt="More SNW hall servers and storage" width="360" height="480" /></a><p class="wp-caption-text">More SNW hall servers and storage </p></div>
<p>Recent press reports about a bidding war for 3PAR bring into focus the expanding need for enterprise class data storage subsystems.  What exactly is enterprise storage?</p>
<p>Defining enterprise storage is frought with problems but I will take a shot.  Enterprise class data storage has:</p>
<ul>
<li>Enhanced reliability, high availability and serviceability &#8211; meaning it hardly ever fails, it keeps operating (on redundant components) when it does fail, and repairing the storage when the rare failure occurs can be accomplished without disrupting ongoing storage services</li>
<li>Extreme data integrity &#8211; goes beyond just RAID storage, meaning that these systems lose data very infrequently, provide the latest data written to a location when read and will tell you when data cannot be accessed.</li>
<li>Automated I/O performance &#8211; meaning sophisticated caching algorithms that try to keep ahead of sequential I/O streams, buffer actively read data, and buffer write data in non-volatile cache before destaging to disk or other media.</li>
<li>Multiple types of storage &#8211; meaning the system supports SATA, SAS and/or FC disk drives and SSDs or Flash storage</li>
<li>PBs of storage &#8211; meaning behind one enterprise class storage (sub-)system one can support over 1PB of storage</li>
<li>Sophisticated functionality &#8211; meaning the system supports multiple forms of offsite replication, thin provisioning, storage tiering, point-in-time copies, data cloning, administration GUIs/CLIs, etc.</li>
<li>Compatibility with all enterprise O/Ss &#8211; meaning the storage has been tested and is on hardware compatibility lists for every major operating system in use by the enterprise today.</li>
</ul>
<p>As for storage protocol, it seems best to leave this off the list.  I wanted to just add block storage, but enterprises today probably have as much if not more external file storage (CIFS or NFS) as they have block storage (FC or iSCSI).  And the proportion in file systems seems to be growing (see IDC report referenced below).</p>
<p>In addition, while I don&#8217;t like the non-determinism of iSCSI or file access protocols, this doesn&#8217;t seem to stop such storage from putting up pretty impressive performance numbers (see our <a href="http://silvertonconsulting.com/cms1/dispatches/" target="_blank">performance dispatches</a>).  Anything that can crack 100K I/O or file operations per second probably deserves to call themselves enterprise storage as long as they meet the other requirements.  So, maybe I should add high-performance storage to the list above.</p>
<h3>Why the sudden interest in enterprise storage?</h3>
<p>Enterprise storage has been around arguably since the 2nd half of last century (for mainframe systems) but lately has become even more interesting as applications deploy to the cloud and server virtualization (from VMware, Microsoft Hyper-V and others) takes over the data center.</p>
<p>Cloud storage and cloud computing services are lowering the entry points for storage and processing, enabling application deployments which were heretofore unaffordable.  These new cloud applications consume storage at increasing rates and don&#8217;t seem to be slowing down any time soon.  Arguably, some cloud storage is not enterprise storage but as service levels go up for these applications, providers must ultimately turn to enterprise storage.</p>
<p>In addition, server virtualization transforms the enterprise data center from a single application per server to easily 5 or more applications per physical server.  This trend is raising server utilization, driving more I/O, and requiring higher capacity.  Such &#8220;multi-application&#8221; storage almost always requires high availability, reliability and performance to work well, generating even more demand for enterprise data storage systems.</p>
<p>Despite all the demand, <a href="http://www.storagenewsletter.com/news/systems/idc-ww-disk-storage-systems-revenue-2009" target="_blank">world wide external storage revenues dropped 12% last year according to IDC</a>.  Now the economy had a lot to do with this decline but another factor reducing external storage revenue is the ongoing drop in the price of storage on a $/GB basis.  To this point, that same IDC report stated that external storage capacity increased 33% last year.</p>
<p><strong>Why Dell &amp; HP wants 3PAR storage?</strong></p>
<p>Margins on enterprise storage are good, some would say very good.  While raw disk storage can be had at under $0.50/GB, enterprise class storage is often 10 or more times that price.  Now that has to cover redundant hardware, software/firmware engineering and other characteristics, but this still leaves pretty good margins.</p>
<p>In my mind, Dell would see enterprise storage as a natural extension of their current enterprise server business.  They already sell and support these customers, including enterprise class storage just adds another product to the mix.  Developing enterprise storage from scratch is probably a 4-7 year journey with the right people, buying 3PAR puts them in the market today with a competitive product.</p>
<p>HP is already in the enterprise storage market today, with their XP and EVA storage subsystems.  However, having their own 3PAR enterprise class storage may get them better margins than their current XP storage OEMed from HDS.  But I think Chuck Hollis&#8217;s post on <a href="http://chucksblog.emc.com/chucks_blog/2010/08/now-hp-gets-into-the-act-for-3par.html" target="_blank">HP&#8217;s counter bid for 3PAR</a> may have revealed another side to this discussion &#8211; sometime M&amp;A is as much about constraining your competition as it is about adding new capabilities to a company.</p>
<p>&#8212;&#8212;</p>
<p>What do you think?</p>
<img src="http://silvertonconsulting.com/blog/?ak_action=api_record_view&id=2170&type=feed" alt="" /><hr />
<p><small>© RayOnStorage.com. for <a href="http://silvertonconsulting.com/blog">RayOnStorage Blog</a>, 2010. |
<a href="http://silvertonconsulting.com/blog/2010/08/25/enterprise-data-storage-defined-and-why-3par/">Permalink</a> |
<a href="http://silvertonconsulting.com/blog/2010/08/25/enterprise-data-storage-defined-and-why-3par/#comments">One comment</a> |
Add to
<a href="http://del.icio.us/post?url=http://silvertonconsulting.com/blog/2010/08/25/enterprise-data-storage-defined-and-why-3par/&title=Enterprise data storage defined and why 3PAR?">del.icio.us</a>
<br/>
Post tags: <a href="http://silvertonconsulting.com/blog/tag/3par/" rel="tag">3PAR</a>, <a href="http://silvertonconsulting.com/blog/tag/cloud-computing-services/" rel="tag">cloud computing services</a>, <a href="http://silvertonconsulting.com/blog/tag/cloud-storage/" rel="tag">Cloud Storage</a>, <a href="http://silvertonconsulting.com/blog/tag/dell/" rel="tag">Dell</a>, <a href="http://silvertonconsulting.com/blog/tag/enterprise-storage/" rel="tag">enterprise storage</a>, <a href="http://silvertonconsulting.com/blog/tag/hp/" rel="tag">HP</a>, <a href="http://silvertonconsulting.com/blog/tag/idc/" rel="tag">IDC</a>, <a href="http://silvertonconsulting.com/blog/tag/microsoft-hyper-v/" rel="tag">Microsoft Hyper-V</a>, <a href="http://silvertonconsulting.com/blog/tag/server-virtualization/" rel="tag">Server virtualization</a>, <a href="http://silvertonconsulting.com/blog/tag/vmware/" rel="tag">VMware</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://silvertonconsulting.com/blog/2010/08/25/enterprise-data-storage-defined-and-why-3par/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>WD&#8217;s new SiliconEdge Blue SSD data write spec</title>
		<link>http://silvertonconsulting.com/blog/2010/03/04/wds-new-siliconedge-blue-ssd-data-write-spec/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=wds-new-siliconedge-blue-ssd-data-write-spec</link>
		<comments>http://silvertonconsulting.com/blog/2010/03/04/wds-new-siliconedge-blue-ssd-data-write-spec/#comments</comments>
		<pubDate>Thu, 04 Mar 2010 17:02:34 +0000</pubDate>
		<dc:creator>Ray</dc:creator>
				<category><![CDATA[Storage]]></category>
		<category><![CDATA[Storage reliability]]></category>
		<category><![CDATA[data write specification]]></category>
		<category><![CDATA[MLC NAND]]></category>
		<category><![CDATA[NAND]]></category>
		<category><![CDATA[SSD]]></category>
		<category><![CDATA[WD SiliconEdge]]></category>
		<category><![CDATA[Western Digital]]></category>

		<guid isPermaLink="false">http://silvertonconsulting.com/blog/?p=1591</guid>
		<description><![CDATA[Western Digital (WD) announced their first SSD drive for the desktop/laptop market space today.  Their drive offers the typical256, 128, and 64GB capacity points over a SATA interface.  Performance looks ok at 5K random read or write IO/s with sustained &#8230; <a href="http://silvertonconsulting.com/blog/2010/03/04/wds-new-siliconedge-blue-ssd-data-write-spec/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<div id="attachment_1593" class="wp-caption alignright" style="width: 110px"><a href="http://silvertonconsulting.com/blog/wp-content/uploads/2010/03/wdfSSD_EdgeBlue.jpg"><img class="size-full wp-image-1593 " title="Western Digital's SiliconEdge Blue SSD SATA drive (from their website)" src="http://silvertonconsulting.com/blog/wp-content/uploads/2010/03/wdfSSD_EdgeBlue.jpg" alt="Western Digital's Silicon Edge Blue SSD SATA drive (from their website)" width="100" height="100" /></a><p class="wp-caption-text">Western Digital&#39;s SiliconEdge Blue SSD SATA drive (from their website)</p></div>
<p>Western Digital (WD) announced their first SSD drive for the desktop/laptop market space today.  Their drive offers the typical256, 128, and 64GB capacity points over a SATA interface.  Performance looks ok at 5K random read or write IO/s with sustained transfers at 250 and 140MB/s for read and write respectively.  But what caught my eye was a new specification I hadn&#8217;t seen before indicating <a href="http://www.wdc.com/wdproducts/library/SpecSheet/ENG/2879-771357.pdf" target="_blank">Maximum GB written per day</a> of 17.5, 35 and 70GB/d for their drives using WD&#8217;s Operational Lifespan &#8211; LifeEST(tm) definition.</p>
<p>I couldn&#8217;t find anywhere that said which NAND technology was used in the device but it likely uses MLC NAND.  In a prior posting we discussed a <a href="http://silvertonconsulting.com/blog/2010/01/08/toshiba-studies-laptop-write-rates-confirming-ssd-longevity/" target="_blank">Toshiba study</a> that said a &#8220;typical&#8221; laptop user writes about 2.4GB/d and a &#8220;heavy&#8221; laptop user writes about 9.2GB/d.  This data would indicate that WD&#8217;s new 64GB drive can handle almost 2X the defined &#8220;heavy&#8221; user workload for laptops and their other drives would handle it just fine.  A data write rate for desktop work, as far as I can tell, has not been published, but presumably it would be greater than laptop users.</p>
<p>From my perspective more information on the drives underlying NAND technology, on what a LifeEST specification actually means, and a specification as to how much NAND storage was actually present would be nice, but these are all personal nits.  All that aside, I applaud WD for standing up and saying what data write rate their drives can support.  This needs to be a standard part of any SSD specification sheet and I look forward to seeing more information like this coming from other vendors as well.</p>
<img src="http://silvertonconsulting.com/blog/?ak_action=api_record_view&id=1591&type=feed" alt="" /><hr />
<p><small>© RayOnStorage.com. for <a href="http://silvertonconsulting.com/blog">RayOnStorage Blog</a>, 2010. |
<a href="http://silvertonconsulting.com/blog/2010/03/04/wds-new-siliconedge-blue-ssd-data-write-spec/">Permalink</a> |
<a href="http://silvertonconsulting.com/blog/2010/03/04/wds-new-siliconedge-blue-ssd-data-write-spec/#comments">No comment</a> |
Add to
<a href="http://del.icio.us/post?url=http://silvertonconsulting.com/blog/2010/03/04/wds-new-siliconedge-blue-ssd-data-write-spec/&title=WD&#8217;s new SiliconEdge Blue SSD data write spec">del.icio.us</a>
<br/>
Post tags: <a href="http://silvertonconsulting.com/blog/tag/data-write-specification/" rel="tag">data write specification</a>, <a href="http://silvertonconsulting.com/blog/tag/mlc-nand/" rel="tag">MLC NAND</a>, <a href="http://silvertonconsulting.com/blog/tag/nand/" rel="tag">NAND</a>, <a href="http://silvertonconsulting.com/blog/tag/ssd/" rel="tag">SSD</a>, <a href="http://silvertonconsulting.com/blog/tag/wd-siliconedge/" rel="tag">WD SiliconEdge</a>, <a href="http://silvertonconsulting.com/blog/tag/western-digital/" rel="tag">Western Digital</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://silvertonconsulting.com/blog/2010/03/04/wds-new-siliconedge-blue-ssd-data-write-spec/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Atmos GeoProtect vs RAID</title>
		<link>http://silvertonconsulting.com/blog/2010/02/18/atmos-geoprotect-vs-raid/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=atmos-geoprotect-vs-raid</link>
		<comments>http://silvertonconsulting.com/blog/2010/02/18/atmos-geoprotect-vs-raid/#comments</comments>
		<pubDate>Thu, 18 Feb 2010 22:55:36 +0000</pubDate>
		<dc:creator>Ray</dc:creator>
				<category><![CDATA[Data availability]]></category>
		<category><![CDATA[Data integrity]]></category>
		<category><![CDATA[Storage]]></category>
		<category><![CDATA[Storage reliability]]></category>
		<category><![CDATA[Atmos]]></category>
		<category><![CDATA[Cloud Storage]]></category>
		<category><![CDATA[EMC]]></category>
		<category><![CDATA[GeoProtect]]></category>
		<category><![CDATA[RAID]]></category>
		<category><![CDATA[Reed-Solomon codes]]></category>

		<guid isPermaLink="false">http://silvertonconsulting.com/blog/?p=1532</guid>
		<description><![CDATA[Yesterday, twitterland was buzzing about EMC&#8217;s latest enhancement to their Atmos Cloud Storage platform called GeoProtect.  This new capability improves cloud data protection by supporting erasure code data protection rather than just pure object replication. Erasure coding has been used &#8230; <a href="http://silvertonconsulting.com/blog/2010/02/18/atmos-geoprotect-vs-raid/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<div id="attachment_1536" class="wp-caption alignleft" style="width: 310px"><a href="http://silvertonconsulting.com/blog/wp-content/uploads/2010/02/2712985768_a663364225_b.jpg"><img class="size-medium wp-image-1536" title="The Night Lights of Europe (as seen from space) by woodleywonderworks (cc) (from flickr)" src="http://silvertonconsulting.com/blog/wp-content/uploads/2010/02/2712985768_a663364225_b-300x225.jpg" alt="The Night Lights of Europe (as seen from space) by woodleywonderworks (cc) (from flickr)" width="300" height="225" /></a><p class="wp-caption-text">The Night Lights of Europe (as seen from space) by woodleywonderworks (cc) (from flickr)</p></div>
<p>Yesterday, twitterland was buzzing about EMC&#8217;s latest enhancement to their Atmos Cloud Storage platform called GeoProtect.  This new capability improves cloud data protection by supporting erasure code data protection rather than just pure object replication.</p>
<p>Erasure coding has been used for over a decade in storage and some of the <a href="http://www.cs.utk.edu/~plank/plank/papers/FAST-2005.pdf" target="_blank">common algorithms are Reed-Solomon, Cauchy Reed-Soloman, EVENODD coding, etc</a>.  All these algorithms provide a way for splitting up customer data into data instances and parity (encoding) to allow some number of data or parity instances to be erased (or lost) while still providing customer data.  For example, a R-S encoding scheme we used in the past (called RAID 6+) had 13 data fragments and 2 parity fragments.  Such an encoding scheme supported the simultaneous failure of any two drives and could still supply (reconstruct) customer data.</p>
<p>But how does RAID differ from something like GeoProtect.</p>
<ul>
<li>RAID is typically within a storage array and not across storage arrays</li>
<li>RAID is typically limited to a small number of alternative configurations of data disks and parity disks which cannot be altered in the field, and</li>
<li>Currently, RAID typically doesn&#8217;t support more than two disk failures while still being able to recover customer data (see <a href="http://silvertonconsulting.com/blog/2009/10/23/are-raids-days-numbered/" target="_blank">Are RAIDs days numbered?</a>)</li>
</ul>
<p>As I understand it GeoProtect currently supports only two different encoding schemes which can provide for different levels of data instance failures while still protecting customer data.  And with GeoProtect you are protecting data across Atmos nodes and potentially across different geographic locations not just within storage arrays.  Also, with Atmos this is all policy driven and data that comes into the system can use any object replication policy or either of the two GeoProtect policies supported today.</p>
<p>Although the nice thing about R-S encoding is that it doesn&#8217;t have to be fixed to two different encoding schemes.  And as it&#8217;s all software, new coding schemes could easily be released over time, possibly someday being entirely something a user could dial up or down at their whim.</p>
<p>But this would seem much more like what <a href="http://www.cleversafe.com/" target="_blank">Cleversafe</a> has been offering in their SliceStor product.  With Cleversafe the user can specify exactly how much redundancy they want to support and the system takes care of everything else.  In addition, Cleversafe has implemented a more fine grained approach (with many more fragments) and data and parity are intermingled in each stored fragment.</p>
<p>It&#8217;s not a big stretch for Atmos to go from two GeoProtect configurations to four or more.  Unclear to me what the right number would be but once you get past 3 or so, it might be easier to just code a generic R-S routine that can handle any configuration the customer wants but I may be oversimplifying the mathematics here.</p>
<p>Nonetheless, in future versions of Atmos I wouldn&#8217;t be surprised if it&#8217;s possible that through policy management the way data is protected could change over time. Specifically, while data is being frequently accessed, one could use object replication or less compressed encoding to speed up access but once access frequency diminishes (or time passes), data can then protected with more storage efficient encoding schemes which would reduce the data footprint in the cloud while still offering similar resiliency to data loss.</p>
<p>Full disclosure I have worked for Cleversafe in the past and although I am currently working with EMC, I have had no work from EMC&#8217;s Atmos team.</p>
<img src="http://silvertonconsulting.com/blog/?ak_action=api_record_view&id=1532&type=feed" alt="" /><hr />
<p><small>© RayOnStorage.com. for <a href="http://silvertonconsulting.com/blog">RayOnStorage Blog</a>, 2010. |
<a href="http://silvertonconsulting.com/blog/2010/02/18/atmos-geoprotect-vs-raid/">Permalink</a> |
<a href="http://silvertonconsulting.com/blog/2010/02/18/atmos-geoprotect-vs-raid/#comments">2 comments</a> |
Add to
<a href="http://del.icio.us/post?url=http://silvertonconsulting.com/blog/2010/02/18/atmos-geoprotect-vs-raid/&title=Atmos GeoProtect vs RAID">del.icio.us</a>
<br/>
Post tags: <a href="http://silvertonconsulting.com/blog/tag/atmos/" rel="tag">Atmos</a>, <a href="http://silvertonconsulting.com/blog/tag/cloud-storage/" rel="tag">Cloud Storage</a>, <a href="http://silvertonconsulting.com/blog/tag/emc/" rel="tag">EMC</a>, <a href="http://silvertonconsulting.com/blog/tag/geoprotect/" rel="tag">GeoProtect</a>, <a href="http://silvertonconsulting.com/blog/tag/raid/" rel="tag">RAID</a>, <a href="http://silvertonconsulting.com/blog/tag/reed-solomon-codes/" rel="tag">Reed-Solomon codes</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://silvertonconsulting.com/blog/2010/02/18/atmos-geoprotect-vs-raid/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Intel-Micron new 25nm/8GB MLC NAND chip</title>
		<link>http://silvertonconsulting.com/blog/2010/02/02/intel-micron-new-25nm8gb-mlc-nand-chip/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=intel-micron-new-25nm8gb-mlc-nand-chip</link>
		<comments>http://silvertonconsulting.com/blog/2010/02/02/intel-micron-new-25nm8gb-mlc-nand-chip/#comments</comments>
		<pubDate>Tue, 02 Feb 2010 17:56:33 +0000</pubDate>
		<dc:creator>Ray</dc:creator>
				<category><![CDATA[Storage]]></category>
		<category><![CDATA[Storage density]]></category>
		<category><![CDATA[Storage reliability]]></category>
		<category><![CDATA[25nm technology]]></category>
		<category><![CDATA[IMFT]]></category>
		<category><![CDATA[Intel-Micron]]></category>
		<category><![CDATA[MLC NAND]]></category>
		<category><![CDATA[NAND]]></category>
		<category><![CDATA[SSD]]></category>

		<guid isPermaLink="false">http://silvertonconsulting.com/blog/?p=1457</guid>
		<description><![CDATA[Intel-Micron Flash Technologies just issued another increase in NAND density. This one&#8217;s manages to put 8GB on a single chip with MLC(2) technology in a 167mm square package or roughly a half inch per side. You may recall that Intel-Micron &#8230; <a href="http://silvertonconsulting.com/blog/2010/02/02/intel-micron-new-25nm8gb-mlc-nand-chip/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
			<content:encoded><![CDATA[<div id="attachment_1458" class="wp-caption alignright" style="width: 310px"><a href="http://silvertonconsulting.com/blog/wp-content/uploads/2010/02/intel_and_micron_in_25nm_nand_technology.jpg"><img class="size-medium wp-image-1458" title="intel_and_micron_in_25nm_nand_technology" src="http://silvertonconsulting.com/blog/wp-content/uploads/2010/02/intel_and_micron_in_25nm_nand_technology-300x172.jpg" alt="intel_and_micron_in_25nm_nand_technology" width="300" height="172" /></a><p class="wp-caption-text">intel_and_micron_in_25nm_nand_technology</p></div>
<p><a href="http://www.imftech.com/news/release_1feb10.html">Intel-Micron Flash Technologies</a> just issued another increase in NAND density.  This one&#8217;s manages to put 8GB on a single chip with MLC(2) technology in a 167mm square package or roughly a half inch per side.</p>
<p>You may recall that Intel-Micron Flash Technologies (IMFT) is a joint venture between Intel and Micron to develop NAND technology chips. IMFT chips can be used by any vendor and typically show up in Intel SSDs as well as other vendor systems.  MLC technology is more suitable for use in consumer applications but at these densities it&#8217;s starting to make sense for use by data centers as well. We have written before about MLC NAND used in the <a href="http://silvertonconsulting.com/blog/2009/08/13/stecs-mlc-enterprise-ssd/" target="_blank">enterprise disk by STEC</a> and <a href="http://silvertonconsulting.com/blog/2009/08/06/toshibas-new-mlc-nand-flash-ssds/" target="_blank">Toshiba&#8217;s MLC SSDs</a>. But in essence MLC NAND reliability and endurability will ultimately determine its place in the enterprise.</p>
<p>But at these densities, you can just throw more capacity at the problem to mask MLC endurance concerns. For example, with this latest chip, one could conceivably have a single layer 2.5&#8243; configuration with almost 200GBs of MLC NAND. If you wanted to configure this as 128GB SSD you could use the additional 72GB of NAND for failing pages. Doing this could conceivably add more than 50% to the life of an SSD.</p>
<p>SLC still has better (~10X) endurance but being able to ship 2X the capacity in the same footprint can help.  Of course, MLC and SLC NAND can be combined in a hybrid device to give some approximation of SLC reliability at MLC costs.</p>
<p>IMFT made no mention of SLC NAND chips at the 25nm technology node but presumably this will be forthcoming shortly.  As such, if we assume the technology can support a 4GB SLC NAND in a 167mm**2 chip it should be of significant interest to most enterprise SSD vendors.</p>
<p>A couple of things missing from yesterday&#8217;s IMFT press release, namely</p>
<ul>
<li>read/write performance specifications for the NAND chip</li>
<li>write endurance specifications for the NAND chip</li>
</ul>
<p>SSD performance is normally a function of all the technology that surrounds the NAND chip but it all starts with the chip.  Also, MLC used to be capable of 10,000 write/erase cycles and SLC was capable of 100,000 w/e cycles but most recent <a href="http://silvertonconsulting.com/blog/2010/01/08/toshiba-studies-laptop-write-rates-confirming-ssd-longevity/" target="_blank">technology from Toshiba</a> (presumably 34nm technology) shows a MLC NAND write/erase endurance of only 1400 cycles.  Which seems to imply that <strong>as the NAND technology increases density write endurance rates degrade</strong>. How much is subject to much debate and with the lack of any standardized w/e endurance specifications and reporting, it&#8217;s hard to see how bad it gets.</p>
<p>The bottom line, capacity is great but we need to know w/e endurance to really see where this new technology fits.  Ultimately, if endurance degrades significantly such NAND technology will only be suitable for consumer products.  Of course at ~10X (just guessing) the size of the enterprise market maybe that&#8217;s ok.</p>
<img src="http://silvertonconsulting.com/blog/?ak_action=api_record_view&id=1457&type=feed" alt="" /><hr />
<p><small>© RayOnStorage.com. for <a href="http://silvertonconsulting.com/blog">RayOnStorage Blog</a>, 2010. |
<a href="http://silvertonconsulting.com/blog/2010/02/02/intel-micron-new-25nm8gb-mlc-nand-chip/">Permalink</a> |
<a href="http://silvertonconsulting.com/blog/2010/02/02/intel-micron-new-25nm8gb-mlc-nand-chip/#comments">No comment</a> |
Add to
<a href="http://del.icio.us/post?url=http://silvertonconsulting.com/blog/2010/02/02/intel-micron-new-25nm8gb-mlc-nand-chip/&title=Intel-Micron new 25nm/8GB MLC NAND chip">del.icio.us</a>
<br/>
Post tags: <a href="http://silvertonconsulting.com/blog/tag/25nm-technology/" rel="tag">25nm technology</a>, <a href="http://silvertonconsulting.com/blog/tag/imft/" rel="tag">IMFT</a>, <a href="http://silvertonconsulting.com/blog/tag/intel-micron/" rel="tag">Intel-Micron</a>, <a href="http://silvertonconsulting.com/blog/tag/mlc-nand/" rel="tag">MLC NAND</a>, <a href="http://silvertonconsulting.com/blog/tag/nand/" rel="tag">NAND</a>, <a href="http://silvertonconsulting.com/blog/tag/ssd/" rel="tag">SSD</a><br/>
</small></p>]]></content:encoded>
			<wfw:commentRss>http://silvertonconsulting.com/blog/2010/02/02/intel-micron-new-25nm8gb-mlc-nand-chip/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using disk: basic
Page Caching using disk: enhanced (Requested URI is rejected)
Database Caching 23/32 queries in 0.105 seconds using disk: basic

Served from: www.silvertonconsulting.com @ 2012-02-11 13:23:08 -->
