<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:clearspace="http://www.jivesoftware.com/xmlns/clearspace/rss" xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0">
  <channel>
    <title>Intel Communities : All Content - The Data Stack</title>
    <link>http://communities.intel.com/community/datastack</link>
    <description>All Content in The Data Stack</description>
    <language>en</language>
    <pubDate>Mon, 17 Jun 2013 20:41:09 GMT</pubDate>
    <generator>Jive SBS 5.0.2.0  (http://jivesoftware.com/products/clearspace/)</generator>
    <dc:date>2013-06-17T20:41:09Z</dc:date>
    <dc:language>en</dc:language>
    <item>
      <title>Powering the Way to Exascale Infographic</title>
      <link>http://communities.intel.com/docs/DOC-21222</link>
      <description>&lt;!-- [DocumentBodyStart:bcdb7a42-bdbe-4b82-ad30-5fa54e40b012] --&gt;&lt;div class="jive-rendered-content"&gt;&lt;/div&gt;&lt;!-- [DocumentBodyEnd:bcdb7a42-bdbe-4b82-ad30-5fa54e40b012] --&gt;</description>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">xeon</category>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">supercomputer</category>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">high_performance_computing</category>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">cluster_computing</category>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">big_data</category>
      <pubDate>Mon, 17 Jun 2013 20:41:09 GMT</pubDate>
      <author>webadmin@intel.com</author>
      <guid>http://communities.intel.com/docs/DOC-21222</guid>
      <dc:date>2013-06-17T20:41:09Z</dc:date>
      <clearspace:dateToText>2 days, 59 minutes ago</clearspace:dateToText>
      <clearspace:objectType>0</clearspace:objectType>
    </item>
    <item>
      <title>HP Moonshot: Lifting Off with Intel Atom and Xeon Processors</title>
      <link>http://communities.intel.com/community/datastack/blog/2013/06/14/hp-moonshot-lifting-off-with-intel-atom-and-xeon-processors</link>
      <description>&lt;!-- [DocumentBodyStart:d88311f5-eb35-4dd7-972e-ff0f5e3fd8c7] --&gt;&lt;div class="jive-rendered-content"&gt;&lt;p&gt;On my way back from Vegas now &amp;#8211; a jam packed, 36 hours where I never saw the sun and never left the hotel (and didn&amp;#8217;t sleep much) &amp;#8211; but well worth the trip all the same.&amp;nbsp; I attended HP&amp;#8217;s Discover conference for two days with a primary focus to listen to HP&amp;#8217;s top executives and a number of their customers discuss the new Moonshot system that launched just this past April.&amp;nbsp; And it wasn&amp;#8217;t hard to do &amp;#8211; Moonshot certainly was the rock star of the event!&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;Honestly, I was expecting a lather, rinse, repeat from the recent launch, but was pleasantly surprised to see HP do a double click down into exactly when, where and why a customer would want to&amp;nbsp; use Moonshot&amp;hellip;and even more importantly, where not to use Moonshot.&amp;nbsp; This type of clarity was exactly what the audience was looking for in my opinion and Paul Santeler, VP and GM of HP&amp;#8217;s Hyperscale division did an excellent job of it in his break out seminar.&amp;nbsp; A few key points:&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Moonshot is not a replacement for their traditional server systems like the DL/SL line and blades &amp;#8211; it&amp;#8217;s an augmentation.&amp;nbsp; Designed for light weight, scale out workloads and consumable only in a fully loaded chassis (45 Intel&amp;reg; Atom&amp;reg; S family SOC-based cartridges today, with option to quadruple the density with 180 cartridges with the next gen &amp;#8220;Avoton&amp;rdquo; based cartridges coming later this year) &amp;#8211; it&amp;#8217;s a platform targeted at web-scale environments with applications written and optimized for&amp;nbsp; light weight, distributed workloads&lt;/li&gt;&lt;li&gt;The application is &amp;#8220;back in charge&amp;rdquo;.&amp;nbsp; Moonshot is configurable across compute, network and storage based on the application need.&amp;nbsp; Instead of running every DC app on a general purpose server, light weight, scale out applications can run on a flexible, easy to configure infrastructure.&amp;nbsp; One can now optimize across such workloads as static web, batch analytics and simple content delivery, all by turning the knobs on the HP Moonshot system.&amp;nbsp; And although there is only one production cartridge shipping today (based on the world&amp;#8217;s first 64-bit, 6W, data center class SOC - the Atom S 1200), soon there will be a range of cartridges that span software compatible Atom to Xeon&amp;reg; processors to address a broader range of workloads.&lt;/li&gt;&lt;li&gt;Good things come from collaboration.&amp;nbsp; Intel and HP have been working together on Moonshot for a number of years, since the initial design and development stages.&amp;nbsp; We are proud that HP chose Intel to be the lead partner for Moonshot and we will continue to innovate together and deliver more value to our end users.&amp;nbsp; Together we bring decades of experience, a full platform portfolio and a developed ecosystem just waiting to explore Moonshot.&amp;nbsp; The best is yet to come.&lt;/li&gt;&lt;/ul&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;Congrats to HP on a successful show.&amp;nbsp; Big kudos to Meg for weaving in Kevin Bacon into the opening keynote (hit me up on twitter at @RaejeanneS if you want more on this!).&amp;nbsp; And thank you for choosing Intel as your collaboration partner.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;What&amp;#8217;s next for Intel in this space? Well, my whirlwind travel schedule continues next week at the &lt;a class="jive-link-external-small" href="http://www.opendatacenteralliance.org/forecast2013" target="_blank"&gt;ODCA Forecast&amp;#8217;13&lt;/a&gt; and &lt;a class="jive-link-external-small" href="http://event.gigaom.com/structure/" target="_blank"&gt;GigaOm Structure&amp;#8217;13&lt;/a&gt; events in San Francisco, a great confluence of enterprise experts talking cloud adoption at Forecast and industry insiders discussing the future of cloud infrastructure innovation at Structure.&amp;nbsp; In the meantime, for the latest on what Intel is doing with our products, solutions and customers, please visit the &lt;a class="jive-link-external-small" href="http://www.intel.com/content/www/us/en/it-management/intel-it/it-managers.html?wapkw=IT%20Center" target="_blank"&gt;IT Center on Intel.com&lt;/a&gt;.&lt;/p&gt;&lt;/div&gt;&lt;!-- [DocumentBodyEnd:d88311f5-eb35-4dd7-972e-ff0f5e3fd8c7] --&gt;</description>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">data_center</category>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">xeon</category>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">green_technology</category>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">green_it</category>
      <pubDate>Fri, 14 Jun 2013 20:31:28 GMT</pubDate>
      <author>webadmin@intel.com</author>
      <guid>http://communities.intel.com/community/datastack/blog/2013/06/14/hp-moonshot-lifting-off-with-intel-atom-and-xeon-processors</guid>
      <dc:date>2013-06-14T20:31:28Z</dc:date>
      <clearspace:dateToText>1 day, 1 hour ago</clearspace:dateToText>
      <clearspace:objectType>0</clearspace:objectType>
    </item>
    <item>
      <title>Partners Extend the Benefits of Intel Distribution for Apache Hadoop Software</title>
      <link>http://communities.intel.com/community/datastack/blog/2013/06/13/partners-extend-the-benefits-of-intel-distribution-for-apache-hadoop-software</link>
      <description>&lt;!-- [DocumentBodyStart:7801d7fd-79cc-4cc3-8c7b-e3e7d3fb1ee7] --&gt;&lt;div class="jive-rendered-content"&gt;&lt;p&gt;There&amp;#8217;s more to the Intel&lt;sup&gt;&amp;reg;&lt;/sup&gt; Distribution for Apache Hadoop* software (IDH) than meets the eye.&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;Intel is building an entire partner ecosystem around IDH that extends optimized hardware, data storage and analytics support to help ensure that IT orgs get the value and intelligence they need out of big data.&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;IDH helps organizations store and analyze big data by providing an open source data management platform with the hardware-level security, manageability and performance acceleration features of Intel&lt;sup&gt;&amp;reg;&lt;/sup&gt; Xeon&lt;sup&gt;&amp;reg;&lt;/sup&gt; processors. It also comes with technical support, training, and professional services from Intel. IDH is the distribution of choice for enterprises seeking to deploy open source Hadoop for processing big data at multi-petabyte scale.&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;However, the compute intensive processes of Hadoop require a combination of hardware and software optimizations and specialized analytics and visualization tools to deliver the insights, scale and ROI demanded of big data. In addition to server architectures and IDH, Intel provides a number of tools to help manage Hadoop, including Intel&lt;sup&gt;&amp;reg;&lt;/sup&gt; Graphbuilder, which enables distributed graph analytics on top of Hadoop. However, Intel turns to its partners to help create a larger vendor ecosystem of optimized and co-engineered solutions and to build a more complete IDH computing environment.&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;RainStor Takes Complexity, Cost out of Big Data Querying&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;I estimate that up to 90 percent of organizations that have deployed Hadoop clusters today are just using them as an ETL offload. In other words, they are using Hadoop to store big content, but haven&amp;#8217;t gotten around to taking advantage of Hadoop&amp;#8217;s benefits as an engine for big data analysis. That&amp;#8217;s where RainStor comes in. RainStor helps organizations achieve business insights at lower costs than other data stores, and uses familiar query and BI tools to reduce the complexity of big data analytics.&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;Rainstor for Hadoop* is a big data infrastructure that runs natively on the Intel Distribution for Hadoop. It&amp;#8217;s made to not just handle the velocity and growth of today&amp;#8217;s data, but also tackle the changing nature of data itself&amp;#8212;log files, web clickstreams, Twitter content, machine generated data, and more, all in great volume.&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;But most IT admins don&amp;#8217;t want to become query specialists or data scientists just to analyze their big data. Using RainStor, they can run real SQL queries on Hadoop stores, taking much of the complexity out of big data analysis. Rainstor is standards-based, and it uses specialized JDBC and ODBC drivers to peer into persistent Hadoop data. It can then run queries through the data using the familiar SQL environment&amp;#8212;and DBAs don&amp;#8217;t have to be stuck with HiveQL. To some that could be like telling a C++ developer that JavaScript coding requires no further training.&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;
Rainstor also offers data compression and de-duplication capabilities that can lower the storage footprint by as much as 20 to 40 times. These compression features not only reduce the hardware needs and costs of big data, they also speed up querying. &lt;a class="jive-link-external-small" href="http://rainstor.com/" target="_blank"&gt;Rainstor&lt;/a&gt; is a great example of how partners are building out the Intel Hadoop ecosystem with innovative technologies. &amp;nbsp;Learn more about &lt;a class="jive-link-external-small" href="http://rainstor.com/2013_new/wp-content/uploads/2013/04/RainStor-For-Hadoop-Solution-Brief.pdf" target="_blank"&gt;Rainstor for Hadoop,&lt;/a&gt; and follow Tim and the growing #Intel #BigData Hadoop community at @TimIntel.


&lt;/div&gt;&lt;!-- [DocumentBodyEnd:7801d7fd-79cc-4cc3-8c7b-e3e7d3fb1ee7] --&gt;</description>
      <pubDate>Thu, 13 Jun 2013 16:57:28 GMT</pubDate>
      <author>webadmin@intel.com</author>
      <guid>http://communities.intel.com/community/datastack/blog/2013/06/13/partners-extend-the-benefits-of-intel-distribution-for-apache-hadoop-software</guid>
      <dc:date>2013-06-13T16:57:28Z</dc:date>
      <clearspace:dateToText>6 days, 4 hours ago</clearspace:dateToText>
      <clearspace:objectType>0</clearspace:objectType>
    </item>
    <item>
      <title>Intel Atom S1200 helps HP Moonshot to lift off in Vegas.</title>
      <link>http://communities.intel.com/community/datastack/blog/2013/06/07/intel-atom-s1200-helps-hp-moonshot-to-lift-off-in-vegas</link>
      <description>&lt;!-- [DocumentBodyStart:f8e03fb3-3a37-4e4d-99c6-52fcd0d4784d] --&gt;&lt;div class="jive-rendered-content"&gt;&lt;p&gt;HP and Intel have been collaborating for a few years on project Moonshot to bring new levels of density, efficiency and TCO for light weight web workloads such as static web and dedicated hosting.&amp;nbsp; Recently, HP unveiled their first generation Moonshot systems and I am excited to see that the first and only production &lt;a class="jive-link-external-small" href="http://www.hp.com/go/moonshot" target="_blank"&gt;HP ProLiant Moonshot servers&lt;/a&gt; available today are based on the &lt;a class="jive-link-external-small" href="http://newsroom.intel.com/docs/DOC-3172" target="_blank"&gt;Intel Atom S1200 processor family.&lt;/a&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;HP chose to lead with the &lt;a class="jive-link-external-small" href="http://www.intel.com/content/www/us/en/servers/microservers.html" target="_blank"&gt;Intel Atom processor&lt;/a&gt; for many reasons.&amp;nbsp; First, HP and Intel have a long history of collaboration and we have brought many innovations to market first on Intel and HP&amp;#8217;s platforms.&amp;nbsp; Second, the Intel Atom S1200 processor is the industry&amp;#8217;s only available 64-bit SoC with critical data center class features such as full 64-bit software ecosystem support, ECC and &lt;a class="jive-link-external-small" href="http://www.intel.com/content/www/us/en/virtualization/intel-virtualization-transforms-it.html" target="_blank"&gt;Intel Virtualization Technology&lt;/a&gt; - all within an ultra-low power 6W TDP.&amp;nbsp; This means that today the ProLiant Moonshot servers using Atom S1200 can drop into any environment and software applications will run seamlessly on the server without porting needed.&amp;nbsp; The lower power you want, with the software applications you need.&amp;nbsp; Third, this SoC was designed for targeted lightweight web scale workloads, including low-end dedicated hosting, simple content delivery, and offline batch analytics making Intel Atom S1200 the perfect SoC solution for HP Moonshot&amp;#8217;s target markets.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;Moonshot servers with the Intel Atom S1200 are shipping to customers today and receiving great reviews.&amp;nbsp; We look forward to sharing more results from customers going forward as these implementations go public.&amp;nbsp; We also look forward to seeing Moonshot systems that will take advantage of higher density HP ProLiant Moonshot servers using Intel&amp;#8217;s next generation Atom SoC coming later this year.&amp;nbsp; The next gen servers will be built on Intel&amp;#8217;s 2nd generation 64-bit Intel Atom SoC, code named &amp;#8220;Avoton&amp;rdquo;.&amp;nbsp; Avoton is built on Intel&amp;#8217;s leading 3D tri-gate 22-nanometer (nm) process technology and is based on a new microarchitecture codenamed &amp;#8220;Silvermont&amp;rdquo;.&amp;nbsp; It will feature an integrated Ethernet fabric controller and deliver improvements over today&amp;#8217;s Intel Atom S1200 in performance per watt and energy efficiency through a combination of new capabilities, new microarchitecture and leadership manufacturing technology. Avoton is now being sampled to customers and the first systems are expected to be available in second half of 2013. Moonshot servers using Avoton will quadruple the density (4 Avoton SoCs per server) vs. the current generation just announced using Intel Atom S1200.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;2013 is and will be a great year for Intel and HP Moonshot. We have not only enabled the first Moonshot system to lift-off but with Avoton we will also bring HP Moonshot&amp;#8217;s customers a revolution in energy efficiency and performance per watt to drive major TCO improvements when processing lightweight web scale workloads.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;Learn all about it at this year&amp;#8217;s #HPDiscover in Vegas June 11-13. Sign up for the exclusive NDA session BB BB4355 on Wednesday, June 12th at 12:45PM in Murano 3206 and follow all the action on Twitter at @IntelITS.&lt;/p&gt;&lt;/div&gt;&lt;!-- [DocumentBodyEnd:f8e03fb3-3a37-4e4d-99c6-52fcd0d4784d] --&gt;</description>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">data_center</category>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">green_technology</category>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">green_it</category>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">cloud_computing</category>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">consolidated_server</category>
      <pubDate>Fri, 07 Jun 2013 14:39:11 GMT</pubDate>
      <author>webadmin@intel.com</author>
      <guid>http://communities.intel.com/community/datastack/blog/2013/06/07/intel-atom-s1200-helps-hp-moonshot-to-lift-off-in-vegas</guid>
      <dc:date>2013-06-07T14:39:11Z</dc:date>
      <clearspace:dateToText>1 week, 5 days ago</clearspace:dateToText>
      <clearspace:objectType>0</clearspace:objectType>
    </item>
    <item>
      <title>19 Years of Data Center</title>
      <link>http://communities.intel.com/community/datastack/blog/2013/06/06/19-years-of-data-center</link>
      <description>&lt;!-- [DocumentBodyStart:c6c30816-2d40-4c01-8f50-8262b30e0b2a] --&gt;&lt;div class="jive-rendered-content"&gt;&lt;p&gt;I woke up this morning and realized that yesterday marked my 19th year at Intel.&amp;nbsp; Just as turning 29 and 39 freaked me out more than the actual decade birthday&amp;nbsp; in my life, year 19 at Intel woke me up that soon I&amp;#8217;d be 2 decades into a career and quite honestly, most of it went by so fast&amp;nbsp; that I can&amp;#8217;t quite say how I got here some days.&amp;nbsp; Should I have tried other companies/industries, should I have tried other paths than marketing or should I have ventured beyond the data center ?!&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;So, I did some deep contemplation during my [very long] commute this morning and here is what I came up with&amp;hellip;.I love my job, I am proud to work for Intel and despite the [very long] commute &amp;#8211; one couldn&amp;#8217;t ask for a better employer and career than with Intel&amp;#8217;s data center group.&amp;nbsp; But, I also realized that there were a few things along the way that helped me stay &amp;#8220;me&amp;rdquo; as I&amp;#8217;ve gone through it.&amp;nbsp; Listed in increasing or of importance:&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;ol&gt;&lt;li&gt;I will not wear a "logo&amp;#8217;d" polo shirt, or for that matter any logo&amp;#8217;d apparel, as I work.&amp;nbsp; Polo shirts make me look like a teenage boy and quite honestly, high tech logos really aren&amp;#8217;t my style.&amp;nbsp; I will also not wear a sweater set or sensible shoes to work.&amp;nbsp; If I am not meeting with a customer, I wear jeans.&amp;nbsp; Period.&amp;nbsp; Even better when paired with 4 inch heels. And don&amp;#8217;t even get me started on the &amp;#8220;lanyard&amp;rdquo; concept&amp;hellip;&lt;/li&gt;&lt;li&gt;I will not apologize or make an excuse why I have spent most of my career in the Enterprise / Data Center domain.&amp;nbsp; If you think consumer, tablets, phones, etc are &amp;#8220;sexier&amp;rdquo;&amp;hellip;well you obviously haven&amp;#8217;t really looked into the big cloud service providers, rack disaggregation or what is powering the voice/gesture recognition technologies on those sexy phone and clients.&amp;nbsp; Game changing tech trends that are rooted in the data center&lt;/li&gt;&lt;li&gt;I choose my manager as much as I choose my job.&amp;nbsp; During my 19 years I have had some exceptional managers.&amp;nbsp; The commonality across them &amp;#8211; I honestly believe they care as much about me as they care about my output.&amp;nbsp; They&amp;#8217;ve had my back and in return I will always have theirs.&amp;nbsp; They also build their organizations with that culture &amp;#8211; respect, no tolerance for politics, investment in career development and support for work/life balance.&amp;nbsp; I am now a working mom with elementary age twins and I travel too much.&amp;nbsp; Did I also mention the very long commute?&amp;nbsp; That means I only have time to work on real work.&amp;nbsp; I couldn&amp;#8217;t do what I do at home and at work if I had to worry about watching my back or battling bureaucracy.&amp;nbsp; &lt;/li&gt;&lt;/ol&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;So although my life has dramatically changed over 19 years, and Intel with it, I am glad to say that I have landed in a highly relevant, fast paced, super cool technology space where I feel supported to be ME &amp;#8211; data center junkie,&amp;nbsp; lover of new tech, Intel loyalist and devoted mom who never wants to miss a talent show or soccer game.&amp;nbsp; &lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;Question to you &amp;#8211; &lt;/p&gt;&lt;p style="padding-left: 30px;"&gt;&lt;em&gt;How have you struck balance and kept interest in your career over one, two or more decades in the data center business? &lt;/em&gt;&lt;/p&gt;&lt;/div&gt;&lt;!-- [DocumentBodyEnd:c6c30816-2d40-4c01-8f50-8262b30e0b2a] --&gt;</description>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">data_center</category>
      <pubDate>Thu, 06 Jun 2013 16:58:22 GMT</pubDate>
      <author>webadmin@intel.com</author>
      <guid>http://communities.intel.com/community/datastack/blog/2013/06/06/19-years-of-data-center</guid>
      <dc:date>2013-06-06T16:58:22Z</dc:date>
      <clearspace:dateToText>1 week, 6 days ago</clearspace:dateToText>
      <clearspace:objectType>0</clearspace:objectType>
    </item>
    <item>
      <title>How to Optimize Hadoop Performance on Intel Architecture</title>
      <link>http://communities.intel.com/community/datastack/blog/2013/06/05/how-to-optimize-hadoop-performance-on-intel%C3%A2-architecture</link>
      <description>&lt;!-- [DocumentBodyStart:4a276f27-4da1-4b28-bda3-2509a38e817a] --&gt;&lt;div class="jive-rendered-content"&gt;&lt;p&gt;If you are dealing with data analytics, you have probably heard about or are using Apache Hadoop*. By leveraging the compute power of multiple nodes in a cluster, Hadoop can analyze practically unlimited volumes of data&amp;#8212;both structured and unstructured&amp;#8212;with lightning-fast speed.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;But did you know you can optimize Hadoop to deliver even better performance on Intel architecture? The key is to tune the underlying Java so that it takes advantage of capabilities in Intel hardware. When you do that, you can expect to see up to 70 percent faster performance on Hadoop sort operations. Keep reading to learn how.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;h2&gt;Understanding Hadoop&amp;#8217;s Java Foundation&lt;/h2&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;A Hadoop cluster comprises a master node and multiple slave nodes, where data is stored and where analytics processing occurs. An incoming analytics request invokes several Java* services that enable efficient replication and large-scale analytics across nodes in a cluster.&lt;/p&gt;&lt;p&gt;Processing begins with two Java services on the master node. These two services then communicate with additional Java services on the slave node where the data are stored&amp;#8212;see the image below for a simple view of what&amp;#8217;s going on.&lt;/p&gt;&lt;p style="text-align: center;"&gt;&lt;/p&gt;&lt;p&gt;&lt;a href="http://kapost-files-prod.s3.amazonaws.com/uploads/direct/20130605-0020-25-6619/Java_Optimization_IA.png"&gt;&lt;img src="http://kapost-files-prod.s3.amazonaws.com/uploads/direct/20130605-0020-25-6619/Java_Optimization_IA.png" style="display: block; margin-left: auto; margin-right: auto;"/&gt;&lt;/a&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;When MapReduce* completes all the assigned analytics tasks, it returns the results to the master node, which compiles results from multiple nodes and returns them to the requester.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Important note: &lt;/strong&gt;Hadoop spawns a new Java Virtual Machine* (JVM) for each MapReduce function on each slave node. This means that a large analytics job can result in the creation of thousands of individual JVMs. Because Hadoop does not share memory resources across nodes, each JVM and Java service must perform optimally. Reduced performance on any single node can hamper data analytics performance across the cluster.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;h2&gt;Tune Java to Optimize Hadoop Performance&lt;/h2&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;Given this foundation of Java services, it is easy to see that optimizing Java for Intel&amp;reg; architecture can deliver significant Hadoop performance enhancements. Optimizations are built into Java and Intel architecture, which make these improvements easy to achieve. When Intel releases a new microarchitecture and platform, Intel and Oracle software engineers work together to tune the JVM to take advantage of the new hardware advances. These optimizations can provide faster Hadoop performance on each Intel-based node as well as on the entire cluster.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;How much faster? Up to 50 percent faster on a TeraSort* benchmark test and 70 percent faster on the Hadoop Sort benchmark. Read the white paper, &lt;a class="jive-link-external-small" href="http://intel.ly/14pk7ZS" target="_blank"&gt;Optimizing Java and Apache Hadoop for Intel Architecture&lt;/a&gt;, to see the details.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;Since 2007, Oracle and Intel have improved Java performance up to 14 times by tying specific Java optimizations to advancements in the underlying hardware. Read the paper to see a complete list of optimizations, but here&amp;#8217;s a taste of some that are important to Hadoop performance:&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Fast CRC increases file checksum and compression/decompression checksum, which increases Hadoop network and file system performance.&lt;/li&gt;&lt;/ul&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Large-page usage increases performance of large analytics jobs.&lt;/li&gt;&lt;/ul&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Intel&amp;reg; Advanced Vector Extensions improve the performance of MapReduce operations that contain array and string manipulation, such as sub-string or character searches; also improves integer and floating point calculations.&lt;/li&gt;&lt;/ul&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Intel&amp;reg; Integrated Performance Primitives (Intel&amp;reg; IPP) compression increases compression performance, which reduces network and disk input/output (I/O) across the Hadoop cluster.&lt;/li&gt;&lt;/ul&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;If you&amp;#8217;re using Hadoop, or analytics on large data sets are part of your job, then &lt;a class="jive-link-external-small" href="http://intel.ly/14pk7ZS" target="_blank"&gt;check out the white paper&lt;/a&gt; for a full description of the Java optimizations that are available to you to enhance Hadoop performance on Intel architecture.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;Follow me &lt;a class="jive-link-external-small" href="http://twitter.com/timintel" target="_blank"&gt;@TimIntel&lt;/a&gt;.&lt;/p&gt;&lt;/div&gt;&lt;!-- [DocumentBodyEnd:4a276f27-4da1-4b28-bda3-2509a38e817a] --&gt;</description>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">data_center_management</category>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">big_data</category>
      <pubDate>Thu, 06 Jun 2013 00:11:20 GMT</pubDate>
      <author>webadmin@intel.com</author>
      <guid>http://communities.intel.com/community/datastack/blog/2013/06/05/how-to-optimize-hadoop-performance-on-intel%C3%A2-architecture</guid>
      <dc:date>2013-06-06T00:11:20Z</dc:date>
      <clearspace:dateToText>1 week, 4 days ago</clearspace:dateToText>
      <clearspace:objectType>0</clearspace:objectType>
    </item>
    <item>
      <title>Go Ahead and Virtualize IBM InfoSphere* Information Server</title>
      <link>http://communities.intel.com/community/datastack/blog/2013/05/23/go-ahead-and-virtualize-ibm-infosphere-information-server</link>
      <description>&lt;!-- [DocumentBodyStart:892c1ab3-f6be-4d68-9196-68d59eaee8df] --&gt;&lt;div class="jive-rendered-content"&gt;&lt;p&gt;If you want to virtualize IBM InfoSphere* but have put it off, I have some new information that might give you the bump you need to move forward: How does 98 percent throughput sound?&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;In a nutshell: extensive testing shows that InfoSphere can perform very well when virtualized. How well? We achieved between 90 and 98 percent of the throughput that we typically see in a physical environment, with an overhead tax of only about 10 percent on I/O-intensive workloads.&lt;/p&gt;&lt;p&gt;You can get all of the configuration and results details in the &lt;a class="jive-link-external-small" href="http://intel.ly/10Bydme" target="_blank"&gt;white paper about InfoSphere virtualization here&lt;/a&gt;, but I&amp;#8217;ll hit the highlights for you in this post.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;While virtualizing InfoSphere Information Server is cool, here&amp;#8217;s the icing on the cake: virtualization with VMware vSphere* is now supported as an IBM PureApplication* pattern. &amp;#8220;Patterns&amp;rdquo; are a growing ecosystem of software stacks for IBM PureSystems* running on Intel&amp;reg; Xeon&amp;reg; processors. Hence, adding the leading VMware vSphere 5.1 stack indeed broadens the use of these rich appliance systems versus other industry systems. PureSystems are innovative, quick-to-uptime, and run industry-standard software stacks.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;h2&gt;Testing InfoSphere Virtualization&lt;/h2&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;IBM, VMware, and Intel teamed up recently to see how IBM InfoSphere performs when it&amp;#8217;s virtualized. Specifically, we looked at the runtime performance of:&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;IBM InfoSphere DataStage* 8.7&lt;/li&gt;&lt;li&gt;Running on VMware vSphere 5.0&lt;/li&gt;&lt;li&gt;On a server powered by the Intel Xeon processor E7 processor family&lt;/li&gt;&lt;/ul&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;The tests found that InfoSphere DataStage scaled smoothly as we cranked up the number of virtual CPUs (vCPUs), while clocking throughput at up to 98 percent of that found in a physical environment.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;More good news: we saw only a slight performance difference when using VMFS (Virtual Machine File System) versus RDM (Raw Device Mapping) data stores. This means that you can reap the benefits of VMFS for storage provisioning with virtualized workloads without concern over performance.&lt;/p&gt;&lt;p&gt;Now let&amp;#8217;s get down into some of the details so you can put these results in context.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;h2&gt;InfoSphere Virtualization Test Configuration&lt;/h2&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;h3&gt;Server&lt;/h3&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;We used an IBM* System x3850 X5 with a network-attached IBM System Storage* DS5300:&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;Four socket system&lt;/li&gt;&lt;li&gt;40 physical cores&lt;/li&gt;&lt;li&gt;Intel Xeon E7-8870 processors&lt;/li&gt;&lt;li&gt;Configured as &amp;#8220;optimized for performance&amp;rdquo;&lt;/li&gt;&lt;/ul&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;To get a good comparison between physical and virtual environments, we controlled RAM and processor availability to the native environment so that we could match the virtual environment as closely as possible. We enabled all of the virtualization-related options, except for Intel&amp;reg; Hyper-Threading Technology (Intel&amp;reg; HT Technology), which we disabled to simplify the comparison.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;h3&gt;IBM InfoSphere&lt;/h3&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;We installed all InfoSphere DataStage components on one physical server in the native environment and on one virtual machine in the virtualized environment:&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;ul&gt;&lt;li&gt;IBM WebSphere* 8.1 Application Server (WAS)&lt;/li&gt;&lt;li&gt;XMeta repository&lt;/li&gt;&lt;li&gt;DataStage engine&lt;/li&gt;&lt;/ul&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;We found that DataStage engine running in a single virtual machine had higher throughput. The specific reasons for this result weren&amp;#8217;t clear, but we&amp;#8217;re hopeful that further testing will provide more detail.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;h2&gt;ETL Workload Virtualization Test Results&lt;/h2&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;As expected, throughput in both environments increased as the number of processor cores increased, with performance in the two environments varying only between 2 and 10 percent. The overhead for the virtual environment was only 10 percent for all tested configurations.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;h2&gt;Host Server Memory-Management Test Results&lt;/h2&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;We also wanted to see what would happen with DataStage performance when overcommitting the host server. We measured a 34 percent drop in throughput when the system was 100 percent committed, and throughput continued to drop as the host processors were further committed.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;Working with VMware engineers, we determined that this drop was because the eight vCPUs were not mapping neatly onto the 10-core, Non-Uniform Memory Access (NUMA)&amp;#8211;node design of the Intel Xeon processor E7 family&amp;#8217;s microarchitecture.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;To work around this issue, you could use a five vCPU configuration instead of an eight vCPU configuration. Five vCPUs would map well to the 10-core NUMA nodes. Another workaround would be to turn off NUMA scheduling in BIOS or VMware vSphere, which would allow all of the CPU cores to be used, though you would see a lag in memory performance. The lesson? Understand NUMA configuration to optimize VM performance.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;We repeated this over-commitment test with the VMware vSphere &amp;#8220;reserve CPU&amp;rdquo; option for the DataStage guest set to maximum, and the result showed minimal performance impact. However, this move can potentially impact the performance of the other non-reserved virtual machines running on the same host.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;If your system is overcommitted and you&amp;#8217;re not seeing the DataStage runtime performance you want, the best option would be to increase the host system capacity or to move the VM to another host.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;h2&gt;Go Ahead and Virtualize InfoSphere&lt;/h2&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;Our tests showed InfoSphere Information Server runtime performed very well in a virtualized environment hosted on a platform powered by the Intel Xeon processor E7 family and using VMware vSphere 5.0. So if you&amp;#8217;ve wanted to virtualize your InfoSphere applications but were concerned about performance, now might be the time.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;a class="jive-link-external-small" href="http://intel.ly/10Bydme" target="_blank"&gt;Take a look at the white paper&lt;/a&gt; to see the test configuration, procedure, and results. And follow me on Twitter, @TimIntel, to get more useful InfoSphere and DB2* tidbits. You can also get the latest news and technology updates at the joint &lt;a class="jive-link-external-small" href="http://ibm.co/107052J" target="_blank"&gt;Intel and IBM DB2 website&lt;/a&gt;.&lt;/p&gt;&lt;/div&gt;&lt;!-- [DocumentBodyEnd:892c1ab3-f6be-4d68-9196-68d59eaee8df] --&gt;</description>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">virtualization</category>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">virtual_server</category>
      <pubDate>Thu, 23 May 2013 15:42:13 GMT</pubDate>
      <author>webadmin@intel.com</author>
      <guid>http://communities.intel.com/community/datastack/blog/2013/05/23/go-ahead-and-virtualize-ibm-infosphere-information-server</guid>
      <dc:date>2013-05-23T15:42:13Z</dc:date>
      <clearspace:dateToText>3 weeks, 6 days ago</clearspace:dateToText>
      <clearspace:objectType>0</clearspace:objectType>
    </item>
    <item>
      <title>S5000XVN and &gt;16gb RAM</title>
      <link>http://communities.intel.com/thread/30914</link>
      <description>&lt;!-- [DocumentBodyStart:762e1759-a35d-4813-be9d-96dd2f1205e1] --&gt;&lt;div class="jive-rendered-content"&gt;&lt;p&gt;Got an S5000XVN running dual 2.83ghz E5440's and currently 16gb of Hynix ECC 667, 4x 4gb PC2 5300. Damn thing is a beast!&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;I got 4 more identical 4gb memory sticks and when I put them in, i get the solid amber light of death at POST. I'm running in RAS Disabled and the sticks are A1 A2 and B1 B2 (the blue ones)&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;If I put any more in it, even 1gb sticks, it hangs. I know these do 32gb and I would like to double up to host a website. In the documentation it states that it only supports 16gb as mirrored but im not running them mirrored, at least I don't think I am.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;Also the tech documentation was kind of vague on whether or not it supports X-series cpus. Would be nice to have 16 logical processors (as if 8 wasn't enough)&lt;/p&gt;&lt;/div&gt;&lt;!-- [DocumentBodyEnd:762e1759-a35d-4813-be9d-96dd2f1205e1] --&gt;</description>
      <pubDate>Fri, 03 Aug 2012 22:42:49 GMT</pubDate>
      <author>webadmin@intel.com</author>
      <guid>http://communities.intel.com/thread/30914</guid>
      <dc:date>2012-08-03T22:42:49Z</dc:date>
      <clearspace:dateToText>3 weeks, 6 days ago</clearspace:dateToText>
      <clearspace:replyCount>1</clearspace:replyCount>
      <clearspace:objectType>0</clearspace:objectType>
    </item>
    <item>
      <title>Server Storage Caching Considerations</title>
      <link>http://communities.intel.com/community/datastack/blog/2013/05/22/server-storage-caching-considerations</link>
      <description>&lt;!-- [DocumentBodyStart:625584df-048d-4600-9e76-7e7d31af26fe] --&gt;&lt;div class="jive-rendered-content"&gt;&lt;p&gt;Caching &lt;em&gt;is&lt;/em&gt; storage. Here&amp;#8217;s how &lt;a class="jive-link-external-small" href="http://en.wikipedia.org/wiki/Cache_(computing)" target="_blank"&gt;Wikipedia&lt;/a&gt; defines caching:&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p style="padding-left: 30px;"&gt;&amp;#8220;&amp;hellip; .a &lt;strong&gt;cache&lt;/strong&gt; (&lt;span class="nowrap1"&gt;&lt;span lang="EN"&gt;pron.:&lt;span class="nowrap1"&gt; &lt;span class="ipa"&gt;&lt;a class="jive-link-external-small" href="http://en.wikipedia.org/wiki/Help:IPA_for_English" target="_blank"&gt;/&lt;/a&gt;&lt;a class="jive-link-external-small" href="http://en.wikipedia.org/wiki/Help:IPA_for_English#Key" target="_blank"&gt;ˈ&lt;/a&gt;&lt;a class="jive-link-external-small" href="http://en.wikipedia.org/wiki/Help:IPA_for_English#Key" target="_blank"&gt;k&lt;/a&gt;&lt;a class="jive-link-external-small" href="http://en.wikipedia.org/wiki/Help:IPA_for_English#Key" target="_blank"&gt;&amp;aelig;&lt;/a&gt;&lt;a class="jive-link-external-small" href="http://en.wikipedia.org/wiki/Help:IPA_for_English#Key" target="_blank"&gt;ʃ&lt;/a&gt;&lt;a class="jive-link-external-small" href="http://en.wikipedia.org/wiki/Help:IPA_for_English" target="_blank"&gt;/&lt;/a&gt; &lt;a class="jive-link-external-small" href="http://en.wikipedia.org/wiki/Wikipedia:Pronunciation_respelling_key" target="_blank"&gt;&lt;span class="nocaps"&gt;&lt;strong&gt;&lt;em&gt;KASH&lt;/em&gt;&lt;/strong&gt;&lt;/span&gt;&lt;/a&gt;) is a component that transparently stores data so that future requests for that data can be served faster.&amp;rdquo; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="padding-left: 30px;"&gt;&lt;span class="nowrap1"&gt;&lt;span lang="EN"&gt;&lt;span class="nowrap1"&gt;&lt;span class="ipa"&gt;&lt;br/&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span class="nowrap1"&gt;&lt;span lang="EN"&gt;&lt;span class="nowrap1"&gt;&lt;span class="ipa"&gt; With the data explosion and consumers creating content and wanting to access it immediately, caching is becoming more and more important.&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;My colleague, Susan Bobholz is a Marketing Director in Intel&amp;#8217;s Data Center Software Division and talks about the considerations for server storage caching.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;Right now, one of the hottest storage topics is storage caching. It seems hardly a week goes by without some type of caching software showing up in the press.&amp;nbsp; I thought I&amp;#8217;d spend a bit of time talking about this trend and provide some things to think about when choosing caching software.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;In many datacenters, the hottest, most frequently accessed data is stored on 15K serial attached SCSI (SAS) hard drives. But those hard drives can become a bottleneck because they are mechanical devices with moving parts.&amp;nbsp; They simply can&amp;#8217;t move fastest enough to keep up with some application demands.&amp;nbsp; One solution to resolve this is to replace all those 15K SAS hard drives with Solid State Drives (SSDs) but this can be an expensive undertaking.&amp;nbsp; The beauty of storage caching is that it protects your investment in&amp;nbsp; hard drives, because your application performance is improved without replacing all those hard drives with SSDs.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;To be as simple as possible, storage caching allows the hottest, most important data to be stored in a SSD instead of hard drives, allowing that data to be accessed significantly faster.&amp;nbsp; Often caching is implemented as a server application, but sometimes it&amp;#8217;s actually part of the firmware on a RAID HBA&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;So you&amp;#8217;ve decided you want to implement storage caching.&amp;nbsp; There are several caching options out there.&amp;nbsp; Other than cost, what are some key questions to consider about when deciding which to use?&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Where does your cache physically reside?&lt;/strong&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;As mentioned above, some caching solutions are integrated into a RAID HBA.&amp;nbsp; This means that the Cache SSD must be attached to the RAID HBA itself and only data on hard drives connected to that RAID HBA can be accelerated.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;Other caching solutions allow the Cache SSD to be anywhere inside the server itself.&amp;nbsp;&amp;nbsp; This provides additional flexibility as the data being cached can be anywhere on the server - behind a RAID HBA, behind a SAS HBA or even attached to the chipset SATA ports.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;In addition to being able to have the Cache SSD inside the server, some caching solutions allow the Cache SSD to be outside the server, in a SAN or NAS.&amp;nbsp; This is important in virtualized servers as this allows virtual machine migration to occur automatically.&amp;nbsp; The cache remains active while the virtual machine moves from host to host.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;Consider where you want the cache SSD to connect to your server when choosing a caching solution.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Which OSes are supported? &lt;/strong&gt;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;&lt;br/&gt;&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;Think about what OSes exist in your datacenter.&amp;nbsp; Windows?&amp;nbsp; Linux?&amp;nbsp; Virtualized OSes such as VMware ESX or Xen?&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Think about whether it&amp;#8217;s important to have a common caching solution from one vendor across all these environments.&amp;nbsp; Not all caching solutions support all these OSes.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Being able to choose what goes into the cache&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;&lt;br/&gt;&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;This may sound unimportant, but imagine having an SLA with a customer that requires you to deliver the lowest latency to the data associated with that application. What if you could guarantee that specific data was always in the cache, ready to be accessed?&amp;nbsp;&amp;nbsp; Several caching solutions available today offer proprietary ways to pin data into the cache.&amp;nbsp; This is becoming so important that standards bodies such as ANSI T10 are looking into ways to standardize ways to determine whether data should be kept into a cache at all times.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Read caching or write caching?&amp;nbsp;&amp;nbsp; &lt;/strong&gt;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;&lt;br/&gt;&lt;/strong&gt;&lt;/p&gt;&lt;p&gt;Look at the applications you want to accelerate.&amp;nbsp; Do they mostly read data from hard drives or do they mostly write data?&amp;nbsp; Or is it a mix?&amp;nbsp; Some caching solutions are better are accelerate reads, others are better at writes.&amp;nbsp; Choose a caching solution that meets the needs of the applications you want to accelerate.&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Caching algorithms aren&amp;#8217;t all the same&lt;/strong&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;We all learned about Least Recently Used caching in school.&amp;nbsp; Just as the name implies when the cache is full but new data needs to be added to the cache, the data that has been sitting in the cache the longest without being use will be evicted to make room for the new data.&amp;nbsp; This can be an effective algorithm and is very common.&amp;nbsp; But some caching &lt;span style="color: black;"&gt;solutions add intelligence to the caching algorithm and are able to decide to keep specific most popular/active data in the cache longer, protecting it from being evicted by more recent, but less popular data. This reduces the probability that important data is evicted from the cache, improving overall application performance.&lt;/span&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;So, these are just some of the areas to consider when choosing a storage caching solution.&amp;nbsp; What is important to you when you choose a caching solution?&amp;nbsp; Let me know!&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;em&gt;Full disclosure:&amp;nbsp; Intel has its own caching solution:&amp;nbsp; Intel&amp;reg; Cache Acceleration Software that works with Intel&amp;reg; Datacenter SSDs.&amp;nbsp; We think it&amp;#8217;s pretty cool.&amp;nbsp; A 30 day trial is available on intel.com.&amp;nbsp; &lt;/em&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;Susan Bobholz is a Marketing Director in Intel&amp;#8217;s Datacenter Software Division.&amp;nbsp; She&amp;#8217;s been with Intel for 20 years, doing everything from software development to initiative management to product marketing, focused on storage technologies and products.&amp;nbsp; Prior to joining Intel, Susan developed software at Siemens Medical Labs and firmware for Motorola cell phones.&amp;nbsp; She graduated from the University of Wisconsin with a BS in Electrical and Computer Engineering.&amp;nbsp; She holds 3 patents.&lt;/p&gt;&lt;/div&gt;&lt;!-- [DocumentBodyEnd:625584df-048d-4600-9e76-7e7d31af26fe] --&gt;</description>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">data_center</category>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">data_center_management</category>
      <pubDate>Wed, 22 May 2013 14:00:16 GMT</pubDate>
      <author>webadmin@intel.com</author>
      <guid>http://communities.intel.com/community/datastack/blog/2013/05/22/server-storage-caching-considerations</guid>
      <dc:date>2013-05-22T14:00:16Z</dc:date>
      <clearspace:dateToText>4 weeks, 3 hours ago</clearspace:dateToText>
      <clearspace:objectType>0</clearspace:objectType>
    </item>
    <item>
      <title>At TDWI: Finding a middle way between Hadoop and relational data warehousing</title>
      <link>http://communities.intel.com/community/datastack/blog/2013/05/17/at-tdwi-finding-a-middle-way-between-hadoop-and-relational-data-warehousing</link>
      <description>&lt;!-- [DocumentBodyStart:f3e42c36-0ee2-4bb5-b194-f2c5494527ca] --&gt;&lt;div class="jive-rendered-content"&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt;If you&amp;#8217;re thinking big data analytics will solve all your BI challenges, you may be looking at it wrong&amp;hellip;&lt;/span&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&lt;span style="font-size: small;"&gt; &lt;/span&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt;I realized this when I was in Chicago the first week of May, attending The Data Warehousing Institute (TDWI) conference, where the theme was &amp;#8220;Preparing for the Practical Realities of Big Data.&amp;rdquo;&lt;/span&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&lt;span style="font-size: small;"&gt; &lt;/span&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt; &lt;a href="http://kapost-files-prod.s3.amazonaws.com/uploads/direct/20130514-2154-20-9551/Tim_Allen_TDWI_2013.jpg"&gt;&lt;img src="http://kapost-files-prod.s3.amazonaws.com/uploads/direct/20130514-2154-20-9551/Tim_Allen_TDWI_2013.jpg"/&gt;&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt;&lt;br/&gt;There are high expectations around big data at the moment. Many people in marketing, product development, and analytics teams can&amp;#8217;t wait to get their hands on big data intelligence to better understand and target audiences.&lt;/span&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&lt;span style="font-size: small;"&gt; &lt;/span&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt;However, TDWI isn&amp;#8217;t their show. At TDWI, the focus instead is on traditional database administrators, data analysts, and data scientists, and it&amp;#8217;s a very technical conference firmly based on OLAP and OLTP analytics and hands-on issues of data warehousing.&lt;/span&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&lt;span style="font-size: small;"&gt; &lt;/span&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt;With this in mind, I attended the &lt;a class="jive-link-external-small" href="http://www.youtube.com/watch?v=xKrw2TKfj4w" target="_blank"&gt;keynote address by Ken Rudin,&lt;/a&gt; director of Analytics at Facebook&amp;#8212;a leader in cool, cutting-edge big data processing and analysis addressing a conference of (what some would consider) old-school DBAs. The message from Rudin, who has held senior leadership positions at Zynga, Salesforce.com, and Oracle, was fascinating: Don&amp;#8217;t get caught in the tyranny of either/or when it comes to data analysis&amp;#8212;businesses need both traditional relational database processing and Hadoop*-based big data analysis.&lt;/span&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&lt;span style="font-size: small;"&gt; &lt;/span&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt;@krudin said that Facebook started out relying almost solely on Hadoop and big data when the social media giant first launched, but now is increasingly incorporating OLAP and OLTP processes into its analytics. Hadoop is best at exploring huge data sets&amp;#8212;putting all the data into one system to discover patterns. Traditional relational data analytics is best at business, looking at focused data to derive metrics and more actionable, granular analysis. Both are valuable technologies: which one is best depends on what kind of impact you are looking for.&lt;/span&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&lt;span style="font-size: small;"&gt; &lt;/span&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt;So the question is not, how do you get from old-school to cutting edge as soon as possible. Rather, ask which technology is right to generate impact out of data.&lt;/span&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&lt;span style="font-size: small;"&gt; &lt;/span&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt;This was a conclusion that appealed to many at TDWI, as several people I spoke to registered a bit of skepticism about the value of Hadoop as an engine for analytics. For them, the main attraction of Hadoop is its potential to act as a backend data storage mechanism, where unstructured data can be warehoused.&lt;/span&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&lt;span style="font-size: small;"&gt; &lt;/span&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt;Then the question becomes: What&amp;#8217;s the best way to integrate data stored in Hadoop into a traditional OLAP or OLTP infrastructure for processing? The answer is just around the corner.&lt;/span&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&lt;span style="font-size: small;"&gt; &lt;/span&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt;At the SAP booth, I presented a discussion of the newly released joint solution from SAP and Intel that has optimized Intel&lt;sup&gt;&amp;reg;&lt;/sup&gt; Distribution of Apache Hadoop* software for the SAP HANA* in-memory database. Using SAP Smart Data Access* (watch for availability in coming weeks), this big data solution is able to leverage &lt;em&gt;all &lt;/em&gt;types of data for processing in analytical applications.&lt;/span&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&lt;span style="font-size: small;"&gt; &lt;/span&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt;And since it&amp;#8217;s built on Intel&lt;sup&gt;&amp;reg;&lt;/sup&gt; architecture, and leverages the full power of Intel&lt;sup&gt;&amp;reg;&lt;/sup&gt; Xeon&lt;sup&gt;&amp;reg;&lt;/sup&gt; E7 processors, HANA has hardware-enhanced performance and security built in, with solid-state drives and cache acceleration for blazing speed and stability. Watch &lt;a class="jive-link-external-small" href="http://www.youtube.com/watch?v=vBJt8OQqq84" target="_blank"&gt;this interview&lt;/a&gt; from TDWI to learn more about SAP HANA and how the database helps address big data challenges.&lt;/span&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&lt;span style="font-size: small;"&gt; &lt;/span&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt;If you&amp;#8217;re looking for the technology to get the most impact out of analytics, look no further.&lt;/span&gt;&lt;/p&gt;&lt;p style="min-height: 8pt; height: 8pt; padding: 0px;"&gt;&lt;span style="font-size: small;"&gt; &lt;/span&gt;&amp;nbsp;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt;Follow Tim and Twitter @TimIntel and the SAP analytics team at @SAPAnalytics.&lt;/span&gt;&lt;/p&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt;Additionally, @TDWI has some very interesting DW feeds.&lt;/span&gt;&lt;/p&gt;&lt;/div&gt;&lt;!-- [DocumentBodyEnd:f3e42c36-0ee2-4bb5-b194-f2c5494527ca] --&gt;</description>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">business_continuity</category>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">data_center_management</category>
      <category domain="http://communities.intel.com/tags#/?containerType=14&amp;container=2015">big_data</category>
      <pubDate>Fri, 17 May 2013 22:23:37 GMT</pubDate>
      <author>webadmin@intel.com</author>
      <guid>http://communities.intel.com/community/datastack/blog/2013/05/17/at-tdwi-finding-a-middle-way-between-hadoop-and-relational-data-warehousing</guid>
      <dc:date>2013-05-17T22:23:37Z</dc:date>
      <clearspace:dateToText>1 month, 2 days ago</clearspace:dateToText>
      <clearspace:replyCount>1</clearspace:replyCount>
      <clearspace:objectType>0</clearspace:objectType>
    </item>
  </channel>
</rss>

