1 56 57 58 59 60 70 Previous Next

The Data Stack

1,037 Posts

The industry has been waiting for Intel's new product, codenamed Nehalem - and for good reason.  With the recession in full swing and all of us re-thinking our spending habits, we are only spending our hard-earned money on the best and most important products and services ... and everyone is looking for a deal.

 

The Xeon 5500 based server is one of those products that is going to make IT, business leaders, financial accountants and Corporate Executives take notice.

 

Why? ... because an investment in a new server, used to replace nine aging single core servers, can pay for itself in an short as 8 months while dramatically reducing infrastructure space consumption, power & cooling requirements and operating costs.

 

For Finance, that means that a capital investment today, could create a positive cash flow stream within a year

For IT, that means newer, in-warranty servers delivering more performance per server simplifying operations and maintenance, with lower risk of problems

For Facility Managers, that means reduced power / cooling requirements and the ability to consolidate data centers and avoid new construction or retrofits

For the Corporate Executives, this means a more efficient business, with lower costs and more innovation that can spur business growth

 

Read My blog on the Server Room titled "Top 10 Reasons to Buy in a Recession"

View the 8 month payback calculation here

 

Read more about the collaboration of industry solutions in the Server Solutions Insider Forum and view the innovation from intel in the server room

 

Chris

Citrix has anticipated the potential for enhanced performance when using the new Intel Xeon 5500 with XenServer for some time now. We were fortunate to have received one of these gems from Intel in January to do our own comparison tests in our Bedford, MA test lab. We had just completed a series of performance validation tests for virtualizing SQL Server 2008 in a XenServer 5 environment (a very successful test series I might add...look for the technical WP on this shortly at: http://citrix.com/English/ps2/products/documents_onecat.asp?contentid=683148&cid=White+Papers). So we decided to keep the same test environment and repeat a portion of the SQL test series comparing Xeon E5405 vs. Xeon E5570-based XenServer hosts to see what kind of performance improvement we would get.

 

Each of the test servers were configured with a single SQL Server 2008 VM with 4 vCPU cores and 7GB RAM. We used DBHammer for SQL to create a 10 million record database and generated transaction processing workloads to simulate an actual SQL Server 2008 client workload. Workloads were created starting with 200 clients. Each client workload test ran for 30 minute periods. Starting after the first 10 minutes of each test, we began taking measurements using Microsoft performance monitor and continued doing so for the remaining 20 minutes of the test. Measurements of the maximum transactions-per second rate were taken every fifteen seconds until the 30 minute test period had elapsed. Increments of 200 clients were added until the maximum average CPU utilization level of 90 percent was reached. When average CPU utilization of 90 percent was reached, it was determined that the system was considered saturated and testing was ended.

 

 

Given the difference in the clock speeds of the two systems tested, 2.0 GHz and 2.93 GHz respectively, we fully expected to see some level of improved performance, but what we saw was pretty astounding, even to us. The Xeon E5405 XenServer host topped out at 1,600 clients, generating 13,708 maximum transactions per second. The Xeon E5570 XenServer host, by comparison, was able to sustain a load of 2,400 clients, generating 20,978 maximum transactions per second. That's an improvement of 53%!

 

Intel E5405 vs E5570 (640x443).jpg

 

The results are pretty clear. Not only is the new Intel Nehalem Xeon 5500 series a powerful addition to the Intel Server Solutions lineup, but it is a clear advantage to customers seeking to virtualize enterprise application platforms like SQL Server 2008 using XenServer.

 

Details of this comparison test will be posted in a whitepaper to

http://citrix.com/English/ps2/products/documents_onecat.asp?contentid=683148&cid=White+Papers on or about April 1.

 

Bernie Hannon

Director of Certification & Testing

Citrix Systems, Xen Products Division

3-25-2009 11-14-57 AM.jpgWith the Nehalem architecture-based processor server/workstation launch just around the corner, customers are demonstrating how quickly they can take advantage of the performance afforded to them by our newest processor. One recent example was a Philips* CT demonstration of a beating heart at the major European IT show, CeBit*.  According to the Philips demonstration, they were able to render the human heart twice as fast using our upcoming Nehalem workstation processors as compared to the previous generation Intel® Xeon® processor 54xx series.  Check out the beating heart demonstration video in this link using our Nehalem architecture-based workstation processor.

For those of us with no medical background,  3D imaging is a tool enabling doctors to visualize the human body without invasive tools.  The patient is scanned, data is acquired, and a 3D model is assembled and they are able to show what going on in the human body without touching it.  CT tools are able to produce thousand of pictures or slices per second.  This results in large datasets that take time to process.  Software running on a server or workstation processes this vast amount of data to generate detailed 3D images and then can be interpreted by medical staff.   The primary challenge is managing large datasets generated by large volume acquisitions while speeding time to diagnosis.

The heart is a particularly challenging to visualize using CT scanning.  It is a very fast moving organ, with little control over its movement because is involuntary, and the heart needs to be frozen in time to get accurate anatomical picture.  Another challenge for volume rendering is high level image quality necessary so that anatomical structures of the heart can be depicted in an accurate way.

One interesting aspect of Philips CT solution is that image rendering takes place in the in the CPU which means that the performance of their solution is not restricted to by the memory available on a graphics card and is not slowed down by transfer of data between the CPU and the graphics card.  The Philips’ rendering software is multi-threaded and is able to take advantage of Intel advances in multi-threading /multi-core processing technology and software optimization tools for best performance improvements.

I believe Philips’ demonstration is just one example of how Nehalem architecture-based processors will provide meaningful advances in 3D visualization, not just in medical fields, but other areas such as manufacturing and digital content creation.

Jimmy Leon

*Other names and brands may be claimed as the property of others.

The promise that Cloud Computing brings is that of a ubiquitous compute infrastructure that provides services accessible over the internet from anywhere, by any device, and as such Cloud Computing is currently getting lots of air-time on the web news services.


The challenge at present is much like in real life every cloud is different offering different services, different tools to manage, different SLA's different charging models etc - basically all clouds are unique.


Before embarking into the great unknown of Could Computing enterprise IT managers need to fully understand what they are being offered and to look closely at their current infrastructure to determine how best to utilise Cloud Computing. Its key to understand what apps and/or workload may be suitable for placing into the cloud, for many IT departments today this starts with assessing non-mission/business critical workloads - initail targets for deploying in the cloud could be travel expenses, job search, benefits packages etc.


Another consideration is whether to move directly to using an external cloud service or to utilise resources within ones own data centres to setup 'internal clouds'. For many IT departments this is a logical extension of the ongoing work to virtualize their existing infrastructure and provides a valuable learning platform before setting out into the great unknown of the external 'public' cloud infrastructure.


Some questions IT managers need to ask when evaluating Cloud Computing offerings


  • What's the SLA being offered - how reliable is the service, what's the guaranteed uptime, what's the response time to fix failures ( nearly every week there are reports of cloud failures ), whats the financial impact of loss of cloud services and how would this be recompensed.
  • What's the application latency/response time
  • How secure is the service - backup policy for data, how secure is my data, what's the isolation between my app/data and other customers in a multi-tenanted cloud.
  • Where is my app/data located - what's the impact on local data protection laws,

& the list goes on . . .


Something else that needs to be thought about when looking at utilizing Cloud computing is the interoperability between cloud vendors?


Today most vendors offering 'Platform as a Service'* have developed their own cloud platforms based around open source or commercially available hypervisors and these require specific tools and stack formatting to utilise the virtualised compute resources. Cloud vendors are also offering higher level application/web frameworks enabling the use of higher levels of abstraction e.g. Python, Ruby or .Net . All of this further drives the uniqueness of every cloud offering. This will no doubt change over time as the major virtualisation vendors make inroads into this space and standards get developed for packaging virtual machines to be loaded into the cloud and services interfaces.


But today there is little/limited interoperability between Cloud providers and this can limit the flexibility and usability of the Cloud.


Theres a good white paper here from Intel's IT folks on developing a Cloud Computing startegy with the enterprise.


So great promises, and lots of challenges, but as the saying goes 'every cloud has a silver lining' - IT just needs to work out how to extract the benefit before the sun comes out and the clouds float away.



* Platform as a Service - rent a 'hardware platform' in the cloud and load on to this hardware your own software stack, usually paid for by the hour for specific amount of CPU/memory/IO capacity as opposed to 'Software as a Service' - some element of software, either a building block to utilise within your own application ( e.g. credit card checking ) or a fully blown application, such as a CRM app, accessible via the web .

That’s right, now you can buy a supercomputer that fits right under your desk. The PSC’s (Personal supercomputer) of today would have been #1 on the Top500 http://www.top500.org in November of 1996. ASCI Red would have been the first system to overtake the performance these small supercomputers can provide today.

So why do you need one of these high performance bad boys? Well, if you are trying to keep up with technology, beat out your competition, and do it at the lowest cost possible then you just better think about buying one. Whether you are in manufacturing, engineering, financial services or life sciences, the benefits offered are huge. You can now simulate vs constructing expensive prototypes, you can do more work at your desk vs. waiting to schedule the job on the oversubscribed cluster and most importantly, it provides the competitive advantage you most dearly need to keep up with your customer’s demands for lower pricing.

There are a couple of very interesting solutions on the market right now that should be considered. One is the Cray CX1. This little monster can support up to 16 quad core processors! With Nehalem soon to launch, that is one heck of a lot of performance.

http://www.cray.com/products/CX1.aspx

The CX1 is also ICR (Intel Cluster Ready) certified. This certification helps to ensure end users the system will provide a positive out-of-box experience. When you install the system and turn it on, it just works. You are maximizing your investment. The last thing a small business needs is to make the investment and then spend days getting the system up and running.

http://software.intel.com/en-us/cluster-ready/

With today’s economy, many businesses are reducing cost and putting off capital expenditures. You have decided to be like many other businesses and wait just one more year to upgrade your current computing system. Your competition decided not to wait. Instead, they purchased a PSC that was Intel Cluster Ready Certified and are now more productive than ever before. They made the investment and are now turning out new designs faster and at a lower cost than ever before. You scratch your head wondering how they do it. As you try to save your business they are growing and winning new business. Sometimes, being aggressive in a difficult time is the prudent thing to do…

Still wondering if this is right for me? Concerned you don’t have the IT staff to support such as beast? You don’t have the budget? Is there software out there I can use? All are good questions/concerns, but the ICR certified PSC minimizes if not eliminates the need for an IT staff. The PSC is one of the most affordable cluster solutions on the market today. It plugs right into your wall socket…you have tamed the beast! As for software, if you are purchasing an ICR certified system, then there are numerous applications available and most likely, one you are already familiar with.

When you are getting ready to make your next workstation or high-end PC purchase, I strongly recommend you consider one of the new kids on the block, the PSC. If it is Intel Cluster Ready Certified, you can rest assured the solution you do buy will just work.

We are starting to see more and more IT related services being sold under the banner of “cloud computing.”  And it is not just coming from the large service providers like Google, Amazon, Yahoo, although they are certainly a big part of it.  Traditional hosting service providers are also starting to evolve their infrastructure to support this model.  A good part of the motivation to move to Cloud services is its potential to bring significant economic benefits to consumers.  We all know that servers are expensive to purchase and operate, and building and managing even a small size data center is no walk in the park.  For consumers of cloud services the pay-as-you-go model rather than having to own and manage their own infrastructure is attractive.  And the smaller consumer you are the more attractive this probably is.  So it is not surprising that we see individuals and small business as the leading adopters of this model.  Actually, it turns out that the business considerations look different if you are a large enterprise, but perhaps more on that at a later time.

But if you are a service provider what drives the economic model for you and how do you benefit?  Clearly, you need to deliver a differentiated and competitive portfolio of services; but a good part of this is also about how you drive down the cost of owning and operating your data center.  I see a difference between the “true cloud” service providers and the traditional hosting service providers in their approach to data center design and operations.  Efficiencies in data center are the key to driving down data center operating expenses.  And optimization is the path to achieving efficiencies.  There’s a lot of buzz in the press and the blogging world about how large Internet Portal Data Centers are optimizing and driving efficiencies.  Smaller cloud service operators and traditional hosting providers should be watching this to ensure not only their competitiveness, but also to ensure their ability to I’ll be sharing my perspectives on how the traditional hosting providers are evolving in upcoming blogs; in the mean time, I’d love to hear from you on what your data center challenges are and if/how you are going about driving efficiency in your operations.

The storage world is getting quite a bit of press lately. As IT organizations seek to reduce costs, streamline their management tasks, save power, increase capacity and access to their data and improve security, new types of storage solutions are being introduced continuously The growth rate of storage revenue is blistering, and the number of companies scrambling for the piece of the pie is impressive.

 

As with any hot area of technology, there are a lot of technological developments to keep track of, and of course there are disputes about which new trends will maintain dominance, and which technologies will end up carrying the day. I’ve writtenin the past about some of the potential improvements coming along with the introduction new Data Center Ethernet (DCE) [aka, Data Center Bridging (DCB)] IEEE standard improvements. These layer 2 improvements coupled with Fiber Channel over Ethernet (FCoE) can bring to the storage world a converged story for networking. However, noticeably missing from my previous post was a discussion of iSCSI. The iSCSI standard has been around since 2003, and many vendors including Intel have been selling adapters with various levels of support for iSCSI for quite some time. Microsoft has an iSCSI initiator built into their server OS stack, and VMWare also supports iSCSI in their virtualized system environments. This isn’t a comprehensive list, but just a summary to show that iSCSI is ‘real’.

 

So the question is how does FCoE fit in with iSCSI? Are they competing technologies, or complimentary, or neither? Did I leave iSCSI out of my earlier discussion because I believe the technology is doomed? Hardly…

 

The basic case for FCoE is pretty straight forward. If you have an FC SAN in your datacenter, as you roll out a new rack (for instance) you can eliminate the top-of-rack FC switch and also the need for two adapters in each of servers (one for Ethernet, and one for FC). There are additional potential savings as you move FCoE further and further into the network, but I think to keep the discussion simple, the key thing to note is “if you have an FC SAN…”; then you care about FCoE and can potentially gain from it. If you don’t, well then FCoE will not necessarily be your technology of choice. FCoE is not an Ethernet capability that will drive you to use Fiber Channel, but it may make expanding and growing your FC network cheaper, lower power, and more convenient, especially with virtualized blade servers.

 

To elaborate, today the iSCSI SAN market is growing quickly, but it is growing in areas where FC is not generally a popular solution. It is usually chosen where storage over IP on existing Ethernet was viewed as acceptable given the cost tradeoffs. In contrast, FC is deployed where key IT knowledge, and its higher cost hurdles are still overcome by its performance and high reliability; this is generally only the case in fairly large datacenters. iSCSI on the other hand tends to be used in smaller enterprises and potentially in Tier 2/3 datacenters. The question of iSCSI or FC/FCoE is likely not getting decided by the network card in the host initiator. It is decided by organizational needs, and the cost structure / ROI of the storage target deployment including IT personal / expertise. If an organization has FC deployed, FCoE makes sense. If you are designing a new datacenter, FCoE isn’t exactly going to drive your decision (although you may still want to use it). The better way to consider FCoE is that it is another storage over Ethernet option, specifically for FC (to add to iSCSI and NAS). So decide which is the best storage system solution for your Data Center and know that you can connect any of these solutions over Ethernet to your servers.

 

Finally, as the improvements in Ethernet proliferate and these DCE capabilities become mainstream the landscape may change for iSCSI and FC deployments, but the simple fact is that in most cases, the main commonality is that Ethernet is chosen fabric to deliver storage over the network. Whether the storage protocol that is used is iSCSI, or FCoE will be determined by a lot of things within an individual companies’ IT shop. The idea that iSCSI and FCoE are a pair of competitive technologies has been raised in various places around the tech industry, but in reality, these arguments are driven more by people who have some bias toward one solution or the other, it is not fundamentally driven by the end user demands for storage. Both technologies have their place, and both will be supported in varying ways by Intel as well as other ecosystem vendors in the future.

 

So iSCSI or FCoE? I think the answer is: Both.

 

As we move to 10 Gigabit networking, IT decision makers will have something great with both of these capabilities; convergence, and choice. Only Ethernet can deliver this.

 

 

Ben Hacker

scotthuck

Once in a life time project

Posted by scotthuck Mar 16, 2009

Have you ever worked on a project, or with a team than just “clicked”?  Everyone was excited to be at work.  They worked well together.  And while there were of course many challenges, everyone worked through them and the end result was something special? I worked on one such team back in the mid-90’s.  The team was a silicon design and marketing team here at Intel, and the end result was the P6 processor (the official name was the Pentium Pro processor).  That may be too long ago for many of you to remember this product…I guess I’m showing my age.  But the P6 was a revolutionary product.

So what made this team click?  I think it was the fact that we all knew we were doing something special; something that had never been done before. The P6 was the first out-of-order, Super-scale, Super-pipelined, Speculative execution, glue less MP x86 processor every produced.  And the jump in performance as compared to previous generation x86 processors was the biggest ever seen at that time.

What is interesting is that most people, who worked on that project, have a very similar view.  All look very fondly back at that project, there were special friendships created during that project which exist today, and even though many years have past, and many project have come and gone since them, the P6 was something special; a once-in-a-lifetime project which most remember as the best project the ever worked on.  I’ve been at Intel for 23 years, and the P6 is still the best project I ever worked on.

So why all the reminiscing?  Because the soon to be released Nehalem project (aka Intel(r) Xeon(r) processor 5500 Series for servers and workstations, and also known as the Intel® Core i7 processor for desktop)  has many of the same attributes as the P6 project.  While times have changed, and working in a global environment means you cannot duplicate all the things that made the P6 team so special (in particular a close, tight nit group of people working at a single site), there are many things about the Nehalem project which make it special.  While I can’t disclose all the details just yet, look for follow-on to this blog after the product launch where I can describe some if the things which truly make the Nehalem product revolutionary.  Things that make me just sit back and say wow!  What this team accomplished is truly amazing!

I was thinking about a catchy title when I suddenly recalled the ‘Look Who’s Talking’ movie series from a while back. After all catchy titles are key for blogs!.

Previously I shared some thoughts on overall TCO savings that could be achieved, performance benefits that can be realized and how to migrate from RISC to Intel architectures. We all agree that making a change for the sake of change is never a good thing and justifying a change in the current economic environment can be a challenging path. So let’s look at who is changing and the benefits they are realizing from making a change. (I do apologize for over-use of word change, this is not a political commercial)

  • BMW Group wanted to simplify management of their environment and reduce TCO of their proprietary RISC server infrastructure. BMW moved their SAP environment and achieved 2.75-3xperformance gains and greater energy efficiency and drove down cost.
  • Telefonica a major Telecom Service Provider in Europe migrated their mobile online billing system and achieved a 428%performance gain.
  • Florida Hospital moved their disaster recovery system and got higher availability, reduced recovery time and lower system maintenance costs

Changing architecture does not mean that you have to change the operating system and solution stack. In some cases IT organizations are choosing to retain their Solaris environment.

  • BT Vision wanted to triple their Data Center capacity without increasing their power consumption or consuming more space in their DataCenter. Deployed Solaris on Xeon and achieved 10xfaster performance in Solaris Applications, 25-50%increased availability and 80%savings on their underlying equipment

Hopefully these examples help in some way to show that you will not be the first trailblazer trying out something new and unproven.  IT Organizations have moved and are reaping the benefits of the change.

Finally being March 17th and Irish, I would like to wish you all a Happy St Paddy’s day!

In our previous post we noted that the state of the art power montoring in virtualized environments is much less advanced than power monitoring applied to physical systems.  There is a larger historical context, and economic implications in the planning and operation of data centers that make this problem worth exploring.

 

 

 

Let's look at a similar dynamic in a different context: In the region of the globe where I grew up, water used to be so inexpensive that residential use was not metered.  The water company would charge a fixed amount every month and that was it.  Hence, tenants in an apartment would never see a water bill.  The water bill was a predictable cost component in the total cost of the building and included in the rent.  Water was essentially an infinite resource and reflecting this fact, there were absolutely no incentives in the system for residents to reign in water use.

 

 

 

As the population increased, water became increasingly a more precious and expensive resource.  The water company started installing residential water meters, but bowing to tradition, landlords continued to pay the bills, which was still a very small portion of the overal operating costs.  Tenants still had no incentive to save water because they did not see the water bill.

 

 

 

Today there are very few regions in the world where water can be treated as an infinite resources.  The cost of water increased so much faster than other cost components to the point that landlords decided to expose this cost to tenants.  Hence the practice of tenants paying the specific consumption for the unit they occupy is common today.  Also, because this consumption is exposed at the individual unit level, the historical data can be used as the basis for the implementation of water conservation policies, for instance charging penalty rates for use beyond a certain threshold.

 

 

 

The use of power in the data center has been following a similar trajectory.  For many years the cost of power had been a noise level item in the cost of operating a data center.  It was practical to include the cost of electricity in the bill of the cost of the facilities.  Hence IT managers would never see the energy costs.  This situation is changing as we speak.  See for instance this recent article in Computerworld.

 

 

Recent Intel-based server platforms, such as the existing Bensley platform, and more recently, the Nehalem-EP platform to be introduced in March come with instrumented power supplies that allow the monitoring and control of power use at the individual server level.  This information allows compiling a historical record of actual power use that is much more accurate than the more traditional method of using derated nameplate power.

 

 

The historical information is useful for data center planning purposes by delivering a much tighter forecast, beneficial in two ways: by reducing the need to over-specify the power designed into the facility or by maximizing the amount of equipment that can be deployed for a fixed amount of power available.

 

From an operational perspective we can expect ever more aggressive implementations of power proportional computing in servers where we see large variations between power consumed at idle vs. power consumed at full load.  Ten years ago this variation used to be less than 10 percent.  Today 50 percent is not unusual.  Data center operators can expect wider swings in data center power demand.  Server power management technology provides the means to manage these swings, stay within a data center's power envelope, yet maintain existing service level agreements with customers.

 

 

There is still one more complication:  with the steep adoption of virtualization in the data center in the past two years starting with consolidation exercises, an increasing portion of business is being transacted using virtualized resources.  Under this new environment, using a physical host as the locus for billing power may not be sufficient anymore, especially in multi-tenant environments, where the cost centers for virtual machines running in a host may reside in different departments or even in different companies.

 

 

It is reasonable to expect that this mode of fine grained power management at the virtual machine level will take root in cloud computing and hosted environment where resources are typically deployed as virtualized resources.  Fine grained power monitoring and management makes sense in an environment where energy and carbon footpring is a major TCO component.  To the extent that energy costs are exposed to users along as the MIPS consumed, this information provides the checks and balances and the data to implement rational policies to manage energy consumption.

 

 

 

Based on the considerations above, we see a maturation process for power management practices in a given facility happening in three stages.

 

  1. Stage 1: Undifferentiated, one bill for the whole facility.  Power hogs and energy efficient equipment are thrown in the same pile.  Metrics to weed out inefficient equipment are hard to come by.
  2. Stage 2: Power monitoring at the physical host level implemented.  Exposes inefficient equipment.  Many installations are feeling the pain of increasing energy cost, but organizational inertia prevents passing costs to IT operations.  Power monitoring at this level may be too coarse grained, too little, too late for environments that are rapidly transitioning to virtualization with inadequate support for multi-tenancy.
  3. Stage 3: Power monitoring encompasses virtualized environments.  This capability would align power monitoring with the unit of delivery of value to customers.

As usual, after swimming in the morning, I thumbed through my Blackberry.  On the small glass screen, I saw the email from a friend, “Hi, For: Did you send this email to me?”  I was very puzzled by what she meant that I quickly scrolled down to see the full text below her message: “Dear friend, I would like to introduce a good company who trades mainly in electronic products…  I looked at the “From” line.  It is from my personal email account!  I knew immediately that some hacker hijacked my address book and used my email name to send out spam email.  But how did that happen?  How could I clean up this mess? I suspected that my not-so-strong password was hacked and I corrected it right away.  Since the send box identified who were the recipients. I then sent an email to explain the situation.  My sister-in-law shot me an email afterward: “I though it was a little strange.  

This cyber identity theft really makes me mad at the intruder and myself not taking more precaution measures.  I use my web email account everyday, save my personal data in the “cloud”, and provide my VISA card number to purchase online.  With the social media network, I may disclose even more personal information on the web.  This incident wakes me up that I need to protect myself diligently by adopting caution behaviors such as using the strong password or making sure confidential data are encrypted.  I also realize how much trust I have put in the datacenter and service provider that I may not even realize until I am personally affected.  Do the servers enforce strong passwords only?  How do I know the communication between my personal computers and the servers are secured?  Can the service provider be trusted?  It takes both the consumer end and the service providers together to create a secured environment.  Service provides have the fiduciary duty to protect their customers and their investors by focusing on datacenter security issues.  It may take only one security compromise to shake up the trust of the customers. 

I have been with Intel’s server group for the last 13 years and experienced many server technologies from form factor to power saving that have transformed the datacenters.  With our upcoming server platforms, we will be placing more focus on helping datacenters to secure their infrastructure.  We would like to see a day that no one will need to send an email to their friends to say: “I didn’t send that spam!”  

What is your story and resolution regarding security issues in cyber space and datacenters? 

Ever since Johnny Carson introduced his Top Ten format on the Tonight Show, I have found myself a big fan of this format. (Note: I guess the origin of the top 10 format may not have been new with the Tonight Show but it was my first exposure – I guess I date myself some ).

Many times the top 10 format is fun and entertaining, however, my topic today is a little more serious. In January of 2009, I found Tech Republic’s “10 Reasons To Purchase New Hardware During A Recession” on a ZDNet Blog

  1. Equipment still wears out
  2. Productivity becomes paramount
  3. Downtime is expensive
  4. Competition suffers too
  5. Manufacturers offer discounts
  6. Consultants more willing to negotiate
  7. Running older hardware longer costs more
  8. Interrupting purchasing cycles is expensive
  9. New applications require greater resources
  10. Employee retention remains a consideration

As I spend my days talking IT managers around the world, many of these items resonate with me and whether you are investing to support continued daily business operations (existing hardware is a limiter for you), to improve IT efficiency (reduce operating costs of aging install base) or to business competitiveness (offer new services before your competition does) … many of these Tech Republic Top 10 reasons (see below) were at the heart of their investment strategy.  

So before you cut your IT budget in response to economic conditions, consider if purpose driven IT investments might deliver you a competitive advantage by enabling you and your company to do more with less.

Tune in next few weeks for a my two part series covering the benefits of IT investments in specific business environments

ð      Small – Medium Business: Why Buy for the Small Guy?

ð      Large Enterprise: Why Buy for the Big Guy?

PS: Do you have a favorite top 10 (personal or business)?

Chris

Go Beyond the Kernel:

Refocusing HPC Benchmarking on Total Application Performance

Want to improve application performance by 10x or 100x? Few HPC customers would say no. Yet in some cases, the promises of tremendous performance improvements from accelerators, attached processors, field-programmable gate arrays, and the like evaporate when total application performance is evaluated. Benchmarks that focus on kernel or even partial application performance provide incomplete picture with respect to the impact on total application benchmarking.  While difficult, HPC customers should look to test total application performance.

Why benchmark?

Benchmarking is an essential means for helping end users choose and configure HPC systems. An end user has a problem and needs to know the best way to solve it. More specifically, the end user has a specific workload to run and needs to find hardware that can deliver the best performance, reliability, application portability, and ease of application maintenance. As Purdue University researchers wrote in a recent IEEE article that argued for real application benchmarking, an HPC benchmark should, among other things, produce metrics that help customers evaluate the overall or total time to solution for their problems.

The claim of a 10x to 100x improvement from a particular product can easily grab someone’s attention. But what does that 10x measurement really mean? In many cases, these claims are derived from kernel or partial application benchmarking, which might fail to tell the whole story. While an increase in floating-point performance or the addition of a CPU accelerator could deliver a significant improvement for one kernel, the total application improvement depends on additional HPC system elements. As one participant argued in a recent HPC conference reported by IDC, solution time can be represented as an equation:

Solution time = processing time + memory time + communication time + I/O time – all four combine to form total application time.  The Caveat Emptor is to make sure you analyze your application; understanding what you are measuring and ensuring that you have the balanced architecture to deliver the best performance.  I am biased but I think Intel’s soon to be announced Nehalem processor delivers just that. 

Kernel benchmarking has its place, but benchmarking total (or “real”) application performance is critical for accurately evaluating HPC systems.

Given the recent intense focus in the industry around data center power management and the furious pace of the adoption of virtualization, it is remarkable that the subject of power management in virtualized environments has received relatively little attention.

 

It is fair to say that power management technology has not caught with virtualization.

 

Here are a few thoughts on this particular subject, which I intend to elaborate in subsequent transmittals.

 

For historical reasons the power management technology available today had its inception in the physical world where watts consumed in a server can be traced to the watts that came through the power utility feeds.  Unfortunately, the semantics of power in virtual  machines have yet to be comprehensively defined to industry consensus.

 

For instance, assume that the operating system running  in a virtual image decides to transition the system to the ACPI S3 state, sleep to memory.  What we have now is the state of the virtual image preserved in the image's memory with the virtual CPU turned off.

 

Assuming that the system is not paravirtualized, the operating system can't tell if it's running in a physical or virtual instance. The effect of transitioning to S3 will be purely local to the virtual machine.  If the intent of the system operator was to transition the machine to S3 to save power, it does not work this way.   The virtual machine still draws resources from the host machine and requires hypervisor attention. Transitioning the host itself to S3 may not be practical as there might be other virtual machines still running, not ready to go to sleep.

 

Consolidation is another technology for reducing data center power consumption by driving up the server utilization rates.  Consolidation for power management is a blunt tool, where applications that used to run in a physical server are now virtualized and squished into a single physical host.  The applications are sometimes strange bedfellows.  Profiling might have been done to make sure they could coexist, as a priori, static exercise with the virtual machine instances treated as black boxes. There is no attempt to look at the workload profiles inside each virtualized instance and in real time.  Power savings come from an almost wishful side effect of repackaging applications formerly running in a dedicated server into virtualized instances.

 

A capability to map power to virtual machines, in both directions, from physical to virtual and virtual to physical would be useful from an operational perspective.  The challenge is twofold, first from a monitoring perspective because there is no commonly agreed method yet to prorate host power consumption to the virtual instances running within, and second from a control perspective.  It would be useful to schedule or assign power consumption to virtual machines, allowing end users tomake a tradeoff between power and performance.  Fine grained power monitoring would allow prorating power costs to application instances, introducing useful pricing checks and balances encouraging energy consumption instead of the more common method today of hiding energy costs in the facility costs.

In 1965, Intel co-founder Gordon Moore made a prediction, popularly known as Moore's Law, stating that the number of transistors on a chip will double about every two years. Intel has kept that pace for nearly 40 years. For IT, this translates into a roadmap that enables IT to buy new servers that cost roughly the same as the previous server but performs so much better. Compare Intel’s 4 Socket MP server performance introduced in 2006 (Intel Xeon processor 7000 series) to today’s server introduced in 2008 (Intel Xeon 7400): 3x more performance throughput as measured by SPECint*_rate_base 2000*, 2.4x more ERP users as measured by SAP-SD* and 2x more database transactions as measured by TPC-C*.

Now, introduce a global economic downturn into the mix and suddenly IT is forced to cut costs and projects (i.e. delay or cancel upgrades and non-revenue generating projects). New articles start popping up from magazines like the Economist that take Moore’s Law and propose flipping it on it’s ear: instead of products providing more performance at roughly the same price, provide products that offer the same performance as IT is already experiencing, but now at a lower price. Call it “inverting Moore’s law” where IT takes the dividend it provides in dollars vs. extra performance.

So here’s something to think about: You can also “invert” Moore’s Law by making new targeted IT investments today that offer attractive payback scenarios tomorrow - giving you similar performance but at a much lower cost. With mortgage rates dropping, you may have already benefited from a rapid payback in your personal life (i.e. I recently refinanced a house down from 7% to 5.25% 30-year fixed rate that I had continuously made additional principle payments for. The ~$5k up front investment (i.e. closing costs) will be “paid back” to me after 5 months due to monthly mortgage payment savings.

Here is a server refresh example that explains how you can also get an attractive payback for your IT department.

 

Oracle Database Refresh: Let’s next look at a hypothetical example of an IT department running current Oracle Database Enterprise Edition on 12 servers purchased in early 2006 (dual-core Intel Xeon 7041 based servers introduced in 2005) and assess the total cost of ownership difference in moving to new servers.  We’ll assume the IT manager is paying per processor licensing fees for Oracle Database. We’ll compare the old server equipment to new 4-core Xeon 7440 based servers that offer up to ~3x more database performance (Xeon processor 7400 Series come in flavors of 6-core and 4-core versions).  This should enable consolidation ratios of 3:1, enabling the IT manager to reduce from 12 servers to 4 new servers. 

First the new investment: 4 New Xeon 7400 based servers at roughly $20k each = $80k.  Add another $5k for Network, Server Maintenance and Install Costs.  Remove ~$2k in tax implications associated with the expense in year 0.  Total investment ~$83k. 

Next, let’s look at the savings: The IT Manager is paying $41.8k yearly on Oracle maintenance/support costs x 12 dual-core MP servers today, that is $501k.  The 4 new quad-core servers will have larger Oracle database maintenance/support costs because of the core count ($83k x 4 servers = $334k) but this will still result in $167k SW savings each year (difference between $501k and $334k) which my calculations show about $669k savings over 4 years.  Moving from 12 to 4 servers also reduces about $72k in network, server maintenance, and utility (power/cooling) costs over 4 years as well. In addition to all of these costs savings over 4 years, my calculations show that the original investment of ~$83k has a payback of 9 months.

Targeted IT investments today can offer attractive payback scenarios and cost savings tomorrow - giving you similar performance but at a much lower cost.   Let me know what you think? 

 

 



Filter Blog

By author:
By date:
By tag: