Home > Intel Communities > Open Port IT Community > The Server Room > Blog > Tags > data_center
1 2 3 4 5 ... 7 Previous Next

The Server Room Blog

102 Posts tagged with the data_center tag
1

Every day in our personal lives, we’re bombarded with “opportunities” to get a better deal.  At the grocery store, we might be able to buy a single item for $2.50 or 3 for $5.00…which then forces us to go thru the mental gymnastics of figuring out how good of a deal it is, and whether or not we really need three 96 oz. bottles of salad dressing.

 

But there are some opportunities out there for adding a bunch of compute performance are a bit more straight-forward.


Case in point: Dell recently had Principled Technologies compare the performance for the Intel® Xeon® Processor E5520 and E5506 CPUs each running on a PowerEdge R710 server.  Both are 4 core processors, but the E5520 has many advantages over the E5506: 

  • higher frequency (2.26 GHz vs. 2.13 GHz)
  • faster QuickPath speeds (5.86 GT/s vs. 4.8 GT/s)
  • faster memory support (1066 MHz vs. 800 MHz)
  • Turbo Boost
  • Hyper-Threading support.

 

Long story short:  Buying a slightly better processor with a server purchase can drastically increase your performance.  So if you are looking to buy a Dell PowerEdge server configured with Microsoft SQL Server 2008* and an Intel® Xeon® Processor E5506, for an additional $300 you can get up to 75% more performance by upgrading to an E5520 CPU.  More performance headroom in a similar power envelope, faster QuickPath and memory speeds, Hyper-Threading and Turbo Boost functionality – all for $300.  NOW THAT’S A GREAT VALUE!

 

Check out the summary document for the Dell R710 Principled Technologies performance testing, which also has comparative performance testing for the Xeon® E5540 and X5550 CPUs (also a great value for the money!), along with results for Microsoft Exchange.

 

NOTE:  System pricing from www.dell.com as of May 13, 2009.  Actual performance will vary based on configuration, usage and manufacturing variability. See the actual Principled Technology report in the following link for complete system configuration

1 Comments Permalink
1

Are you a developer writing applications to run on the Solaris operating system?. Are you looking for ways to optimize your Solaris solution on industry standard architecture based on Intel microprocessor? If you answer yes to either of these questions then please read on.

 

Intel and SUN have been working closely together to optimize the Solaris operating system on the Intel Xeon 5500 processor. Most of you probably know the Xeon 5500 better by its product codename Nehalem. The Xeon 5500 is the the product that fits into 2 socket platforms.

 

SUN have just published a very compelling quick reference guidethat will assist both Developers and System Administrators looking to optimize Solaris solutions on Xeon based processors. The guide talks about the work that Intel and SUN are doing together, technical descriptions of specific features and capabilities that can be implemented in the Solaris OS to optimize the capabilities of the Xeon.

 

I have just finished reading this and it is a very compelling paper covering topics such as

- How Solaris takes advantage of Intel Turbo Boost Technology to use available power headroom to deliver higher performance based on workload demand

- How Solaris can take advantage of new Intel Quickpath Interconnect (better known as QPI) and other innovations in the OS to reduce memory latency

- How Solaris performance counters help to better manage workloads

- How Solaris takes advantage of many of the power efficiency capabilities in the processor. Things like Power Aware Dispatched in Solaris enable the processor to stay longer in idle states. In non tech talk this saves power.

 

Solaris has been a tried and tested operating system for along time for companies running their most business critical workloads. This paper talks about the combination of Solaris and Xeon to deliver improved reliability and availability for these critical workloads. Detail information on predictive self healing, fault management, leveraging Intel Machine Check Architecture and more all included in this paper.

 

Probably my favourite section is around the developer tools optimizations and the different tools available for developers that want to run and optimize their applications on Solaris and Xeon.

 

Ok, I'll stop waxing lyrical now. This is a very compelling paper and it does certainly construe that Solaris and Xeon 5500 could be the perfect combination for your Solaris solution. What do you think?

1 Comments Permalink
1

The debate on how to best increase system capacity to accommodate growing applications has raged on for years; “scale up” with more CPU, memory, and I/O, or “scale out” with loosely connected systems.    Scaling out by adding networked systems to increase capacity has been a good economical solution for many IT managers because it allows them to grow by using less expensive, industry standard building blocks.  However, there are some notable exceptions to this line of thought.  One is that the class of applications that require shared memory and large database support are much better suited to run on a single, expandable system that scales up.  These are typically transaction processing, business intelligence and ERP solutions.   Until now, IT managers running applications that require scale-up systems larger than 4 or 8 CPUs have had limited platform choices and most were proprietary and expensive RISC-based servers.

 

The other problem with the scale out approach is the people, facilities, software and overhead costs and complexity of managing very large numbers of servers, which can grow to a point where the costs outweigh the performance and system cost benefits.  The industry solution to achieving better ROI has been to consolidate multiple scale-out servers onto single industry standard scale-up servers with virtualization solutions.  This is a good solution, but is limited by the number of application loads the IT manager feels comfortable placing on a single server, given the need to maintain peak performance and availability for each application.

 

Well, it looks like the scale-up, scale-out debate is about to take another turn.  In the server product update Intel gave on May 26th, they talked about new levels of system scalability and choice supported by the upcoming Nehalem-EX processor.  This processor will support systems that scale up to 8 sockets natively (shared memory, without any additional silicon), and up to 16 sockets and higher with node controllers from system manufactures that allow single systems to share memory beyond 8 sockets.   So far there are over 15 different designs from 8 OEMs that offer 8 socket or higher scalability.  But of course, for the class of application where scaling is important, socket count doesn’t tell the whole story of what’s needed for scalable performance.  Thread support, key for transaction processing and virtualization, scales at the rate of 16 threads per socket with 8 cores and Hyper Threading (2 threads per core).  That would be 128 threads for an 8-socket system, and 256 threads for 16 sockets.   And in order to keep those threads fed with data close to the CPU, each processor supports up to 24 MB of shared cache (1.5X current generation Xeon), and an impressive 16 memory slots per socket or 128 DIMMs on an 8-socket system.  In addition, the Scalable Memory Interconnect gives these systems 9 times the memory bandwidth of today’s top Xeon processor.  Finally, four QuickPath interconnect links per socket allow for high-bandwidth sharing of data across the system.

 

So the net of it is that the industry is going to see a broad selection of highly scalable, next-generation servers that significantly extend the economic advantage of industry standard scale-up solutions for business-critical, large database, and high-end virtualization/consolidation deployments.     I would expect these systems to give IT managers a very cost-effective alternative to the much more expensive and proprietary RISC-based servers they use today.

 

What are your thoughts?  Mike

 

Related Topics:

 

 

 

 

1 Comments Permalink
0

If you read my blog about server refresh and quarterbacks, you will understand how important it is to have a good quarterback inside your organization leading the server refresh effort.  Well at Intel IT that person is Matt Beckert.

 

 

 

I have had the opportunity to work closely with Matt over the past couple years and have watched Intel’s server refresh strategy develop, get ratified and … because of the economic conditions … get questioned.  It was interesting to sit on the sidelines and watch how the economy caused intel to question a proven strategy that delivered $45M of savings to intel in 2008 (Intel IT Performance Reports).

 

 

Ever since I was a kid, I have been an avid New England Patriots fan and Tom Brady is worth every dollar of the over $14M the Patriots will pay him in 2009

 

 

However, I’m sure glad that Matt is on the Intel IT team as his efforts have demonstrated to Intel that proceeding with server refresh in 2009 inside Intel IT’s infrastructure is worth $19M of savings versus deferring refresh to 2010.  Read more about “Staying Committed to Server Refresh Reduces Cost” and find out where the savings came from, how Intel IT overcame the capital budget constraints internally to make this priority investment.

 

 

  • Who is your server refresh quarterback?
  • What is your savings opportunity?
  • Model your potential savings for server refresh at www.intel.com/go/xeonestimator

 

 

 

 

 

Chris (Go Patriots )

0 Comments Permalink
0

The Intel(r) Dynamic Power Node Manager technology allows setting a power consumption target for a server under load as described in a previous article.  This is useful for optimizing the number of servers in a rack when the rack is subject to a power budget.

 

Higher level software can use this capability to implement sophisticated power management schemes, especially schemes that involve server groups.  The range of control authority for servers in the Nehalem generation is significant.  The power consumption of a fully loaded server consuming 300 watts can be rolled back by roughly 100 watts.  In virtualized utility computing environments additional control authority is possible by migrating the virtual machines out of a host and consolidating them into fewer host.  The power consumption of the power capped host now at 200 watts, can be brought down by another 50 watts, to 150 watts.

 

 

The reader might ask about the possibility of constantly running servers in capped mode to save energy.  Unfortunately capping entails a performance tradeoff.  The dynamic is not unlike driving an automobile.  The best mileage is obtained by running the vehicle at a 35 MPH constant speed.  This is not practical in a freeway where the the prevailing speed is 60 MPH.  The vehicle could be rear ended, or perhaps a more mundane motivation, the vehicle driver drives the vehicle at 60 MPH because she wants to get there sooner.  Like a server, the lowest fuel consumption in a running vehicle, at least in gallons per hour, is attained when the vehicle is idling.  No real work is done with an idling engine, but at least the vehicle can start moving in no time.  Continuing with the analogy, turning a server off is equivalent to storing a car in the garage with the engine stopped.

     

This document provides an example of the performance tradeoff with power capping.  Please look in page 5, Figure 2.

 

The following example illustrates how group power capping works.  The plot is a screen capture of the Intel(r) Data Center Manager software managing the power consumption in a cluster of four servers.  The four servers are divided in a cluster of two server sub-groups of two servers each, labeled low-priority and high-priority

 

DCM-GUI.png

 

The light blue band represents the focus of the plot. The focus can be changed with a simple mouse click.  The current focus in the figure is the whole rack.  Hence the power plot is the aggregated power for all four servers in a rack.  If the high priority sub-group were selected, then the power shown would be the power consumed by the two servers in that sub-group.  Finally, if a single server is selected, then the power indicated would be the power for that server only.

     

There are four lines represented in the graph.  The top line is the plate power.  It represents an upper bound for the server’s power consumption.  For this particular group of servers the plate power is 2600 watts.  The servers are identical, and hence rated at 2600 / 4 = 650 watts. 

The next line down is the derated power.  Most servers will not have every memory slot or every hard drive tray populated. The derated power is the data center’s operator guess about the upper bound for power consumption based on the actual configuration the server.  The derated power is still a conservative guess, considerably higher than the actual power consumption of the server. As a rule of thumb, it is ~70% of the nameplate. The derated power has been set at 1820 watts for the rack or 455 watts per server.

     

Finally, the gold line represents the actual power consumed by the server.  The dots represent successive samples taken from readings from the instrumented power supplies. 

     

The servers are running at full power using the SPECpower benchmark.  The rack is collectively consuming a little less than 1300 watts.  At approximately 16:12 a policy is introduced to constrain power consumption to 1200 watts.  DCM instructs individual nodes to reduce power consumption by lowering the set points for Node Manager in each node until the collective power consumption reaches the desired target.

When we instructed Data Center Manager to hold a power cap for the group rack (2), it makes an effort to maintain power at that level, in spite of unavoidable disturbances in the system. 

 

The source of the disturbances can be internal or external.  An internal disturbance can be the server fans switching to a different speed causing a power spike or dip.  Workloads in servers go up and down, with a corresponding uptick or dip in the power consumption for that server.  An external disturbance could be a change in the feed voltage or an operator action.  In fact at T = 16:14 we introduced a severe disturbance: we brought the workload of the bottom server, epieg3urb07 down to idle. 

 

 

 

Note that it takes a few seconds for Data Center Manager to react and to reach the original power level.  Likewise, when the bottom server is brought to idle, it also pulled back the power consumption for the group.  However, the group power went back to the target power consumption after a couple of minutes.  If we look at the plot of the individual servers, we can see Data Center Manager at work maintaining the target power.

Combined Power.png

The figure above captures the behaviors of the individual servers.  Note how DCM allocates power to individual nodes yet it maintains a global power cap. When the server at the bottom is suddenly idled, there is a temporary dip in power server consumption for the group, but it soon recovers to the target capped level.  Also note that the power not used by the bottom server is reallocated to the remaining three nodes until they get close to the previously unconstrained level.

0 Comments Permalink
0

Last week I wrote about the server product update for the upcoming Nehalem-EX processor and the expandable platforms based on it.  Today I wanted to provide you with a short 10 minute video captured from the event.  It’s a really good summary for those of you that want to learn more about Intel’s Xeon product roadmap but with limited time.

Also, as I mentioned earlier, look for some informative blogs over the next 1-2 weeks that will offer more of an in depth view of Nehalem-EX’s 4 Socket capabilities, performance, scalability, RAS, and Virtualization. 

bryce

0 Comments Permalink
0

When you’re planning a backpacking trip, whether it’s for several hours or several days, space is at premium.  Not only do you need to think about tents, sleeping bags, clothing, first aid, and navigational gear (among other things), but also how to keep yourself properly hydrated and fueled up.  Oh yeah, you have to figure out how to cram all of this gear into your pack…and carrying an additional pack is not an option!

Odds are you’ll be heading into the wilderness and won’t be able to re-supply for a while, so one of the limiting factors will be the amount of food you can carry.  Running out of fuel in the middle of nowhere makes for a potentially disastrous situation.

So let’s look at the nutritional numbers and how best to fuel the trip:

  • Fats:  ~9 calories per gram, and typically found in nuts and oils
  • Carbohydrates and proteins:  ~4 calories per gram, and typically found in sugars, grains, and meats

If you’re trying to maximize the number of calories you can carry in order to sustain you during your trip, you probably want to pack more foods with a higher fat content (such as peanut butter) than carbs or protein.  More calories per gram à more energy in your pack to get you where you want to go.

You can probably figure out where I’m going with this analogy – low power CPUs are all about helping maximize your performance per rack, just like packing foods with more calories per gram help deliver more energy in a limited amount of backpack space.

Depending on your specific rack power or overall datacenter power / cooling environment, low power SKUs might be a good fit to help maximize your performance per rack.  For the Intel® Xeon® 5500 series, there are two low power CPU options available, both spec’d at a 60W Thermal Design Point (TDP):  Xeon® L5506 (2.13 GHz) and the Xeon® 5520 (2.26 GHz).  These two SKUs have the same features as the corresponding Xeon® E5506 and E5520 SKUs, just lower in power. 

If you’re buying LV Xeon® 5400 CPUs today, such as the L5420, expect a big jump in performance per rack with the Xeon® L55xx SKUs due to lower overall system power and higher performance.  Similar story if you’re evaluating the Xeon® E5506 or E520 SKUs – same performance with L55xx SKUs with lower system power, so higher performance per rack.

Have questions – ask me on this blog or Ask An Expert in the Server Room.

0 Comments Permalink
0

In this installment on uses of server power management we continue the discussion on using this capability for other uses beyond server rack density.

Intel(r) Data Center Manager (Intel DCM) is a software development kit that can provide real time information to optimize data center operations.  It provides a comprehensive list of publish/subscribe event mechanisms that can form the basis of a sophisticated data center management infrastructure integrating multiple applications where applications get notified of relevant thermal and power events and can apply appropriate policies.

These policies can span a wide range of potential actions:  dialing back power consumption to bring it down below a reference threshold or to reduce thermal stress on the cooling system.  Some actions can be complex, such as migrating workloads across hosts in a virtualized environment, powering down equipment or even performing coordinated actions with building management systems.

Intel DCM also provides inlet temperature or front panel thermals along with a historical record that can be used to identify trouble spots in the data center.  This information provides insights to optimize the thermal design of the data center.  The actions needed to fix trouble spots need not be expensive at all; they may involve no more than relocating a few perforated tiles or installing blanking panels and grommets to minimize air leaks in the raised metal floor.  Traditionally, the hardest part has been identifying the trouble spots, involving time consuming temperature and air flow measurements. Intel Data Center Management provides much of this data ready made from operations. Typically this type of analysis is done by a consulting team and the cost of this exercise is high, anywhere between $50,000 to a $150,000 for a 25,000 square foot data center.  This analysis yields a single snapshot in time which becomes gradually more inaccurate as  the equipment in the data center is refreshed and reconfigured.

Deployment scaling can range from a small business managing a few co-located servers in a shared rack in a multi-tenant environment to organizations managing thousands of servers.

The event handling capability is an software abstraction implemented by the Intel DCM SDK running in a management console.  From an architectural perspective, and the fact that the number of nodes managed can range in the hundreds, it makes more sense to implement this capability as software rather than firmware.  Node Manager is implemented as firmware and it typically controls one server. The choice of SDK over a self-standing management application was also deliberate.  Although Intel DCM comes with a reference GUI to manage a small number of nodes as a self-standing application, it shines when it's used as a building block for higher level management applications.  The integration is done through a Web services interface. Documentation for Intel DCM can be found in http://software.intel.com/sites/datacentermanager/.

0 Comments Permalink
1

Cloud.

I haven’t seen as hyped a term in the data center arena since…um…virtualization.  Everyone is talking cloud, promising cloud, and believing cloud.  But what exactly is this thing called cloud? Is it outsourcing services to a provider, the next generation of virtualization, or something completely different?  There are a lot of definitions, and everyone has opinions…strongly held opinions that led me to my Rolling Stones inspired title to this post (yep, those lyrics...you can sing along now).  Chip Chat decided to get to the bottom of the cloud story, so we were excited to spend some time recently with Intel’s queen of the cloud, Raejeanne Skillern.  Check out my conversation with her here.

1 Comments Permalink
1

Instrumentation – sources of data and points of control – on processors, chipsets and subsystems are the foundation for innovative performance, manageability and energy efficiency advancements at the platform, rack and even data center level. The instrumentation delivered by Intel Xeon processor 5500 is the basis for enabling industry leading power capping solutions that help to squeeze the most out of every dollar spent and kilowatt consumed in the data center.

This brief animation provides a brief overview of power capping and a number of use cases. Are you using or considering using power capping?  Share your thoughts and experiences about this emerging instrumentation based technology.

 

 

 

1 Comments Permalink
0

In a previous article we explored the implementation mechanisms for monitoring and controlling the power consumed by data center servers.  In this article we'll see that an ability to trim the power consumed by servers at convenient time represents a valuable tool to reduce stranded power and take maximum advantage of the power available under the existing infrastructure.  Let's start with a small example and figure out how to optimize the power utilization in a single rack.

Forecasting the power requirements for a server over the product’s lifetime is not an easy exercise.  Server power consumption is a function of server hardware specifications and the associated software and workloads running on them. Also the server’s configuration may change over time: the machine may be retrofitted with additional memory, new processors and hard drives. This challenge is compounded by more aggressive implementations of power proportional computing: servers of a few years ago exhibited little variability between power consumption at idle and power consumption at full power.

While power proportional computing has brought down the average power consumption, it also has increased its variance significantly, that is, data center administrators can expect wide swings in power consumption during normal operation.

Under-sizing the power infrastructure can lead to operational problems during the equipment’s lifetime: it may become impossible to fully load racks due to supply power limitations or because hot spots start developing.  This extra data center power capacity needs to be allocated for the rare occasion where it might be needed, but in practice and cannot be used because it is held in reserve, leading to the term "stranded power."

One possible strategy is to forecast power consumption using an upper bound.  The most obvious upper bound is to use the plate power, that is, the power in the electrical specifications of the server.  This is a number guaranteed to never be exceeded.  Throwing power at the problem is not unlike the approach of throwing bandwidth at the problem in network design to compensate for lack of bandwidth allocation capability and QoS mechanisms.  This approach is overly conservative because the power infrastructure is designed by adding the assumed peak power for each server over the equipment’s life time, an exceedingly unlikely event.

The picture is even worse when we realize that IT equipment represents only 30 to 40 percent of the power consumption in the data center as depicted in the figure below.  This means that the power forecasting in the data center must not only include the power consumed by the servers proper, but also the power consumed by the ancillary equipment, including cooling, heating and lighting, which can be over twice the power allocated to servers.

Establishing a power forecast and sizing up a data center based on nameplate will lead to gross underestimation of the actual power needed and unnecessary capital expenses[1]. The over-sizing of the power infrastructure is needed as insurance for the future because of the large uncertainty in the actual power consumption forecast.  It does not reflect actual need.

pyramid.png

Power allocation in the data center.

A more realistic factor is to de-rate the plate power to a percentage determined by the practices at a particular site.  Typical numbers range between 40 percent and 70 percent.  Unfortunately, these numbers represent a guess representative over a server’s lifetime and are still overly conservative.

Intel(r) Data Center Manager provides a one year history of power consumption that allows a much tighter bound for power consumption forecasting.  At the same time, it is possible to limit power consumption to ensure that group power consumption does not exceed thresholds imposed by the utility power and the power supply infrastructure.

Initial testing performed with Baidu and China Telecom indicates that it is possible to increase rack density by 40 to 60 percent using a pre-existing data center infrastructure.

We will explore other uses in subsequent articles such as managing servers that are overheating and dynamically allocating power to server sub-groups depending on the priority of the applications they run.


[1]Determining Total Cost of Ownership for Data Center and Network Room Infrastructure, APC Paper #6 and Avoiding Costs from Oversizing Data Center and Network Room Infrastructure, APC Paper #37, http://www.apc.com

0 Comments Permalink
7

I was thinking about what to write in my next blog and what I could share beyond what I have written previously about Intel Vs RISC in terms of TCO, performance and the customers that are choosing to move.

 

Luckily I didn't have to think too long on a Friday morning as a a topic came to mind instantly. There are numerous articles flying around this morning that picked up on the Oracle comments yesterday about how SPARC based systems compare to Intel. Thanks for providing me with an appropriate topic.

 

So in case you missed it, there was a question and answer session with Larry Ellison. When asked about SPARC, this was the reply "SPARC is much more energy efficient than Intel while delivering the same performance on a per socket basis. This is not a green issue, its an economic issue. Today, database centers are paying as much for electricity to run their computers as they pay to buy computers. SPARC machines are much less expensive to run than Intel machines"

 

1) SPARC more energy efficient than Intel?  Seriously, in what parallel universe does that exists?

SUN continues to use watts per thread as measure of energy efficiency. The recognized industry standard benchmark for measuring energy efficiency is SPECpowerand I don't see any SPARC based results in the 91 results published. The absence of a result certainly says something very clear to me - no story.

 

These UltraSPARCT2+ systems get loaded with a lot of memory to deliver the their results, so when you look at overall system power (what people care about) they are not as energy efficient as Intel based systems.

 

SPECpower is effectively based of SPECJbb-2005 so another way of loking at this is to look at the SPECJbb-2005 results for a 4 socket UltraSPARcT2+ system and a Xeon 7400 system. The 4s UltraSPARCT2+ delivers 693k BOPs while Xeon 7400 is 532kBOPs. So you conclude that SPARC is better than Xeon?. That would be the wrong conclusion

UltraSPARCT2+ system would consume 1525 watts Vs Xeon 7400 at 816 watts. If you look at BOPs per watt (another way of looking at energy efficiency and performance) then you would see that Xeon 7400 is 43% more energy efficient. Doing a similar comparison with Xeon 5400 (I haven't even talked about our latest Xeon 5500, Nehalem) would be up to 77% more efficient than UltraSPARCT2+.

 

And lastly before I forget to mention the 4s UltraSPARCT2+ had 128GB memory and costs over $150,000for the system, while Xeon 7400 based system had 64GB memory and costs around $32,000.

 

2) SPARC deliver same performance on a per socket basis?

2S Xeon 5500 has performance leadership over 2S UltaSPARCT2+ across a wide range of benchmarks. Up to 70% more performance and up to 60% lower system cost. 4S Xeon 7400 has price/performance leadership over 4S UltraSPARCT2+, UltraSPARCT2+ results achieved with system loaded with lots of memory that drives the cost up to 3-4Xthat of Xeon 7400 system

 

3) SPARC machine are less expensive to run?. I can't for the life of me work this one out!.

Hardware systems based on Intel have leading price/performance (read cheaper), lower energy needs (so electrivity bill lower) and any software product with a license per core strcuture is less expensive on Xeon system than an 8 core UltraSPARcT2+ (which also has higher multipler per core)

 

That's all for now folks. I just wanted to share some data on why I know that SPARC machines are much MORE expensive to run than Intel machines

7 Comments Permalink
1

Virtually Everything...

Posted by Ken Lloyd May 6, 2009

"We are virtualizing".  I hear that at every customer, every day.  I am not sure where virtualization is on the hype curve, but i don't think it is anywhere near slowing down.  I am very glad to be past the "Dilbert" and "in flight magazine" era.  Customers seem to have a really solid command of what they want to virtualize and why they want to virtualize. ( not to imply that all the questions have been answered )

The latest Intel servers - Xeon 7400 processor series in the 4 socket family, and the incredible Xeon 5500 (Nehalem) processor series in the 2 socket family - deliver more than sufficient capacity for sweeping data center virtualization.  i.e. very few enterprise applications are to big for a VM on one of these platforms.

 

I hear three reasons from customers for virtualization. ( in order of emphasis )

 

1) To improve efficiency.  Most enterprise servers are only about 10% utilized ( and many of these are old, slow, inefficient servers)  Applications are partitioned onto individual servers for archaic historical reasons.  Combining these on powerful modern servers can dramatically reduce footprint, power, server costs and licensing costs.

 

2) To improve flexibility.  Virtualization allows "servers" ( think VMs ) to be easily moved from one platform to another - for sizing - for maintenance - for almost any reason.  With the Intel Flex Migration technology and recent versions of VMware ESX - customers can pool Intel servers across multiple generations and families.  Live migration from your Xeon 5100 processor based server to a cozy VM on a 4 socket Xeon 7400 based 24 core server.

 

3) To improve reliability.  Virtualization provides a vehicle for managing hardware failures, allowing near instantaneous fail-over in the event of a server loss.

 

Virtualization has moved out of the lab and become a "best know method" for doing IT right.

 

Intel points to three focus areas for servers.  Efficiency, Performance, and Virtualization.  I think virtualizaiton's place in this triad is fleeting.  It only remains  because changes are still being made to the platform to support virtualization.  Soon virtualization will become just another part of the stack- like the operating system.  Nobody claims their processor is optimized for running an operating system...  Even today choosing the best processors for virtualization is more about efficiency and performance than about virtualization features.  Fortunately - as I do work there - Intel has a solid lead on both efficiency and performance.

1 Comments Permalink
2

Sure, Intel® Xeon® 5500 Series Processors represent a quantum leap forward in terms of both performance and energy efficiency. That has been proven in a number of test results and reviews.  But for your back-end data demanding enterprise app deployments, large scale server consolidation or virtualization of business critical applications, Intel® Xeon 7400 series processors offer outstanding performance and performance per watt in 4-socket servers. So, which platform do you choose, especially when this decision is likely going to be the key determining factor for capital savings, efficiency and TCO for your datacenter infrastructure? Well, you’re read a lot about Xeon 5500 series Nehalem servers over the last few weeks.  Let me share with you some reasons to consider a Xeon 7400 series 4-socket server when you are presented with the choice between Intel’s two best of breed products for virtualization.

4 Socket and above servers (Xeon 7400) are purpose built – just like a large truck: They’re purpose built for your most data demanding enterprise applications like database and ERP, and for large scale server consolidation using virtualization. Large Trucks are also purpose built.  They’re purpose built for hauling large loads over long distances.   Now, you don’t buy a large truck to commute to work in.  You also don’t take your everyday commuter and attempt to haul large loads with it, because if you did you would be significantly undersized (you’ve all seen those cars on the road with rear tires about ready to pop under the weight of a palette of heavy goods tied on top). 

More Resources Matter for 4 Socket MP Workloads:
The apps/workloads listed above benefit from the expanded feature set associated with 4 Socket Xeon 7400 based servers: more processors (4 vs. 2), more cores (24 vs. 8), more memory (32 dimms vs. 18 dimms), more I/O capacity (7 slots vs. 4) and larger cache (16MB vs. 8MB).  These features and what they enable are why MP Server buying patterns have remained stable with IT for the last 5 years and will continue to be stable for the foreseeable future according to IDC. 

But in today’s economy there may be MP customers out there that will want to push the envelope and attempt to deploy lesser expensive 2S systems for traditional 4S solutions. Would doing so pencil out from a TCO perspective? Let’s take a look at two Virtualization usage examples and find out.

Large Scale Server Consolidation: Where almost 2x the memory matters.

In this scenario, IT Manager is dealing with numerous corporate acquisitions across the country prior to the economic downturn, with servers that now need to be consolidated to cut costs quickly.  Goal is to convert 1000 older underutilized 2S servers.  He (she) converts these to 1000 VMs and transfers them electronically to the central Data Center.   He determines that these infrastructure apps when consolidated generally run into memory constraints before they run into processor constraints, so for his candidate solutions he compares a 4 Socket Server with Xeon X7460 processors vs. a new 2 Socket server with Xeon X5570 processors.   He fully loads both systems with 4GB dimms (128GB on 4S vs. 72GB on 2S), and assigns 4GBs memory for each VM deployed (enabling 32VMs per server resulting in 31 new 4S servers vs. 18 VMs per server resulting in 56 new 2S Servers.)

Now, he only propagates the 4S Solution with 2 Xeon 7400 Processors, which allows the IT manager to still use all 128GB of memory on the 4S Servers while paying lower VMWare licensing costs.  Price these systems out on Dell, HP, IBM’s or Sun’s website, and the Xeon X7460 servers will be in the $15k-$20k range vs. the Xeon X5570 based servers will be in the $10k-$12k range (i.e. roughly 1.5x higher for 4S vs. 2S server).  Add VMWare license costs, power/cooling, LAN/SAN cabling, and system maintenance costs and you’ll see the 4S solutions offer a lower cost per VM.

Virtualizing Business Critical Workloads: Where 3x the Processor Cores matter.

In the previous example, we were looking to maximize consolidation ratios.  In this example, we’re looking to achieve predictable high performance for a business critical app.  Solutions like ERP that are put into a virtualized environment perform best when run without oversubscription, where you set the same number of virtual CPUs to equal the number of physical cores available on the platform.  This helps deliver relatively more predictable performance for all VMs and is the way that IT@Intel intends to deploy ERP in a virtualized environment as they begin to test this moving forward (read more about this in the new whitepaper).  In this example, we’ll convert ~100 non-production ERP instances (i.e. the instances used for QA, Dev, and Production break fix).  We’ll assign 2 virtual CPUs and 8GB memory for each instance.  The four-socket Xeon 7400 processor based systems (with 96GB memory) will have a total of 24 cores and will have a list price of about $25k.  This allows us to run 12 Virtual Machines without oversubscription on the MP Servers and enables 100 ERP instances to be consolidated down to about 8 MP (4 Socket) servers.  Since the Xeon 5500 based Servers just have 8-cores, the IT manager decides to avoid oversubscription and deploys 4 virtual machines – consolidating down to 25 DP (2 Socket) servers with 32GB Memory and a list price of about $8k per server.  Include the costs of the hardware, VMware ESX license costs, power/cooling, cabling, and Server maintenance – the MP (4 Socket) solution here would also offer a lower cost/vm than the Xeon 5500 based DP (2 Socket) solution due to having 3x the processor cores on 4 Socket.

When you are deploying your most data demanding enterprise applications and implementing large scale server consolidation, Xeon 7400 based servers represent a very intelligent choice. 

Let me know what you think.

bryce

2 Comments Permalink
0

I'll be up front, I really don't know what Brittany Spears, Miley Cyrus or Susan Boyle would say about moving from RISC to the Xeon 5500 processor!. What I can share is the feedback that I'm getting direct from customers. I'm currently out on the road and have got some real feedback direct from customers on why they are looking at migrating their solutions from RISC  processors to Xeon processors.

 

Over the past couple of days I have had the opportunity to meet directly with individual customers and hosted a roundtable with several customers to discuss their plans to replace their RISC based infrastructure. The conversation has been very open and frank and has not been about 'should I move' but more focused on 'how do I make the move'. As could be expected the down economy is placing big taxes on the ability of IT organizations to support their business units need for organic growth in a flat to down IT spending environment. A big priority for most of the customers that I spoke with is how to reduce their overall TCO while still meeting the increased demands being placed on IT by their business Partners. Most of the customers are already engaged in active projects to assess moving from RISC or are building their plans to make this migration.

 

During the roundtable I had opportunity to share the latest Xeon 5500 processor performance comparisons Vs the main SPARC and POWER based solutions out there. There was great rejoicing and joy (ok I'm taking poetic license here) in the roundtable when we share some of the results that we highlighted when we launched the Xeon 5500 processor just over 3 weeks ago. So I want to spread the joy and let you read for yourself the performance and price performance benefits.

 

We compared the Xeon 5570 processor vs the top UltraSPARCT2+ in a 2 socket configuration. We took best published results on spec.org and sap (so no funny games at play). The results comparing best UltraSPARCT2+ vs best Xeon 5500 with 1 taken as baseline for SPARC redults were amazing

- 20% better on SAP-SD

- 62% better java performance for Specjbb2005

- 69%better for integer performance SPECIntrate-2006

- 75% better for floating point performance SPECfprate-2006

But the best bit was the cost competitiveness of the Xeon 5500 solutions. Comparing both solutions with 32GB memory, the Xeon 5500 based solutions are offered at approx $11,000 whereas the UltraSPARCT2+ is at $36,000.

 

Compared the Xeon 5570 processor vs the top POWER6 in a 2 socket configuration gave even more staggering results. At the roundtable today customers were amazed. They keep hearing that POWER 6 has leading performance and more GHz so better performance. Right?. Wrong is the answer and I noticed many customers scribbling down the comparisons. Again taking 1 as baseline for POWER results

- 150% better on SAP-SD

- 190% better java performance for Specjbb2005

- 126%better for integer performance SPECIntrate-2006

- 90%better for floating point performance SPECfprate-2006

But the best bit was the cost competitiveness of the Xeon 5500 solutions. Comparing both solutions with 32GB memory, the Xeon 5500 based solutions are 92% less expensive than equivalent POWER 6 offerings.

 

I only shared the specific comparisons vs RISC and have not gone into the architectural advancements of the Xeon 5500 processor and how it addresses real business needs that have been flagged to us. There have been lots of other blogs out in cyberspace over the last few weeks on improvements in IO, low latency etc. so you don't need my 2 cents.

 

I think now is the time to make the move from RISC, what do you think?

0 Comments Permalink
1 2 3 4 5 ... 7 Previous Next

Filter Blog

By author: By date: By tag: