The Server Room Blog

15 Posts tagged with the power tag
1

Everyone is talking “green-energy” and “power-efficiency” these days. Reducing carbon footprint, renewable energy, CFLs, solar power, biking instead of driving, etc… the list goes on forever. Many people are excited to do something to change power consumption, but as a server administrator - are the proper tools in place?

Many of you have probably experienced the power/efficiency example at home. When the summer gets hot - many of us run to the thermostat and set it accordingly. When it's REALLY hot outside, we tend to twist the dial cooler - knowing all along, that our electric bill will most likely be higher at the end of the billing cycle. So, what do we do?

Some of us just live with the higher bills, some of us turn off the A/C and struggle in the heat - but I'd hope that most of us set the thermostat to a 'livable' temperature - it may not be the coolest, but it's enough to do the job and keep the electricity bills at a more moderate level - in a sense, it's a happy medium. In today's modern age, thermostats are programmable - taking a lot of the guesswork out of our hands and automating many of the old day-to-day temperature functions that our parents had to follow... Intel server platforms are evolving in this realm as well!

http://softwarecommunity.intel.com/UserFiles/en-us/Image/psteinb/DCM_Assets/Dyn_Power_DCM_marquee.jpg

As a server admin, do you have the tools and technologies to reduce power consumption? There are several avenues addressing this issue, and I suggest reading the post from Lori Wigle on Data Center Efficiency. The datacenter is different from the desktop… server admins aren’t likely to enable sleep states to save energy – but rather, increase utilization on fewer servers to maximize your performance output in relation to your server footprint.

When was the last time you looked at your server’s power footprint? Do you even know how much power you’re using? Some of you may have some power meters and can monitor a server (or a few servers) at a time… but how many of you can monitor a rack or servers or a datacenter?

What if this capability was built into your current generation Xeon server platform? The good news is that modern processors DO have power management capabilities. Based on the ACPI specs:

P0 Performance State
While a device or processor is in this state, it uses its maximum performance capability and may consume maximum power. Thereby the processor uses it's maximum power allocation.
P1 Performance State
In this performance power state, the performance capability of a device or processor is limited below its maximum and consumes less than maximum power.
Pn Performance State
In this performance state, the performance capability of a device or processor is at its minimum level and consumes minimal power while remaining in an active state. State n is a maximum number and is processor or device dependent. Processors and devices may define support for an arbitrary number of performance states not to exceed 16.

Each Pn State is a "notch" in the processor's performance powerband (as seen below)


http://softwareblogs.intel.com/wordpress/wp-content/uploads/2008/05/pstatepeakenergy.jpg
As these performance notches are set, the processor will lower it's power envelope and reduce the power needed in order to save energy. Just as a note, EIST must be enabled in the BIOS for this performance enhancement to work on your platform.

If you attended Intel’s IDF (Intel Developer Forum) you may have run into a few demos in regards to Datacenter Power Management, my booth showcased 4 current generation Intel Servers based on Bensley/Starlake Xeon DP boards and Xeon 54xx Series (codename Harpertown) Processors.

Here’s a quick video showcasing the demo – and just a note - we’ll be redoing this in a higher-quality format soon – so stay tuned!
Hopefully if you’ve watched the video – you’ve got some questions! The good news is that we have a new website from the Intel Software Network that is focused on Intel® Dynamic Power Datacenter Manager. The site lists the features, system requirements, downloads, and FAQ to get you started!

I’m looking forward to your feedback and questions!

.

1 Comments Permalink
2

Hank Lea and myself (Jason Davidson) will be covering the Eco-Technology debates at the Marriot Hotel in San Francisco on Monday, August 18th. We will also be hosting a blog talk radio show around this event at 5:15 PM.

In my tenure at Intel, I have had the pleasure of walking into major companies, educational institutes, non-profits, and government agencies to talk technology with many great people. “How green is this solution” is a topic on many minds lately – no matter which topic of discussion. Being an engineer by trade and scientist by education, I will typically dive into the details of around each component’s power consumption and the discussion ends with some simple math multiplying a number of units by their thermal numbers. However, there is so much more to the overall impact, and as I walk in and out of these locations, I am always amazed at the number of larger issues with much larger impacts that are unresolved or overlooked. For more information on these items, here is a blog.

The Eco-Technology Great Debates provide a unique and entertaining forum to expand your understanding of today’s most pressing data center and IT issues. Come hear industry leaders take up both sides of some of the hot topics facing
the industry.


Attendees will learn about the pros and cons of high-density computing versus low-density computing and ready-to-use container data centers versus traditional brick and mortar data centers. There will also be a panel discussion on energy efficiency metrics, which will take a look at everything from chips to cooling systems and how they play a role in energy efficiency.


The energy consumption of servers and data centers has doubled in the past five years and is expected to almost double again in the next five, costing about USD 7.4 billion annually.1 There is no single right answer on what to do about this critical situation. Take an active step in solving this challenge by attending The Eco-Technology Great Debates and IDF at a special money-saving price. Register for IDF now and enter promo code CLOECOT (admission to the Eco-Technology Great Debate and a 2-day pass to IDF) or enter promo code *CLTECOT*(admission to the Eco-Technology Great Debate and a full conference pass to IDF). The debate takes place at the San Francisco Marriott Hotel (located across the street from IDF).


Eco Debate Leaf Banner.jpg
1 [EPA Reports Significant Energy Efficiency Opportunities for U.S.
Servers and Data Centers (August 2007).|http://yosemite.epa.gov/opa/admpress.nsf/0de87f2b4bcbe56e852572a000651fde/4be8c9799fbceb028525732c0053e1d5!OpenDocument]


2 Comments Permalink
0

Last week, the first part this video series focused on the energy efficiency benefits of 45nm. The 2nd part of this video (below) is focused on the benefits of 45nm for virtualization and the intel processor roadmap including what's next in 45nm processor technology - the Dunnington and Nehalem-EP products

Is this information useful to you? why or why not?

Chris

0 Comments Permalink
0

Why 45nm ... What's Next

Posted by C_Peters Jun 4, 2008

Following a recent interview I conducted with the Register on a related subject, I was asked to talk more about Intel's current 45nm technology and our roadmap for new technology later this year. Join me in a two part video series where I discuss 45nm and beyond.

Part 1 (below) discusses the technology and benefits that 45nm xeon processors deliver for IT today.

Tune in next week to hear Part 2 - what we have planned for future enhancements to today's xeon products - the Nehalem Processor and Intel QuickPath architecture.

Chris

0 Comments Permalink
2

Have you ever asked yourself that question when you are bombarded with marketing messages from multiple different companies on why choose their products vs. a competitors product?. As a non-Engineer in an engineer centric company, I certainly have thought about this several times and asked myself a very simple question - Why should I choose one architecture type over another offering?

I suppose the best place is to start at the beginning and try and decipher the acronym soup of RISC, x86 etc. I decided to use my ‘old friend’ Wikipedia http://www.wikipedia.org/ to help with this process. What I found was another alphabet soup that I could have researched for hours, but try and simplify it below. I attach my detailed definition findings at end of this blog.

Simply put, RISC (pronounced risk) is a CPU design to use simplified instructions to execute very fast thus providing higher performance. x86 is a generic term that refers to the instruction set of another CPU architecture. So basically both RISC and x86 are types of instruction sets linked to CPU architecture.

So which one should I choose?.
Call me old fashioned, but as a business guy, it always comes down to 3 basic tenets in terms of making a decision
1) I like choice and the ability to pick and choose between multiple suppliers to get the best deal to meet my needs.(and the ability to change supplier without major obstacles)
2) Performance is really important. The higher performance means that I get my work done quicker which reduces the overall cost / improves time to revenue and ultimately improves the productivity of my business
3) System cost and total cost of ownership are key decision points in today’s era which is vastly different from the ‘dot.com’ boom. It is all about managing the bottom line through good decisions around CAPEX and OPEX spending

I applied my decision criteria and quickly found out that there is not a lot of choice from a hardware and operating system perspective with RISC architecture. In fact it looks quite the opposite of choice which always concerns me, call me pro-choice if you like, but I like the ability to move around suppliers!. On the other hand I found x86 to have lots of choice with many hardware vendors to list and a range of operating systems from windows to Linux and Solaris.

Having choice out of the way, I then moved onto performance for my business and looked at published results from many hardware vendors on different websites like http://www.spec.org. what I found was that Intel based systems had a lot of leading results against architectures like SPARC from SUN or Fujitsu and POWER from IBM.

I then looked at price (and being an ex-Accountant in my past career) nearly jumped for joy when I saw that system prices were low for x86 systems compared to the comparable RISC systems.

This analysis helped me understand it better and helped simplify my decision making.

Here is a short video with a little bit more detail. I would be interested in your thoughts and have you had any similar experiences that you would like to share.

2 Comments Permalink
0

45nm and Beyond

Posted by C_Peters Apr 23, 2008

Technology moves at such a rapid pace - it can often be mind-boggling. Even working directly with the product teams at Intel, I sometimes have difficulty keeping pace. The good news is that there is a tremendous opportunity today to be captured thanks to this rapid innovation, as well as a steady stream of advanced technology that IT can use to better support business and gain a competitive advantage. Recently I was interviewed by Tim Phillips from the Register about the current 45nm Quad-Core Intel Xeon products and the next generation Intel platforms based on the Nehalem processor.

A few years back, Intel fundamentally changed the way we design and develop our underlying micro-processor technology. We streamlined our innovation and accelerated it's pace. Internally, we call this new model Tick-Tock. I like to call it shrink and innovate.

A "Tick" is a manufacturing process shrink that delivers smaller silicon with higher speeds, more transistors and lower power consumption (example: moving from 65nm to 45nm process technology). The 45nm quad-core xeon processors (available since Nov '07) utilize unique materials (a high-k, dielectric) that are delivering industry leading performance / watt as measured by the industry's first and only standard benchmark, SPECPower
A "Tock" represents a more extensive architectural innovation (ex. Intel Core Microarchitecture) introducing new micro-architecture features and functionality fully utilizing the higher transistor count set up by the shrink. For Intel Xeon-based servers, the next "tock" is Nehalem. In addition to the new micro-architecture based on 45nm, a system re-design will incorporate next generation memory, I/O and virtualization technology for high performance, high bandwidth solutions compatible with today's leading software solutions
Listen to my podcast interview to learn more about the benefits of using today's products and the timing of next generation Intel technology featuring Nehalem. Is this information useful to you? If so ... how? Have any questions?

I'd be happy to hear from you. Chris



0 Comments Permalink
0

This is part three - the implication being that it is a sequel to part one and part two. It is. That said, each of the sections have their own messages and may or may not help your data center. The first part talked about the benefits of bringing in the latest hardware. Intel has been delivering performance increases at a pace beyond "Moore's Law". Getting rid of old, slow, inefficient servers can give you 2-12 times the capacity instantly. The second "episode" talked about getting everything you can from each server. Use virtualization and consolidation to make sure your servers are full and busy. The most efficient bus is a full bus ( this is a metaphor, I am talking about the big yellow things carrying students, not the circuitry in the box )

My focus in part three is on density. My operating premise is that the data center manager wants to get everything out of the current data center and avoid, or at least defer, construction of a new data center. If your in the data center construction business, this is not for you.

To get the most out of our data center we want to pack every server we can power into the space. You can do this by executing three actions. 1) Use every watt, 2) Build the right servers, and 3) Optimize HVAC. In many cases twice the servers can be crammed into the existing rack space even without adding power. If you are able to redirect your hvac power savings to your racks, your results could be even better.

So, we potentially got 5x capacity from new quad core servers, 5x capacity from boosting utilization with consolidation, and 2x capacity with higher density. My math says 5x * 5x * 2x = 50x the capacity ( in the same space and power!) video

0 Comments Permalink
4


Today, Intel launched 50W low power versions of the 45nm Quad-Core Xeon processors (the L5400 series).
The 2 new SKUs are listed below:

Quad-Core Xeon L5420 2.50 GHz, 12MB L2, 1333MHz
Quad-Core Xeon L5410 2.33 GHz, 12MB L2, 1333MHz

These products offer IT and business users 2 primary benefits:

  • 45nm 50W quad-core brings 25% improved performance over previous generation 65nm 50W quad-core processors
  • They also run 30W cooler than mainstream 80W quad-core processors delivering the same performance at the same frequency.

We have seen strong interest for these 50W quad-core products and I'd like to hear from you on where you would use low power quad-core and why?

4 Comments Permalink
1


Datacenter Power Management: Power Consumption Trend

Jackson He

As the internet services grow and the more users embracing internet - approaching 1 billion connected users, one of the biggest challenges for data-center operators today is the increasing cost of power and cooling as a portion of the total cost of operations. As shown in Figure 1, over the past decade, the cost of power and cooling has increased 400%, and these costs are expected to continue to rise. In some cases, power costs account for 40-50% of the total data-center operation budget. To make matters worse, there is still a need to deploy more servers to support new business solutions. Data centers are therefore faced with the twin problem of how to deploy new services in the face of rising power and cooling costs. In a recent survey of data centers 59% identify power and cooling as the key factors limiting server deployment.

blog2-pic1.GIF

Figure 1: IDC Report of data center cost structure and trend

At the same time with the increased energy cost and awareness of global warming, there is increased regulatory scrutiny around both idle and max power of servers and clients (desktops and laptops). The "green awareness" datacenter is no longer a "nice to have" feature, but of necessity of business operation and environmental regulatory compliance. Figure 2 highlight the world-wide existing and emerging regulations on power and energy consumption. Future datacenters have to be able to clearly measure and proof regulation conformance in order to operate properly.

blog2-pic2.GIF

Figure 2: Existing and emerging energy and power regulations

To sum it up, the power management trends for future datacenters are multifaceted and will not be covered by a single company or a single business segment. They could be summarized in the following areas:

  • At environment level: conform to increased government regulations on energy and power and increased power constraint (limited available power) - need innovative ways to conform "green datacenter" regulations, while deliver great values to business.
  • At the datacenter level: more computing power is needed with increased demand; emergence of mega datacenter and modular datacenter (datacenter in a container); the overall power and cooling distributions need to match the increased need - new datacenter designs and power/cooling management needed.
  • At rack level: higher power density and higher server density per rack is needed to pack more computing power for a given space and cooling; workload balance between racks to increase power efficiency and overall datacenter reliability - need effective rack-level power and cooling monitoring and dynamic management capabilities
  • At server level: need lower idle and max processing power, so that platform power consumption trend is more linear with platform performance; dynamically adjust power consumption based on policy and workload - need more server-level instrumentations for power/cooling monitoring and more control knobs to dynamically optimize power and performance.

I hope you agree with me of the overall datacenter power management trends at datacenters in the coming year. These trends pose challenges for each of the areas listed above. These challenges also mean opportunities for innovative solutions to thrive. I'd like to listen to your feedback about these trends. I will talk more about challenges and potential solutions in the upcoming blogs. You are welcome to share your thought of where you believe the datacenter power management is going. Thanks a lot.

1 Comments 0 References Permalink
0

Every now and then a colleague, customer or acquaintance sends me a link to an article or blog that usually either features our products or those from one of our competitors. More often than not I get a lot of repeat sources (The Register, The Inquirer, CNET, etc…). The blog that comes my way most often is one from George Ou at ZDNet. One of his most recent blogs (A comparison of quad-core server CPUs) shows a bunch of our latest quad core CPUs and how they stack up against our previous versions as well as those from AMD. I won’t rehash the article here aside from saying it was positive for Intel and to say AMD’s issues with their quad core processors have been well documented.


Is Intel winning now because our products are superior? Are we winning because our competitor is struggling? Do these benchmarks mentioned in George’s blog tell the whole picture? As you can imagine we constantly ask ourselves these questions and many more internally. Our conclusions are that for processors and server platforms, as long as we provide leadership along several key vectors then our market share and overall market position will improve.

Manufacturing process, processor architecture, system architecture, cache size. These are four critical vectors that we have direct control over when we are making design and enabling decisions. At times in our past and in the present we have had leadership on all four. In those times we have won hands down. There have also been times where a competitor has chosen to focus on one or two vectors and that has led to their products being better for a specific area. The four vectors above are things that Intel focuses on but we always have to keep an eye on what end user value they deliver.

Our customers tell us they care about three main things; Price, Performance and Power. The three P’s. George’s blog shows that for one of the P’s (Performance) Intel has leadership, particularly on integer and floating point. There are similar looking examples for database, virtualization and pretty much any performance benchmark we have looked at recently. Thankfully for Intel, Performance is the “P” with the strongest correlation to success in the server market from a MSS perspective. We are also doing some amazing things with regard to Power. Some have been launched already and some will be coming soon with new products in 2008. The market is segmenting and we now make CPUs, chipsets and networking components that help OEMs build platforms targeted at high performance computing, mainstream enterprise, blades, workstations and emerging markets. Each has unique requirements with respect to the three P’s and one size no longer fits all.

I believe that overall George’s blog highlights the success that we are having today. I also think that there will be a steady stream of innovations that will be delivered in 2008 and beyond that will cause us to rethink how we deliver performance at the most efficient power level for the best possible price point. Virtualization, utility computing and charge back models for datacenter environments are all stepping up to take center stage. We all must innovate or become irrelevant…technological evolution waits for no one.

Shannon

0 Comments Permalink
2

I have been working as a full-time performance engineer at Intel for 6 years. I started by benchmarking server products for performance validation and now I focus on the TPC-C and TPC-E OLTP server benchmarks. I have used a variety of workloads in this job and spent time optimizing each level of the performance hierarchy: application, system, and processor. I, like many of you, have learned the "tricks of the trade" the hard way: by trial, error, and success. I'm sharing now, so you can all benefit from the things I've picked up along the way.

Let's start with some general methodologies to follow when tuning performance, whether you do it full-time, as a hobby, or just in your spare cycles after getting your "regular work" done. I will follow up with a more detailed post on each habit individually.


1. Ask the right question: Why are you tuning your platform? What level of performance are you hoping to achieve? What do you (or your users) care most about: raw performance, cost/performance, performance/watt, or something else?

2. Start at the top: The first and easiest part of your application server to tune is the hardware itself. Move on to the software and workload only after you feel confident that you have removed any system-level bottlenecks.

3. Know your Platform: This should be where you begin your system (hardware) tuning. The first thing, which I can't stress enough, is to get a block diagram of your platform. Then study it!

4. Know your BIOS: Server BIOSes these days come with more and more options. Be sure to give your new platform's BIOS a once-over. Pay particular attention to options relating to performance and power.

5. Know your Workload: To quantify performance, you need a workload! Some examples: web server response time, boot time, frames rendered per second, simultaneous connections supported, etc. Understand as much as possible about how the work gets done.

6. Try one thing at a time: Little changes that seem harmless can significantly alter the behavior of your system. Or worse, they can interact with each other to wreak havoc. Always try one change at a time, and for goodness' sake, do habit number 7.

7. Document and Archive: When you change something, log it! For each experiment you do, store your hardware and software configuration, performance level, and any collected data.

8. Use the right tool for the job: There are free data collection tools out there for various levels of the tuning process. System tuning tools include such as Performance Monitor for Windows or Sar for Linux. Application-level tools include Intel ® VTuneTM for both Windows and Linux.

9. Don't break the law: Amdahl's Law, that is. Amdahl's Law tells us the maximum amount of performance improvement we will get from a particular enhancement. Amdahl can help you set your expectations properly and clue you in to when you should be suspicious.

10. Compare apples to apples: Todd Christ reminds us of this habit in the last paragraph of this post. Don't compare the performance of mis-matched systems. If you must do it, know exactly what the differences are: the processor, memory type/speed/vendor, a software component, chipset, etc. Dig into the configuration details!

So now you have the high-level list! Stay tuned to The Server Room for more information about each habit in the coming weeks.

2 Comments Permalink
3

Continuing on the theme of measuring Data Centre efficiency - power consumption of the facilities and IT load are only one element albeit a large one - that contributes to the overall efficiency of a data centre. Ultimately a DC has to deliver useful workload and the amount of workload that can be achieved within a given physical DC is an increasing challenge. Lowering server power and increasing the cooling effectiveness of a DC are one of several ways to enable more equipment to be installed into an existing facility.

General consensus seems to be that the servers in many data centres do not always run a maximum utilisation - many are in the 10-15% utilisation range. This results from many IT shops following a policy of hosting one workload ( application ) per server and sizing the server to support worse case usage of that workload - this leads to low average utilisation of the servers. There are several approaches that can be taken to increasing the server utilisation

Consolidating several applications onto the same server that have different mixes of utilisation - this is not perfect as a problem on one application could impact the others on that server causing significant business impact

Deploying virtualisation within the DC - this enables multiple OS/App instances to be run on the same server. There are multiple benefits here in that the server utilisation increases whilst the number of servers could potentially be decreased so reducing the overall electrical power consumption of the DC and consequently the utility bill. Another aspect of virtualisation is that to achieve the highest levels of consolidation it is best to deploy the latest generation high perf/low power servers, this can result in the removal of many older generation high power servers from the Data Centre and the deployment of a smaller number of newer more power efficient servers

There are circumstances where virtualisation may not be appropriate and it is necesseary to retain one workload per server - in this case an increase in the workload capacity of a DC can be achieved by replacement of older smaller servers with the latest generation high performance servers - this can enable the workload capacity of a DC to be significantly increased without building a new DC, again the side benefit here is that latest generation servers consume less power than the older servers they are replacing.

There are many different ways in which the workload capacity ( and hence utilisation ) of a DC can be increased , with care most can also result in a reduction in the electrical power consumed by the DC.

Given the right tools the utilisation of servers within a DC is 'relatively' easy to measure, so this element of DC effectiveness can be quantified. There is another major element that I believe contributes to the effectiveness of a DC - that is the processes that are in place to manage the DC and hence the way a DC can respond to the new challenges placed on it by a business unit. Gartner have an infrastructure maturity model that is useful to try and quantify how effective a DC is in responding to business needs and looks at responsiveness, Service Level Agreements, IT processes etc. Currently I do not believe many DC managers are measuring how effective their DC in terms of process and when asked to judge where they sit within a model like Gartner's many IT managers will judge themselves more efficient than they really are.

Are there other areas that contribute to the efficiency of a DC - I would be interested in your feedback.

3 Comments Permalink
2

In the second comment around the right time for datacenter refresh, I'd like to look at Costs. Power is covered in the comment from Chris and I covered some comments on Space already in the discussion forum. So what it really boils down to is cost of running your existing datacenter versus the costs of throwing the servers out and replacing them. It is clear also from the other comments, that it doesn't make sense to throw out servers which are utilized in average 15% and have them replaced by new servers, which are 5 times faster and utilize the servers 3%... Great achievement hu?... Server Refresh makes therefore most sense to do only when consolidating the environment. How do I consolidate the environment? By using virtualization. See Helmuts blog and the whole theme next week on that topic.

Therefore let's look at the real cost factors, when refreshing the servers:

  • Cost of new hardware: That is obviously a significant capital expenditure and starting at about 2000$ for a reasonable DP server. But the trick is also that a lot of server companies offer financing models which make this an operational expenditure. But key is also to understand, that by consolidating your servers at the same time the depreciation costs of the servers may actually decrease, as you have less hardware to depreciate!
  • Maintenance costs: Again, reducing the number of servers running given applications, and at the same time unifying the environment helps significantly to reduce the maintenance costs. This can be a significant step in unifying on a given OS or hardware platform.
  • Power consumption: Similar to utilization, it doesn't make sense to just look at the power consumption by server, but at the consumption by performance and therefore I can save about 38% in power bills, on a given workload vs. the previous generation hardware and about a 10^th^ of the power of hardware which is 2-3years old. Again, obviously only, if I do this in combination of consolidating the servers. Trick often is, that those costs are often not taken into consideration, as those are not billed to the IT department but to the facilities group. So it becomes an executive decision to ensure they are looked at!.
  • Switching costs. Obviously very hard to measure, as this depends on the environment of the customer. And I talked to the customer who said: "No I will never touch this AS400 system, as it just runs and runs and runs." On the other hand I had a customer who replaced just those AS400 systems and saw huge synergistic effects, because he put the application on a standard based architecture and was able to finally integrate it in the other production system and therefore have one reporting and analytics tool.

I try to make a long story short. This is not something you do very often, but you don't get married every year either. But most of the time it's worth going through the efforts. So thinking about replacing the servers which are older than 2-3years is definitely worth while and often an effort which pays off in the first year!

2 Comments Permalink
0

As this is the first time posting here, here is a quick intro, I started out as a hardware designer for a UK computer company - back in the days when the PC was still a grey tin box with a 4.77MHz 8088 inside. I have been with Intel now for more years than I care to think about, with much of this time working with the OEMs and end-customers focused in the server market across EMEA.

As I trawl thru the press and listen to the industry analysts one topic that everyone is discussing is 'data centre efficiency' ( even elsewhere on this forum Intel IT Data Center Efficiency Initiative - Going Green, Data Center Efficiency ) but what's not real clear is what defines an efficient data centre - is it the efficiency of the servers, the cooling subsystems, the workload that can be handled in a given time or the operational processes that are in place to run the data centre ? And once you have decided what is considered 'efficient' how do you measure or quantify this efficiency.

Currently there are several approaches being considered by the industry to measure data centre efficiency, and I thought it would be worth spending some time looking at three elements that can affect DC efficiency - power, utilisation and process. Given the complexity of the topic I plan to take this in bite sized chunks ( rather than write a mass of text and lose the thread ). So, in this blog I will cover power and will come back to the topic in a subsequent posting to look to the other elements. If you think there are elements to DC efficiency that I am missing please feel free to chip in and provide your insights.

Power Efficiency - Measuring the ratio between the facilities load - cooling, power conversion etc vs. the IT load - compute/storage/infrastructure. Typically this approach focus's on the ratio of electrical power consumption of the various elements within the data centre. With the current focus on the 'environmental & green' aspects of data centres this seems to be the area where most of the attention on Data centre efficiency is focused.

If you look at the average Data Centre today its not just the compute infrastructure that consumes the Watts, power gets consumed by the cooling systems and air conditioners, voltage conversion & battery storage, lighting etc. All this contributes to the 'facilities load' - for many IT managers this does not hit their IT budget and they may not even see the power bill from the utility company so have no idea how much power is consumed by these key elements of their data centre. Current estimates indicate that upwards of 50% of the power that comes into the average data centre gets 'lost ' in the facilities load, more details here & here

There are several groups looking to quantify energy efficiency The Green Grid is working on metric called PUE ( Power Usage Effectiveness ) to measure the ratio of power consumed by the facilities load vs. the power available to the IT equipment in the data center - details in their white papers here. Also the Uptime Institute are doing something similar and various government institutions are getting interested as well and there's an extensive US govt white paper ( if you have a few hours spare to ingest its 150 pages) . In addition the European Union is working on a Data Centre Code of Conduct

The server OEMs are also working on a benchmark for measuring perf/watt ( http://www.spec.org/specpower/ ), these are great for measuring how good a server is on a test workload and how many transactions it can deliver for a given power input. With the increased focus on energy efficient performance this metric will become more and more important to the specifiers and purchasers of servers. With Intel's latest generation 45nm quad core Xeon processors we continue to drive up the performance a processor can achieve for a given Watt input, the challenge for the rest of the industry now is to lower the overall power consumption of the other elements within the server and to increase the throughput of the storage and I/O subsystems to complement the increase processor performance. But at the end of the day does a good perf/watt for a server indicate that a data centre is efficient ?

What's missing from this approach is that there is often no consideration made as to the utilisation of the servers within the data centre consequently it might be possible to achieve 'good' power efficiency numbers but have low server utilisation and hence not extracting the most workload out of the data centre. Here in EMEA we have initiated a Data Centre Efficiency Award to try and start to get a handle how best to identify DCs that are running best practices and delivering of power and utilisation efficiency.

I guess the question at the end of the day is do you consider that your Data Centre is efficient and how are you quantifying this efficiency ?

0 Comments Permalink
0


Watt do you care about more?
the Power Consumption of your servers (watts) or the Power Efficiency of your servers (performance / watt)
... or maybe you prefer the Performance per Watt per SqFt argument

http://communities.intel.com/openport/servlet/JiveServlet/downloadImage/1154/power+or+ppw.JPG

I have spent a lot of my time the last several years discussing this topic with IT professionals around the world - and there are a lot of varying opinions.

I believe that Performance per Watt is a better measure of overall value for the data center and server room.
The power consumed by a server is an important measure, but power only comparisons can be misleading.

Example: If server ‘A' consumes 50W less power than server ‘B', then it can save IT $79 per year per server in power and cooling costs (assumes $0.08 kW/hr power costs and cooling costs equal to power costs). Scale that $79 savings per server across a data center with thousands of servers and it can be a pretty impressive number.

However, if a server with 50W lower power delivers lower application performance ... is the power savings worth it? The answer of course depends ... but generally in my experience the answer is a resounding No.

Example: What if server A (the 50W lower power server) underperforms server B by 33% in performance. This means that you need to deploy more ‘A' Servers to get the same performance as ‘B' Servers. In fact, with a 33% performance advantage, you need only 3 ‘B' servers for every 4 ‘A' servers. The higher performance per Watt delivered by server B reduces acquisition costs, reduces power consumption (less servers) and minimizes space and eases manageability. This example is shown graphically above

What do you think? What power and performance metrics do you look at before purchasing servers
... Lower Power or Higher Performance per Watt?

0 Comments Permalink