Home > Intel Communities > Open Port IT Community > The Server Room > Blog > 2008 > June
3

I'm blogging here today from the Intel Premier IT Professional (IPIP) event in Denver, Colorado. This is a really amazing setting at the Center for the Perfoming Arts in downtown Denver. There are some 200 industry professionals here networking and sharing best practices around client and server technologies with some of the main topics including Intel's technology roadmap, security, client and server virtualization. For those who couldn't be here, check the IPIP Website for event details and to download the presentations. In addition to updates on this blog, Josh Hilliker and I will have an event wrap-up on Blog Talk Radio, stay tuned for the details. Check back to this blog for event updates as they occur.

 

Wm. Hank Lea

Community Manager

Open Port-The Server Room

 

2pm- Event Update

 

Here's some cool video of XEON 7300-series(4P)running a database transaction application:

 

 

And another video showing the XEON 5400-series (2P) running the Black-Scholes Option Pricing benchmark:

 

 

And a third demo showing the XEON 5400-series in a workstation configuration running 3D rendering application:

 

 

3 Comments Permalink
2

Energy consumption and energy efficiency issues are becoming more prevalent in the datacenter. This short podcast hosted by the Register provides some insight on topics that IT manager should consider to improve energy efficient performance in the datacenter.

 

 

 

 

http://www.podtech.net/home/5116/energy-consumption-in-the-data-center

2 Comments Permalink
4

Ethernet has been around a long time. It is a highly reliable and trusted means for interconnecting computing nodes, and above that, it has generally been the most commoditize (read: lowest cost) form of interconnect for quite some time. Broad deployment, administrator trust, and low cost have kept Ethernet as the mainstream fabric for LAN traffic for a long time.

 

However, despite Ethernet's strong connectivity credentials, it still comes up short in certain applications. Ethernet is what is referred to as a ‘best effort' network. This simply means that in the real world, you will generally get pretty good performance (throughput, latency, lack of dropped packets, etc), but from time to time when there is congestion, packet drops and performance degradation can be quite a nuisance. For many applications, this doesn't matter. If you are using email, browsing the web, or transferring files to a shared drive, the only thing you will notice is a decrease in performance, but everything will still ‘work', and transfer properly. For some applications like storage though, this non-deterministic performance is unacceptable. If packets are dropped, or arrive out of order, storage applications have a nasty tendency to hang or crash.

 

Because of this limitation of the standard, there have been separate fabrics used for Storage Area Networks (SANs) for quite a while. One of the main fabrics developed and used for high performance SANs is known as Fiber Channel. In order to create a Fiber Channel network, a server and storage target need to support a Fiber Channel Host Bus Adapter (FC HBA) to communicate via the Fiber Channel protocol. In addition, the switches that connect the Fiber Channel infrastructure must also be dedicated Fiber Channel switches; a standard Ethernet router cannot be used.

 

Once in place, this SAN architecture provides a very high performance, high reliability network that is ideal (and required) for high end storage traffic, but it comes at a cost:

 

1) Fiber Channel HBAs are generally more expensive than their Ethernet counterparts.

 

2) You have to have a separate fabric in your network which also adds to your infrastructure (switch costs, and cabling costs) as well as complicates IT management.

 

3) Servers connected to the SAN now need to have an Ethernet adapter AND a Fiber Channel adapter.

 

The upside to the additional cost and complexity is of course better performance, but the question has always been "Is there a better way?"

 

I believe there is a better way, and that Fiber Channel over Ethernet (FCoE) (and importantly, the standards in IEEE that are making it possible) seems to be the logical path to solve the issue of performance on lossless performance on Ethernet, while maintaining Ethernets historical core cost advantages.

 

‘Best Effort' is not good enough:

The bottom line for today's Ethernet is that it simply can't provide the ‘lossless' behavior that storage traffic demands; but this fact is changing. Below I will summarize at a high level some of the standards being developed in IEEE to improve the performance of Ethernet for storage applications, and how they help to mend some of the issues with Ethernet and how that helps to enable FCoE.

 

Bandwidth Sharing, Priority Flow Control and Pause:

This capability offers a method to assign priorities to different Ethernet traffic classes. From there, when congestion becomes an issue, traffic can be ‘paused' on a per-priority basis; allowing the lower priority traffic to be halted temporarily while keeping the top priority traffic like storage running smoothly. This per-priority pause capability is really the first basic step in allowing Ethernet to provide some ‘QoS like' Layer 2 guarantees.

 

Congestion Notification (or Backward Congestion Notification):

In addition to simply pausing individual low priority streams of traffic, congestion notification allows for a communication method to go upstream from the node and notify the offending traffic generator to throttle back its traffic and re-route as necessary. This capability is a key to the longer term development of FCoE because with only the pause capability the congestion is really just pushed up a single node in the network. In order to support FCoE storage across multiple nodes in a network, congestion notification is needed.

 

Shortest Path Bridging:

This capability is really an optimization for inter-node routing that defines the path within the network between switches. Using traditional spanning tree path algorithms will sometimes result in paths in the network that are non-optimal and incompatible with high performance storage traffic. A new algorithm to determine the shortest path between nodes will help to enable both less congestion in the network as well as fast delivery of critical packets for storage.

 

DCB Capability Exchange Protocol (DCBX):

This capability goes by several different names depending on who you talk to, but essentially what it will provide is the ability for switches on the network to exchange their capability sets with other nodes of the network. This allows for each switch to understand what others switches near it can use the Congestion Notification, Flow Control, or other features need to support this ‘Lossless Ethernet' capability.

 

While the list above is not meant to be all inclusive of all the new IEEE development under way for this new ‘Lossless Ethernet' initiative, it should provide a good overview of the general push taking place and how the goal of getting to near lossless performance is going to be accomplished.

 

Weren't we talking about Fiber Channel?

Astute users will realize that I haven't really addressed the Fiber Channel piece of this. The above features I described only allow for Ethernet to carry certain kinds of traffic (like Fiber Channel) that require very high reliability and performance; but how do you get the Fiber Channel data onto an Ethernet frame?

 

In today's environment, a Fiber channel initiator on a Server system will place Fiber Channel data onto an FC HBA to send over the SAN to a storage target. All of this data is transmitted over a fiber channel network. Under the FCoE model, what you will need is a Server system that has an FCoE initiator, and on the target side, the switch connected to the target must be able to convert the data from storage target and encapsulate it into Ethernet. Beyond that, the data is transmitted over the Ethernet fabric as normal, but the features that I described above allow for the performance of Ethernet to allow a Fiber Channel application stack to function properly.

 

This is certainly a capability that Intel has been supportive of. Ethernet is a critical piece of the computing platform, and FCoE provides a potential improvement for datacenter and storage network design. By consolidating the Fiber Channel data onto a single Ethernet wire, end user IT houses can also see several benefits:

 

1) Reduced the need for two physical network cards in each server. Now, a single NIC will connect to the SAN and to the normal TCP/IP data network.

 

2) Along with the consolidation in network cards, you also save in terms of cabling. One 10 Gigabit link can replace the old Fiber Channel fiber link and Ethernet links.

 

3) Reduces power consumption and cooling

 

4) The commoditized and low cost nature of Ethernet provides additional benefit by converging system I/O onto what will likely be the lowest cost interface over the coming years; 10 Gigabit.

 

In summary, FCoE may be in its infancy, but the standards in final, or in process. Products are available today, and the value proposition in here. Further performance improvements and cost reductions and the proliferation of 10 Gigabit networks, as well as more choices in the future, will only further the support and interest in Fiber Channel over Ethernet in datacenter SAN applications.

 

 

 

~ Ben Hacker

 

 

 

 

Links for further information:

http://ieee802.org/

http://www.open-fcoe.org/

4 Comments Permalink
1

Here's the 5th follow-up post in my 10 Habits of Great Server Performance Tuners series. This one focuses on the fifth habit: Know Your Workload.

 

Spend some time getting to know your workload.

 

 

 

 

 

The idea of a "workload" is integral to the concept of performance. The workload is the set of software and tests that you run on the server in order to measure its performance. Also part of the workload is the is concept of the "metric", which means, the number you will use to quantify performance. You should understand as much as you can about your workload in order to characterize and interpret your system's execution.

 

 

Let's look at the real-life example of a car's fuel economy. The EPA measures fuel economy using 2 workloads: city and highway. Each workload tests different aspects of the car's performance, and the metric used to quantify that performance is miles per gallon (MPG). Like the EPA's fuel economy test, a good workload for server performance tuning should have the following three characteristics:

 

  • Measurable - There is a quantifiable metric.

  • Reproducible - Measurements are repeatable and consistent.

  • Representative - The workload should be typical of normal operating conditions and should stress the parts of the system (including code) where performance is most critical.

 

Depending on the usage model for the server(s) you are tuning, some example appropriate workloads might be: loading websites , processing XML, encoding/decoding MP3s, responding to database queries, rendering frames, etc. Metrics could be time to run, number of users serviced, transactions processed per second, etc. If your metric is time, take special care that you are measuring it accurately.

 

After choosing or creating a suitable workload, spend some time getting to know it. Measure the variance between runs. Use O/S and processor-level tools (to be discussed in the blog for habit #8) to sample the workload's characteristics at various points during its execution.

 

 

One thing to remember about sampling is that you want to make your sample interval at least as long as the amount of time it takes to complete a unit of work in your workload. For example, suppose your workload is a stream of web page requests and you are measuring response time. If the longest response time you see is about 2 seconds, then you want to make sure you take samples over 2 seconds in length. It's best to use a multiple of your longest operation time, so 4 or 6 seconds in this case. This way you can be sure your samples include one complete operation in the workload. Then try to determine if the workload is stable - meaning, do the characteristics vary at different times during execution? (If so, you will need to sample more often to understand the workload or possibly split it into phases). Use the data to get an idea of your workload's CPU, memory, network, and I/O usage.

 

 

At the application level, become familiar with the software stack you will use. How is the workload generated (user, clients, test files, etc)? Understand the major operations that occur - what components of the O/S are needed? What device drivers are used? And finally, study the application(s). Know whether the application(s) being tested are single- or multi-threaded and as much as you can about the internals.

 

 

Choosing (or developing) an appropriate workload is necessary for correct performance measurement and tuning. Being as familiar as you can with the workload will help you to interpret your performance data and identify areas for optimization.

 

 

Keep watching The Server Room for information on the other 5 habits in the coming weeks.

1 Comments Permalink
2

Last week, the first part this video series focused on the energy efficiency benefits of 45nm. The 2nd part of this video (below) is focused on the benefits of 45nm for virtualization and the intel processor roadmap including what's next in 45nm processor technology - the Dunnington and Nehalem-EP products

 

Is this information useful to you? why or why not?

 

Chris

 

 

2 Comments Permalink
1

Using some data from our own IT group, we developed a simple ROI calculator. This tool provides an estimate of performance and IT cost savings of refreshing older servers with new ones. Below is a screen shot of the calculator that is now available on our new server tools section of the Server Room. Give it a try and let us know if these assessment tools are helpful?

 

1 Comments Permalink
0

Following a recent interview I conducted with the Register on a related subject, I was asked to talk more about Intel's current 45nm technology and our roadmap for new technology later this year. Join me in a two part video series where I discuss 45nm and beyond.

 

Part 1 (below) discusses the technology and benefits that 45nm xeon processors deliver for IT today.

 

Tune in next week to hear Part 2 - what we have planned for future enhancements to today's xeon products - the Nehalem Processor and Intel QuickPath architecture.

 

Chris

 

 

0 Comments Permalink

Filter Blog

By author: By date: By tag: