Home > Intel Communities > Open Port IT Community > The Server Room > Blog > Tags > 10gb_ethernet

The Server Room Blog

7 Posts tagged with the 10gb_ethernet tag
0

I’m a bit late in relaying my thoughts from Intel’s Developer Forum (IDF), but there was definitely some excitement around virtualization and high performance networking that I wanted to get the word out about!

In the past I’ve shared some details about SR-IOV and the advantages you can gain by being able to present virtual LAN hardware to each Virtual Machine (VM), effectively avoiding the Hypervisor when presenting virtual devices to each VM.  The advantage of being able to do this is clear:  The less interaction in the networking stack there is from the hypervisor, the less processing overhead is required for the system process the data.

That’s all good because if you have a dual 10 Gigabit adapter, you can segregate those two physical pipes into perhaps 16 virtual pipes that get exposed to 16 VMs.  By segregating these LAN pipes at the hardware level with SR-IOV instead of using Hypervisor switching, the performance gains in both CPU utilization as well as maximum total throughput can be very large.  There were several demos at IDF with various configurations, but reductions in CPU utilization of 40% were possible coupled with dramatic improvement in throughput!

But there is unfortunately one minor complication that I didn’t mention in my last post on the topic of SR-IOV.  There is the little fact that when VMs move between physical boxes (a usage that is highly desired and commonplace these days) you run into some problems with this SR-IOV capability.  When the hypervisor owned the network hardware abstraction, the performance was worse, but the functionality was better because you could seamlessly migrate from one box to another and the virtualization application would handle the details.  But with SR-IOV, a new layer needs to be added so that the direct hardware connection between the VM and the LAN hardware can be moved to a new box.

The really exciting part of IDF demos that I saw was the demonstration not just of the SR-IOV functionality on multiple hardware and virtualization configurations, but that these demonstrations also showed updated software from two virtualization vendors allowing mobility of the VMs while supporting SR-IOV! 

There was a demo on Dell systems showing this fully functional SR-IOV implementation with Citrix’s Virtualization suite.  There were two separate demonstrations on Dell systems, with VMWare displaying their new Network Plug-In Architecture (NPIA) solution that allows for the migration of SR-IOV connected VMs seamlessly between servers.

For those hungry for more detail, I’ve included the three SR-IOV demonstration videos here:

The first is the Citrix demonstration on Dell and Intel hardware of SR-IOV with VM mobility:

 

These next two are two videos are demos on Dell and Intel hardware with VMWare and their NPIA software implementation.

 

Each virtualization demo shows the massive performance benefits under various workloads when moving from Hypervisor based LAN segregation to SR-IOV implementation.  But most importantly, each demonstration proves out the capability to migrate VMs between physical hardware.  The only system hardware requirement is that the server itself supports VT-d.  If the networking hardware in the newly migrated-to box supports SR-IOV you get better performance, and if not, the solution falls back on the legacy Hypervisor virtualization.  Backwards compatibility is maintained!

I didn’t get firm details on when this full support for SR-IOV and migration will be available in Citrix and VMWare’s releases, but the demos looked pretty clean, and hopefully these suites will be available soon with this new functionality.  The LAN and Server hardware ecosystems are ready today, and it looks like the software vendors are just around the corner.  Virtualization momentum continues!

While virtualization was the big takeaway for me from IDF, there were also several other interesting demos for us networking hounds.  I’ve linked a couple videos of them below for anyone still thirsting for more of the latest networking technology and performance details!

The first video is a demonstration of Intel’s 82599 10 Gigabit Ethernet-based adapter card with Fiber Channel over Ethernet (FCoE) support.  Storage and Ethernet together at last!

The second video is a demonstration of Intel’s NetEffect 10 Gigabit Ethernet card publishing 1 million messages per second in a simulated NYSE floor trading scenario.  Oh yeah, only 35uS of latency.  That is fast.

So although I am two weeks after IDF, I hope some of you got a little taste of the networking excitement that took place.   Industry wide, hardware and software vendors alike are delivering ultra high performance low latency applications for the financial services industry, as well as mainstream performance increases for virtualization.  The performance and technology beat moves forward.  Exciting times!

--

Ben Hacker

0 Comments Permalink
0

Intel has just launched the Intel® Ethernet Server Adapter X520 family.  These NICs are Intel’s first 10 Gigabit adapter products that support “pluggable” optics.  This additional configuration option gives IT users a great deal more flexibility in how they deploy 10 Gigabit in their servers and datacenters.

 

The X520 Family of adapters support bailed optics that allow the removal or addition of different kinds of optics or support with no optics at all.  For previous 10 Gigabit products, if you wanted 10 Gigabit SR Fiber connectivity, you had to purchase a 10 Gigabit SR adapter.  But with the pluggable X520 adapter family, you can support SR, LR, or simply an SFP+ direct attach cable via the same card by simply removing / exchanging the optics.

 

 

X520Images.jpg

 

 

With the X520 you can still buy an SR or LR fiber configured adapter, but you can also switch back and forth after purchase by ordering only the new optics that you want to support (not a whole new adapter).  In the case of the Direct Attach adapter that supports an SFP+ cage, but comes without optics inserted, you can still use Twin-Ax copper cables to run in the rack less than 7m length runs of 10 Gigabit, but you can also upgrade the Direct Attach adapter later with SR or LR optics as the needs for the particular adapter may change. You can also mix and match optics modules in a dual-port adapter, meaning you could have an LR module in one port and an SR module in the other. You could also throw a Twinax cable into the mix.

 

The Intel® Ethernet optics modules for the X520 family of adapters also support both 1 Gigabit and 10 Gigabit speeds to help with backward compatibility – an industry first.

 

Finally, while this new pluggable capability of Intel 10 Gigabit adapters adds a bit more usage flexibility from an IT perspective, the performance capabilities and advanced features for the datacenter I’ve discussed over the past 18 months are also supported.  The X520 is based on the Intel® 82599 10 Gigabit Ethernet Controller, so the end result is a flexible product that can help unleash server IO performance whether FCoE, iSCSI, Virtualization, Security, or just raw IO performance.  Regardless of your 10 Gigabit needs, the X520 probably has what your Server environment needs.

 

--

Ben Hacker

0 Comments Permalink
0

I’ve spent a fair number of words in the past on the benefits of 10 Gigabit and what it means for the server market.  Through the addition of FCoE and DataCenter Ethernet as well as advanced virtualization features 10 Gigabit seems likely to have its big day in the sun here pretty soon.  But the question is still “When”?


While the proof is ultimately in the raw volumes of 10 Gigabit that ship, and the number of IT users who utilize the higher performance, there are some key reasons to think that 10 Gigabit momentum is accelerating beyond just the numbers* below:

10 Gigabit Forecast.JPG

 

Over the past year, there has been a raft of new 10 Gigabit switch announcements** from Cisco (Nexus 5k/7k), Arista (7100, 7124, and 7148), BNT (G8100), Extreme Networks (Summit X650) Juniper (EX8200), Voltaire (8500) and many others that have increased the choice, and the density of 10 Gigabit switches in the marketplace.   There are now many 48+ port 10 Gigabit switches available and even a few 200+ port models.  Also, the improved density and feature set of certain switches (such as Voltaire’s 280+ port 8500 series switch) provide a path for 10 Gigabit’s ascent into the clustering market by improving port density and latency for clustering applications.

 

Broad acceptance of SFP+ has also helped to drive a rapid improvement in price, density, and power.  SFP+ provides a smaller form factor standard for optics, as well as a standard connection methodology to connect directly from switch to NIC via a Twin-Ax copper (read: ‘low cost’) cabling solution inside the rack (up to 10m).  The widespread adoption of SFP+ form factors has dramatically reduced the entry level price points for switches, and through the ‘direct attach’ copper connection capability it has also reduced the overall cost for initial and ongoing deployments of 10 Gigabit by providing a lower cost bridge to optical or full 10GBase-T support.

 

There are also a few data points to suggest that the Server side cost for 10 Gigabit will also be dropping fast going forward.  As power for 10GBase-T continues to drop quickly, more and more Server vendors are looking at the options available to embedded 10 Gigabit directly into their systems.  This will not likely be a 2009 story, but it is approaching quickly.  Additionally, the acceptance of SFP+ form factors for optics/direct attach cabling has provided a path that some Server vendors may use to design 10 Gigabit down on motherboards without adding the extra cost and power of a 10GBase-T solution.  This looks like a likely near term given that the solution power and design are robust and ready for motherboard based designs today.

 

Finally, the continued cost reduction provides an attractive long term value of standards based 10 Gigabit Ethernet.  There is clear indication downward pressure on 10GbE prices already present today.  We will see 10 Gigabit pricing follow a similar price curve as we saw with Single Gigabit.  This is evidenced in the recent pricing announcement where Intel reduced the cost of single port 10GBASE-T adapter 40% from $999 to $599.  The competitive economics of standards based hardware will continue to drive down 10 Gigabit prices even further and we will see 10GBASE-T pricing below the $500 / port price in the near future. Once it gets on the motherboard, prices will drop even further.


Overall, the power, density, latency, and cost of 10 Gigabit are all improving at a rapid rate.  Form factor flexibility coupled with a wide array of switch and NIC vendors in the marketplace will provide choice and low cost for IT departments while virtualization and convergence in the datacenter and elsewhere continue to provide demands for ever greater I/O bandwidth and performance.

 

--

Ben Hacker

 

 

 

* Del’Oro Forecasts as of Q1 ‘09

0 Comments Permalink
0

What if every server in your virtualized data center was driving 10Gbps of traffic?

My team just completed a test with an end user where we drove nearly 10Gbps of traffic over Ethernet through a single Xeon 5500 based server running ESX4.0. The workload was secure FTP. Our results will be published in the next 30 days. We’ve seen 10Gbps through a server in several other cases (notably, video streaming and network security workloads) but this is first time we’ve really tried to do a 10GB “enterprise” workload in a virtualized environment. It took a fair amount of work to get the network and the solutions stack to work (we had to get a highly threaded open source SSH driver from the Pittsburgh Supercomputer Center, for example, to make it scale). We also found some good value for some of our specialized network virtualization technologies (i.e., the VT-c feature known as VMDQ). But, regardless, by working at it moderately diligently, we got it to work at 10Gbps and don’t see any real barriers to doing that in real production environments.

We also found that the solution throughput is not particularly CPU-bound, it’s “solution stack bound”. That means that workloads that are more “interesting” than virtualized secure FTP and video streaming are likely to be able to source and sync more than 10Gbps/server, too. And, when we get to converged fabrics like iSCSI and FCOE that put the storage traffic on the same network path (or at least the same medium) as the LAN traffic, we’d expect that the application needs for higher Ethernet throughput will increase.

So what? Well, if you buy the fact that virtualized servers can do interesting things and still drive 10GB/s of Ethernet traffic, you have to wonder what’s going to happen to the data center backbone network. If you have racks with 20 servers each, putting out a nominal 6Gbps of Ethernet traffic, each rack will have a flow of 120Gbps and a row of 10 racks will need to handle 1.2 Tbps. I’m not sure what backbone data center network architecture will be able to handle that kind of throughput. Fat tree architectures help especially if there are lots of flows between servers in close proximity to each other in the same data center. But, fat tree networks are very new and not widely deployed. Thoughts?

0 Comments Permalink
2

Sometimes the next step up is a big one. The Intel® Xeon® processor 5500 series (formerly codenamed “Nehalem”) is one of those kinds of steps.

Over the last few years 10 Gigabit has started to take off, but there have always been some negative mutterings: “Why do I need 10 Gigabit?”, “Why do we need this much bandwidth?” or “My server can’t support 10 Gigabit per second bidirectional traffic anyway.” Despite the volume of 10 Gigabit products shipped, there is still the reality that if you intend to use the entire 20 Gbps (10G both directions) or heaven forbid you try to use 40 Gbps with a dual port product; you will likely be sorely disappointed with the results.

 

The reason for this is simple. Most current mainstream servers and 10 Gigabit products don’t support the intense usage models needed to drive that much network I/O and they also don’t have the memory architecture to unleash the full potential of dual 10 Gigabit links.

 

Luckily, that all just changed with Intel® Xeon® processor 5500 series.

 

In addition to the great processing improvements that the Intel® Xeon® processor 5500 series brings to the table, Intel has also introduced our third generation 10 Gigabit product, the Intel® 82599 10 Gigabit Ethernet Controller which provides two ports, and new capabilities and enhancement to the 10 Gigabit product landscape that help unshackle the new processor from its predecessor’s network I/O handcuffs and unleashes blazing performance in a variety of usage models. These improvements, coupled with the new architecture of the Xeon 5500 provide a symbiotic processor-networking combination that makes new usages possible and expands server and datacenter computing by a big leap… not just a baby step.

 

One of the key changes with Intel® Xeon® processor 5500 series architecture is a step function improvement in the internal system I/O. The new local memory controller design, faster cache architecture, and support for DDR3 help push Xeon 5500 to be able to support peak memory bandwidth of ~32 Gigabytes, per socket. In a dual socket system this provides for ~64 Gigabytes of bandwidth which is dramatically more than the previous generation server configuration. In addition, the new Intel® QuickPath Interconnect (Intel® QPI) improves the speed both for inter-Processor communication as well as a faster path to the I/O hub. Finally, PCI Express* 2.0 I/O Bus support has been added to improve the entire data path from Processor to the 10 Gigabit Ethernet link.

 

Taken together, the above improvements are a performance game changer for 10 Gigabit Ethernet.

 

The chart below** shows the previous generation Intel® Xeon® paired with the previous generation Intel 10 Gigabit Ethernet Controller compared to the latest platform using the newest Intel silicon for both processor and networking. Not only is the performance better in 1-4 port configurations, but the performance scales dramatically better to above 50 Gigabits per second of total LAN throughput in a four port configuration vs. *just* 17 Gigabits on the previous generation! A complete platform architecture solution makes this huge improvement possible.

82599 + Xeon 5500.jpg

Now, it’s great that Intel® Xeon® processor 5500 series coupled with the Intel® 82599 10 Gigabit Ethernet can deliver such raw performance, but there is the forever nagging question of usage model. Luckily, the new headroom breathes new life into both Virtualization and storage over Ethernet usages (both of which I’ve talked about here and here) and provides new opportunities to more efficiently utilize your network link.

 

Intel® Xeon® processor 5500 series allows the vision of consolidation in the datacenter to scale new heights, increasing the number of Virtual Machines (VM) that can effectively live inside a single system enclosure. Each incremental VM will add additional network I/O that is already starting to exhaust a 4 or 8 port single gigabit interface configuration with today’s server capabilities. As more VMs are added in the Xeon 5500 generation, 10 Gigabit will no longer be seen as optional; it will be required. For its part, the Intel® 82599 10 Gigabit Ethernet Controller supports Intel® Virtualization Technology for Connectivity (Intel® VT-c) to improve overall system performance in virtualized server environments. Intel VT- c includes hardware optimizations that help reduce I/O bottlenecks, boost throughput and reduce latency. Components of Intel VT-c include VMDq, and VMDc. VMDc consists of SR-IOV which I’ve covered before, and the ability to support VM mobility; a critical usage model for modern a IT deployment. All together, server systems can support more VMs, more throughput, more flexibility and better performance in a datacenter environment.

 

Finally, the additional capabilities of the Intel® 82599 10 Gigabit Ethernet Controller product surrounding support for FCoE offloads and full support for the new Data Center Bridging (DCB) standards provide an opportunity for storage convergence over Ethernet in either a datacenter using a Fiber Channel SAN environment or an IT environment more focused on iSCSI. On the performance side of things, iSCSI acceleration along with FCoE data path offloads are supported in the Ethernet controller, and on the processor there is support for the CRC instruction set which insures iSCSI data integrity while minimizing processor overhead.

 

The ability to converge at least part of the additional storage infrastructure onto Ethernet is just another factor driving massively increased data rates over Ethernet… luckily, the Intel® Xeon® processor 5500 series and the Intel® 82599 10 Gigabit Ethernet Controller solutions are up to the task.

Over the past few days, there has been a lot of noise around Intel® Xeon® processor 5500 series and the many other platform components that help it shine its brightest. Improved processing power, memory controller bandwidth, faster and redesigned FSB, and improved 10 Gigabit networking all converge together to provide a fantastic performance, convergence, scalability and power story. Intel’s strong history in the server and processor markets, coupled with over 25 years in Ethernet makes this latest release a natural evolution of technology. Together these capabilities, along with the improved 10 Gigabit features and performance, are helping to transform the datacenter. It will be denser, more power efficient, more performant, and more consolidated with capabilities like FCoE and iSCSI.

 

As for “Why do I need 10 Gigabit?” We have the answer, and it’s the new Xeon®.

 

Ben Hacker

-----

** Source. Intel. Mar 2009. Up to 2.5x performance compared to Intel® Xeon® processor 5300 series. Performance result of a bandwidth intensive network benchmark (IxChariot). Network throughput was measured on 64KB I/O size transfers between the test system and multiple network targets. Intel pre-production system with two Quad-Core Intel® Xeon® processor 5500 series CPUs (2.93 GHz), 12 GB memory (6 x2GB DDR3 - 1066MHz) vs. Intel Production system with two Intel® Xeon® processors X5365 (3.0GHz, 1333MHz FSB), 8 GB memory (8 x 1 GB DDR 2 - 667). Windows Server 2008, stock unmodified installation.

2 Comments Permalink
2

Virtualization 1.0 is yesterday’s news; the days of virtualization being used only as a tactical tool to drive consolidation and higher system utilization are quickly ending. For the most part, companies have figured out how to get improved utilization, and are using server virtualization in a wide range of usage models across development, testing and some rather interesting production/mission-critical scenarios.   Its use is gradually maturing from simple partitioning and encapsulation to leveraging the mobility of virtual machines to improve management and operations of IT environments. This is allowing the change in deployment models for virtualization from typical scale-up approach (SMP with large Memory servers) to a scale-out model.

Virtualization 2.0 includes a host of new use cases (shouldn’t be surprising to anyone) that include:

·         Load-balancing for SLA Mgt

·         Power-optimization

·         High availability (no downtimes)

·         Disaster recovery and business continuity

·         Hosted clients

·         SOA & Utility computing.

I see three key foundational tenets as the underpinnings for these usages.  First are the “abstraction” and the “convergence” of compute servers, storage and networks. It has been happening, but virtualization 2.0++ is driving (and will continue to drive) a seismic rethink in how Data centers are architected, and the data center would be a “Fungible” pool of infrastructural resources, for a wide variety of services that IT provides to run the businesses.   I will get deep into the implications of this to IT operations, etc, in a follow on blog, but will leave you with this thought.  The new control point in the data center, both architecturally and operationally, would be the integration of compute, storage and network virtualization architectures.  Key industry players like IBM, HP, Cisco, EMC, VMWare and Microsoft are introducing integrated solution architectures targeted at positioning themselves as the first vendor of choice for this emerging direction.  This foundational tenet, coupled with the merits of Service-Oriented Architectures (SOA), is providing an infrastructure for ‘Cloud Computing’.

The Second core tenet is the mobility of Virtual machines - The migrate-ability of the ‘encapsulated’ Virtual machines on this abstracted infrastructure for the best performance, operational cost and SLA management.  They are no longer tied to a server or a set of servers. In some cases they are not tied to a datacenter; hybrid models are emerging where these VMs would execute in the ‘enterprise’ data center, or on external clouds – the optimal place for the best TCO, and SLA management  (Yes, yes, there are security, compliance, accounting, performance concerns… I agree)

The third core aspect is Manageability.  The abstraction and the mobility, coupled with IT’s job of ensuring security, reliability and compliance brings a totally new set of requirements for Manageability.   

If done right, the benefits of Virtualization 2.0 (and 2.0++) to IT shops would be in the form of reduced administrative costs, improve productivity even as demand goes, reduce energy and cooling costs, etc,   however, there are quite a few challenges with the adoption of Virtualization 2.0.  Let us briefly look at these.

 

Challenges with Virtualization 2.0

1.      There is a significant challenge in the management of large scale virtual infrastructures. There are no clear boundaries and responsibilities in terms network, storage and datacenter management teams.  The emphasis on monitoring and management in Virtualization 2.0 is shifting from virtual machine (VM) management to service management; i.e., knowing how a business service is performing and which components of the Data Center (network, server, VM, applications) are working properly and which are not. Hence, it's no longer sufficient to just monitor the uptime or resource usage levels of virtual machines and physical servers and conclude that the entire IT infrastructure is working right.   More granular monitoring and management of resources would be needed to provide precise QoS and SLA management.

2.      VM Mobility – The Mobility of Virtual machines puts requirements on the underlying server CPU architectures, and has challenges with networks and storage.  Such mobility occurs via either a cold migration - which simply copies the virtual machine and restarts a copy somewhere else. Or a live migration, which moves a live running virtual machine, while maintaining state.  There are clear cases where cold migration is sufficient, but the flexibility and agility that is inherent with the virtualization 2.0 use-models requires the ‘live migration’ of VMs. 

·         VM Mobility and the ‘Compatible CPU Architecture’ requirements: Successful migration relies on compatibility between the processors of the host servers within a cluster. For live migration to take place, the source and destination servers must be in the same cluster and must have processors that expose the same instruction set., In the past, it has not been possible to mix servers based on different processor generations, each of which support different instruction sets, within the same cluster without sacrificing the ability to live migrate VMs across hosts supporting different instruction sets. As a result, IT organizations have needed to create separate clusters for different server generations. This has limited our ability to provide an agile data center environment because it creates islands of compute capacity, resulting in data center fragmentation.  Intel’s VT FlexMigration assist, together with VMWare’s Enhanced VMotion, provide a solution. These products are designed to allow IT to maximize flexibility by creating a single pool of compute and memory resources using multiple generations of Intel processor-based servers within the same cluster.  This can reduce the number of pools, increase the efficiency and utilization of servers.

·         VM Mobility & networks: Today, when Virtual machines move on the virtual infrastructure, its network properties and policies are not retained.  Connection state, ACL, Port Security properties, ACL Redirect, Qos Marking, etc are lost as these VMs move across hosts.   Technologies like the VMWare distributed switch, and Cisco’s Nexus 1000v are specifically targeted to address the ‘Network and Security’ aspects of VM Mobility.

  1. Licensing in Virtual environments:  Licensing rules for applications, development tools, data management tools and operating systems often make a completely virtual environment more costly than the organization expects.   Most all ISVs are looking at ‘virtualization’ friendly licensing models, but they are far from being there.  Example:  With Oracle database servers, if you have a 16core server as your host, it doesn’t matter if you database VM uses 4 vCPUs, you would still need the license for 16 cores.  If you would “Live Migrate” the VM, you would need the license on each of the host… This gets prohibitively expensive and impractical.

  1. 10G Networks and Converged Fabrics: The Compute power on the servers has increased dramatically, and with the advent of 8 core processors, the bottleneck clearly moves out of the server, and on to the network and storage bandwidths and throughput.  Virtualization 2.0 will require the consolidation of network traffic and will also increase the need for more bandwidth to the server, both of which will be possible as enterprises make the move to converge and consolidate data, storage, and inter process traffic on 10GbE networks.  10GbE and the converged networks need new switches, access cards, and also a rethink of how applications view the network I/O.  

  1. Security and Isolation guarantees – The hosting of multiple ‘services’ on an abstracted virtualized infrastructure has very specific needs on Security and isolation, multi-tenancy isolation, compliance and audit requirements..  In addition to providing these on a server (for a given service), the infrastructure has to guarantee these across the infrastructure – doesn’t matter on which server (and where) the service and data reside/execute, they need to be secure and isolated. 

In conclusion, Virtualization 2.0 would have a dramatic impact on the architecture of the data center, and also IT architectures and operations.  IT shops will use virtualization for administrative cost reduction, better resource allocation, and more flexibility in a mobile world.   Coupled with Service Oriented Architectures,, the promise of true service-oriented/utility computing might be closer than it has ever been with Virtualization.

Would love to hear your thoughts and views on this..

2 Comments Permalink
0

Day 1: I'm live from VMWorld this week experiencing the virtualization event of the year. I'll be updating this blog with happenings from the Intel booth and around the show floor. Some really cool video interviews with Intel Partners who are making a big impact in the virtualization world and giving IT managers real advantages over previous generation solutions. Here's a video showing the XEON 7400 near perfect scalability from 8 to 24 to 48 cores. Wow, 48 cores, that's cool!

 

 

 

 

If you liked the first video, check out this one where Jon Markee is talking about Intel Virtualization Technology (VT) and how flex priority improves performance and reduces boot time in your virtualized environment.

 

 

 

 

 

Day 2: Here's another video from the Intel Booth showing more examples of Intel Virtualization technology.

 

 

0 Comments Permalink

Filter Blog

By author: By date: By tag: