Home > Intel Communities > Open Port IT Community > The Server Room > Blog > Tags > performance
1 2 3 4 5 ... 10 Previous Next

The Server Room Blog

143 Posts tagged with the performance tag
0

I’ve spent a fair number of words in the past on the benefits of 10 Gigabit and what it means for the server market.  Through the addition of FCoE and DataCenter Ethernet as well as advanced virtualization features 10 Gigabit seems likely to have its big day in the sun here pretty soon.  But the question is still “When”?


While the proof is ultimately in the raw volumes of 10 Gigabit that ship, and the number of IT users who utilize the higher performance, there are some key reasons to think that 10 Gigabit momentum is accelerating beyond just the numbers* below:

10 Gigabit Forecast.JPG

 

Over the past year, there has been a raft of new 10 Gigabit switch announcements** from Cisco (Nexus 5k/7k), Arista (7100, 7124, and 7148), BNT (G8100), Extreme Networks (Summit X650) Juniper (EX8200), Voltaire (8500) and many others that have increased the choice, and the density of 10 Gigabit switches in the marketplace.   There are now many 48+ port 10 Gigabit switches available and even a few 200+ port models.  Also, the improved density and feature set of certain switches (such as Voltaire’s 280+ port 8500 series switch) provide a path for 10 Gigabit’s ascent into the clustering market by improving port density and latency for clustering applications.

 

Broad acceptance of SFP+ has also helped to drive a rapid improvement in price, density, and power.  SFP+ provides a smaller form factor standard for optics, as well as a standard connection methodology to connect directly from switch to NIC via a Twin-Ax copper (read: ‘low cost’) cabling solution inside the rack (up to 10m).  The widespread adoption of SFP+ form factors has dramatically reduced the entry level price points for switches, and through the ‘direct attach’ copper connection capability it has also reduced the overall cost for initial and ongoing deployments of 10 Gigabit by providing a lower cost bridge to optical or full 10GBase-T support.

 

There are also a few data points to suggest that the Server side cost for 10 Gigabit will also be dropping fast going forward.  As power for 10GBase-T continues to drop quickly, more and more Server vendors are looking at the options available to embedded 10 Gigabit directly into their systems.  This will not likely be a 2009 story, but it is approaching quickly.  Additionally, the acceptance of SFP+ form factors for optics/direct attach cabling has provided a path that some Server vendors may use to design 10 Gigabit down on motherboards without adding the extra cost and power of a 10GBase-T solution.  This looks like a likely near term given that the solution power and design are robust and ready for motherboard based designs today.

 

Finally, the continued cost reduction provides an attractive long term value of standards based 10 Gigabit Ethernet.  There is clear indication downward pressure on 10GbE prices already present today.  We will see 10 Gigabit pricing follow a similar price curve as we saw with Single Gigabit.  This is evidenced in the recent pricing announcement where Intel reduced the cost of single port 10GBASE-T adapter 40% from $999 to $599.  The competitive economics of standards based hardware will continue to drive down 10 Gigabit prices even further and we will see 10GBASE-T pricing below the $500 / port price in the near future. Once it gets on the motherboard, prices will drop even further.


Overall, the power, density, latency, and cost of 10 Gigabit are all improving at a rapid rate.  Form factor flexibility coupled with a wide array of switch and NIC vendors in the marketplace will provide choice and low cost for IT departments while virtualization and convergence in the datacenter and elsewhere continue to provide demands for ever greater I/O bandwidth and performance.

 

--

Ben Hacker

 

 

 

* Del’Oro Forecasts as of Q1 ‘09

0 Comments Permalink
0

Did you know that using an electricity rate of 11.4 cents per kWh provides a simple method of calculating annual electricity cost of any device?

 

1 watt of power consumption, with an electricity rate of 11.4 cents per kWh, cost $1 per year, assuming power usage remains constant.  Also, as a general rule of thumb, every 1W of device power consumption in a data center requires an additional 1 watt for overhead power (Source: Intel IT). So a device that consumes 1W actually consumes 2W of power at a data center level.


Here's the math:  1 Watt power * 8760 hours per year / 1000 * $0.114 electricity rate per kWh = $1 per year.  This math holds the same for any currency, Euro, Yuan, etc.  11.4 cents per kWh is the crossover point…and as electricity rates increase over 11.4 cents, 1 watt will cost more than a $1 per year. 

The datacenter overhead power, often referred to as Power Usage Effectiveness (PUE) is a number which has emerged as the leading metric for data center energy efficiency.


You might say that 1W = $2 annually doesn't sound like much, but start doing the math for 1000 servers that consume 200W in a data center with a PUE of 2.0 which works out to annual electricity cost of ~$400,000 per year.  Now, for every 1 watt the server power consumption is reduced, this would translate into $2000 annual savings.  Note, this is a very rudimentary example, but it is useful to illustrate why customers are really starting to focus on power as one of their key purchase decisions.

    

If you need energy efficient servers, there are multiple server vendors currently have some exceptional energy efficient products based on Intel(R) Xeon(R) 5500 processors.  And looking forward, we are also actively working on how to reduce power of the processor and at the system level for the upcoming generations of products.


Here’s some good reference on electricity rates:

For United States, state by state electricity rate comparison

For Europe, 1st half of 2008 rate comparison by country.


Remember, power is one purchase decision, but it is not the only one.  A rack of servers that consumes less power that does less work isn't an efficient way of deploying servers either.  Ensure that the performance vector is considered.  Intel® Xeon® 5500 processor based servers provid exceptional performance and perf/watt leadership over the competition.


Quick question for you:  How does electricity rate of 11.4 cents per kWh and a data center PUE of 2.0 compare to your data center? 

 

 

0 Comments Permalink
0

We talk a lot about how great the Intel Xeon processor compares vs. competing RISC architectures when it come to price and price/performance on various workloads, but unfortunately for many existing people running on RISC hardware, simply throwing out the old and standardizing on the shiny new Intel-based servers isn't always that simple of a proposition.  Why? Your existing software running on UNIX (i.e. AIX, Solaris) may be custom-coded on your flavor of UNIX, the source code may be lost, the guy who wrote retired 5 years ago, etc.  So, how do you account for this when 'running the numbers' to see if it makes sense to rid yourself of the power and money-sucking old RISC server collecting dust in the back of the data center?  These five steps may help:

 

1.  Understand the business benefits of moving from your existing RISC hardware to IA (and compare vs buying new RISC hardware)

This is the simple analysis that looks at performance of your existing system, compares it to new hardware and then factor in other significant cost items like power consumption, software licensing, software/hardware maintenance costs, etc.  Of course, this almost always shows that new Intel hardware will save you significant dollars over the long-term and you can figure out how quickly you pay-back your cost of the server in years or possibly months based on this simple calculation.  And, many server hardware vendors (and Intel field reps) have these tools available, you just need to ask.

2.  ***** your current RISC-based infrastructure

Meaning, look at all the software that actually runs on these servers, the packaged applications and the custom code.  Do you use particular storage adapters and drivers for your SAN, etc?  Make a list.  Now, look to see which items are easy, and which ones will be hard to migrate.  If it's a packaged database that available on Windows, Linux, and Solaris for IA already, then it may be fairly to migrate the data over in a short period of time by yourself and move on.  However, for those custom codes and potential software packages that will need to be changed in order to move to current hardware, start looking at the real costs to migrate these pieces.  Often, this step will require some help from a services company or a hardware vendor that can provide these services in addition to selling you the new hardware.  Now that you have these estimates, factor it back into step #1.  Sure, the ROI will not look as good, but often will be surprising still very good even after factoring in these migration costs.

3.  Develop a migration plan

You may chose to do this on your own if it doesn't look too intimidating, but for more complicated migrations, likely you will need some external help.  If you've factored in these services costs already during the previous step then the cost of doing this step is already justified.  Many services companies will give you the estimate very inexpensively.

4. Test

You may only be able to test the 'easy stuff' initially, but verify the performance deltas between the new and old systems calculated in step 1 to correctly size how much hardware you will need in actual deployment.  This is where the actual performance of the system will measure up vs. the performance estimates used in your ROI analysis in step 1.  Sometimes this can be better than calculated or worse, your mileage is guaranteed to vary.

5. Deploy

If you have your migration plan in place from step 3, now you execute according your plan, ensuring your migrate data in the right order to ensure minimal downtime.

 

These steps can be very intimidating, many people in IT find it hard to justify the migration costs (particularly if you need to pay for some services), but taking a systematic approach to it and carefully calculating your ROI including these extra costs will often make it worth the effort.

0 Comments Permalink
1

I was out at HP Tech Forum last week and had a chance to catch up on all the latest technology advancements with HP and Intel. What I saw was staggering, over 17 new HP-Intel designs, the HP Performance Optimized Datacenter (POD), and lot's more that I will be sharing with you in coming days as I add more video from the event and help to tell the story if you couldn't be there. First off, I caught up with John McAtee from Intel's HP account team. He was showing a cool demonstration on why now is the right time to invest in XEON 5500 processor series technology. Check out this video and find out how you can start saving in your datacenter today !

 

 

If you want more information on how the XEON 5500 processor series can starting saving in the datacenter, check out this ROI Calculator tool. Also, if you are looking for detailed information or are just looking to gain more knowledge, you can always "Ask The Professor" in our Server Learning Center.

1 Comments Permalink
7

Sunsets can last a while, but in the end the sun will go down.  I talk to a lot of companies and listen to a lot of data center managers.  Customers trust their AIX-Power and Solaris-Sparc platforms.  These are solid platforms and deliver good features and reliability, but, if these managers could get the sense of security, performance, and reliability with Linux / Intel Xeon platforms, they would move tomorrow.  It is simple economics.

 

The reality is that customers are making this move, and being successful.  The hardware reliability on Intel platforms today is amazing.  Intel recently announced that their next generation of Xeon(Nehalem) EX based servers will support Machine Check Architecture.  This brings high end Xeon X86 servers into the RAS family previously reserved to proprietary RISC & mainframe platforms.  Intel Xeon already eclipses the performance of proprietary RISC processors both on a per processor basis and a per dollar basis.  It is reasonable to say Xeon can deliver better performance, better value, and equal or better reliability.  The only hurdle left is the software.

 

Linux has come a long ways.  It is no longer a university OS, run by geeky dudes in black T-shirts emblazoned with the quadratic formula.  It is mainstream and solidly supported.  Linux is the primary development and delivery platform for Oracle.  Other OS environments are ports, delaying support and innovation.  Linux is used by major financial companies.  Linux is available in solid and well supported distributions with a 20 year history of enterprise business.  Linux experts are broad community, worldwide, and growing in number.  Linux is economical vs proprietary RISC.

 

In an era of big budgets and conservative (don’t make any changes) philosophies, businesses will always stick to their proprietary RISC systems.  That era is over.  Sticking to your RISC systems may seem like the safe move, but failing to examine the opportunities for better performance and lower cost with Linux on Intel Xeon platforms is business negligence.  Business negligence is seldom rewarded. 

Performance, Price, Infrastructure, Ecosystem, TCO, RAS – when the decision factors are examined it becomes clear -  we are in the RISC twilight and the sun will set on Sparc-Solaris and Power-AIX.

7 Comments Permalink
0

Running multiple Unix environments across a range of locations adds increased complexity and cost to the IT environment. I came across an interesting case study and wanted to highlight some of the key findings

 

YPF SAis the largest company in Argentina operating in the Oil and Gas industry. The company has 29 gas plants around Argentina running different Unix environments such as HP-UX, AIX and Solaris.

 

YPF SA consolidated their SAP ERP and Oracle DB environment from multiple Unix environments to Red Hat Enterprise Linux 5 with integrated virtualization running on Intel Xeon based platforms from IBM System X

 

Some of the key findings to highlight

  • Key requirement from Unix Administration Team that "migrating from old RISC/Unix and proprietary servers to open and flexible platforms would pose no risk to the reliability, availability and performance of the systems"
  • Positive impact on cost and performance; Lowered costs, simplified management and increased compatibility
  • Reduction in costs especially when compared to license costs of RISC based platforms
  • Increased performance and availability drove decision to scale with RHEL and Xeon
  • Ability to leverage Redhat integrated virtualization. Free up internal hardware and technical resources for other projects

 

 

I guess the combination of Redhat and Intel deliver the business results that customers are seeking. What do you think?

0 Comments Permalink
0

I have been watching the social chatter today about the latest Top500 supercomputing list and seeing companies, manufacturers, application vendors and even countries compete for mind share of this most recent list on twitter.

 

However, as I read about and explored this list, the things that jumped out at me were not the who’s number one, two, three … or who grew what number of spots ... but rather the trends that have occurred over time. These trends have not happened in the last 6 months or the last 6 years but instead over the course of nearly a decade of innovation

 

1)      Today, the #10 posting (a cluster using the 3-month old Xeon X5570 processor (Nehalem-EP)) delivers the same FLOPS performance capability equal to the entire June 2000 TOP 500 computers list. (see below)

 

 

top 500 over time jun 09 Performance_Development.png

source: http://www.top500.org/lists/2009/06/performance_development

 

2)      Also, the emergence of multi-core intel-based servers complemented by affordable open-source software solutions have enabled a transformation of how supercomputing performance is delivered. Intel based servers have gone from nearly “0” to nearly “400” over this decade.

 

 

IntelTOP500history.jpg

source: http://www.intel.com/pressroom/images/IntelTOP500history.jpg

 

I recently had the opportunity to co-present a webinar with Matt Jacob’s of Penguin Computing where we talked about how High Performance Computing is changing the way that businesses innovate, research, design, analyze and create. What used to be only done in large datacenters and universities are now available to mainstream IT and businesses.

 

This is extremely important for areas like health care, financial services, manufacturing and many other industries.  Equally important are the software technologies (intel cluster ready software) that can make clustering available and easy to use so that this performance capability can be tapped without a ton of complexity.

 

So, while the Top500 list may be interesting for bragging rights, what excites me and many of the end users that I talk to are is the power, affordability and accessibility that high performance computing has to mainstream business users and the innovation and creativity that brings to the marketplace.

 

How are you using computing perfomance to do things that once were not possible in your business?  Share your story with us !!!

 

Chris

http://twitter.com/chris_p_intel

 

 

0 Comments Permalink
1

Every day in our personal lives, we’re bombarded with “opportunities” to get a better deal.  At the grocery store, we might be able to buy a single item for $2.50 or 3 for $5.00…which then forces us to go thru the mental gymnastics of figuring out how good of a deal it is, and whether or not we really need three 96 oz. bottles of salad dressing.

 

But there are some opportunities out there for adding a bunch of compute performance are a bit more straight-forward.


Case in point: Dell recently had Principled Technologies compare the performance for the Intel® Xeon® Processor E5520 and E5506 CPUs each running on a PowerEdge R710 server.  Both are 4 core processors, but the E5520 has many advantages over the E5506: 

  • higher frequency (2.26 GHz vs. 2.13 GHz)
  • faster QuickPath speeds (5.86 GT/s vs. 4.8 GT/s)
  • faster memory support (1066 MHz vs. 800 MHz)
  • Turbo Boost
  • Hyper-Threading support.

 

Long story short:  Buying a slightly better processor with a server purchase can drastically increase your performance.  So if you are looking to buy a Dell PowerEdge server configured with Microsoft SQL Server 2008* and an Intel® Xeon® Processor E5506, for an additional $300 you can get up to 75% more performance by upgrading to an E5520 CPU.  More performance headroom in a similar power envelope, faster QuickPath and memory speeds, Hyper-Threading and Turbo Boost functionality – all for $300.  NOW THAT’S A GREAT VALUE!

 

Check out the summary document for the Dell R710 Principled Technologies performance testing, which also has comparative performance testing for the Xeon® E5540 and X5550 CPUs (also a great value for the money!), along with results for Microsoft Exchange.

 

NOTE:  System pricing from www.dell.com as of May 13, 2009.  Actual performance will vary based on configuration, usage and manufacturing variability. See the actual Principled Technology report in the following link for complete system configuration

1 Comments Permalink
1

Are you a developer writing applications to run on the Solaris operating system?. Are you looking for ways to optimize your Solaris solution on industry standard architecture based on Intel microprocessor? If you answer yes to either of these questions then please read on.

 

Intel and SUN have been working closely together to optimize the Solaris operating system on the Intel Xeon 5500 processor. Most of you probably know the Xeon 5500 better by its product codename Nehalem. The Xeon 5500 is the the product that fits into 2 socket platforms.

 

SUN have just published a very compelling quick reference guidethat will assist both Developers and System Administrators looking to optimize Solaris solutions on Xeon based processors. The guide talks about the work that Intel and SUN are doing together, technical descriptions of specific features and capabilities that can be implemented in the Solaris OS to optimize the capabilities of the Xeon.

 

I have just finished reading this and it is a very compelling paper covering topics such as

- How Solaris takes advantage of Intel Turbo Boost Technology to use available power headroom to deliver higher performance based on workload demand

- How Solaris can take advantage of new Intel Quickpath Interconnect (better known as QPI) and other innovations in the OS to reduce memory latency

- How Solaris performance counters help to better manage workloads

- How Solaris takes advantage of many of the power efficiency capabilities in the processor. Things like Power Aware Dispatched in Solaris enable the processor to stay longer in idle states. In non tech talk this saves power.

 

Solaris has been a tried and tested operating system for along time for companies running their most business critical workloads. This paper talks about the combination of Solaris and Xeon to deliver improved reliability and availability for these critical workloads. Detail information on predictive self healing, fault management, leveraging Intel Machine Check Architecture and more all included in this paper.

 

Probably my favourite section is around the developer tools optimizations and the different tools available for developers that want to run and optimize their applications on Solaris and Xeon.

 

Ok, I'll stop waxing lyrical now. This is a very compelling paper and it does certainly construe that Solaris and Xeon 5500 could be the perfect combination for your Solaris solution. What do you think?

1 Comments Permalink
1

The debate on how to best increase system capacity to accommodate growing applications has raged on for years; “scale up” with more CPU, memory, and I/O, or “scale out” with loosely connected systems.    Scaling out by adding networked systems to increase capacity has been a good economical solution for many IT managers because it allows them to grow by using less expensive, industry standard building blocks.  However, there are some notable exceptions to this line of thought.  One is that the class of applications that require shared memory and large database support are much better suited to run on a single, expandable system that scales up.  These are typically transaction processing, business intelligence and ERP solutions.   Until now, IT managers running applications that require scale-up systems larger than 4 or 8 CPUs have had limited platform choices and most were proprietary and expensive RISC-based servers.

 

The other problem with the scale out approach is the people, facilities, software and overhead costs and complexity of managing very large numbers of servers, which can grow to a point where the costs outweigh the performance and system cost benefits.  The industry solution to achieving better ROI has been to consolidate multiple scale-out servers onto single industry standard scale-up servers with virtualization solutions.  This is a good solution, but is limited by the number of application loads the IT manager feels comfortable placing on a single server, given the need to maintain peak performance and availability for each application.

 

Well, it looks like the scale-up, scale-out debate is about to take another turn.  In the server product update Intel gave on May 26th, they talked about new levels of system scalability and choice supported by the upcoming Nehalem-EX processor.  This processor will support systems that scale up to 8 sockets natively (shared memory, without any additional silicon), and up to 16 sockets and higher with node controllers from system manufactures that allow single systems to share memory beyond 8 sockets.   So far there are over 15 different designs from 8 OEMs that offer 8 socket or higher scalability.  But of course, for the class of application where scaling is important, socket count doesn’t tell the whole story of what’s needed for scalable performance.  Thread support, key for transaction processing and virtualization, scales at the rate of 16 threads per socket with 8 cores and Hyper Threading (2 threads per core).  That would be 128 threads for an 8-socket system, and 256 threads for 16 sockets.   And in order to keep those threads fed with data close to the CPU, each processor supports up to 24 MB of shared cache (1.5X current generation Xeon), and an impressive 16 memory slots per socket or 128 DIMMs on an 8-socket system.  In addition, the Scalable Memory Interconnect gives these systems 9 times the memory bandwidth of today’s top Xeon processor.  Finally, four QuickPath interconnect links per socket allow for high-bandwidth sharing of data across the system.

 

So the net of it is that the industry is going to see a broad selection of highly scalable, next-generation servers that significantly extend the economic advantage of industry standard scale-up solutions for business-critical, large database, and high-end virtualization/consolidation deployments.     I would expect these systems to give IT managers a very cost-effective alternative to the much more expensive and proprietary RISC-based servers they use today.

 

What are your thoughts?  Mike

 

Related Topics:

 

 

 

 

1 Comments Permalink
0

As companies face the economic downturn, they are being asked to trim their IT budgets -- essentially, do more with less. Meanwhile, IT folks are also being asked to make sure their companies remain competitive with the best server performance running best of breed IT solutions that operate in extremely efficient data centers as well as ensuring every IT dollar spent is showing an RIO within 12 months or less. That raises the question: “Can migrating applications from a RISC architecture to an Intel architecture save a company money and allow them to remain competitive?” In many cases the answer is “YES!”

 

I have been an Intel Enterprise Technical Specialist supporting many of the large financial customers in the NYC area. My customers have a mix of all sorts of platforms, from commodity X86 servers to large RISC servers and from Midrange to Mainframe systems. Customers perform tests to measure Performance, Performance per Watt and Performance per Dollar. The outcomes will determine the architecture that is best suited for their applications. Customers have also relied on industry benchmarks such as CPU2006, SPECint, SPECfp, SPECpower_ssj2008, and SPECjbb2005 whose results can be found at www.spec.org.

 

I have seen many custom and commercial applications that used to run on other architectures which have been ported and are now running on commodity Intel architectures. Why? The Intel Xeon 5500 Series microprocessor (codename Nehalem) is delivering increased performance, power efficiency, and overall lower cost needed to meet the IT requirements for their need. For example, in the financial sector several applications exist, such as Market Data Feed Handlers, High-Frequency Automated Trading, Risk Analytics, Monte Carlo (compute farms) which require high performance servers to gain a competitive advantage and increase revenues for the firm.

 

As an example, one of my customers migrated several of their company’s in-house developed applications that were running on legacy RISC servers. Migrating applications to Intel servers was a straight forward process since many of them were written in Java and were fairly easy to port. Other applications that were written in C/C++ could be migrated using Intel software tools, (i.e. Intel C/C++ compiler, Thread Checker, Thread Profile and Vtune) to make the job were extremely helpful in migrating their applications to the Intel architecture. For example, using Intel servers for their Risk Analytic application provided increased compute performance over their legacy RISC servers which helped complete their Risk Analytic runs much faster with fewer servers leading to an overall lower TCO.

 

Using Intel Xeon 7400 & 5500 Series has not only provided increased overall performance but has decreased the number of servers through server consolidation in the data center which also requires less energy.  This has helped prevent the data center from reaching the capacity of power and cooling. For some of my customers, using Intel Xeon 7400 & 5500 servers has extended the lifespan of their data center, saving millions of dollars not having to build new data centers due to its increased power efficiency while reducing overall operational costs.

 

0 Comments Permalink
2

Looks like the Intel® Xeon® processor 5500 series is making lots of noise in HPC.  The QPI and integrated memory controller are really providing the boost necessary to make it an all around performance leader for HPC applications.  With all this performance why did Intel add a third memory channel?

The third memory channel enables the platform to support a boat load of memory.  Matter-of-fact, up to 192GB can be supported in a two socket configuration.  It wasn’t too long ago when only 32GB was supported in a dual socket configuration.  By having the ability to support so much memory you can now meet the needs of almost every HPC application.  The 5500 series is intended for all server markets, but let’s face it, with the design changes Intel made with the new architecture the server segment gaining the most benefit appears to be HPC. 

It seemed like yesterday when the only way to have access to large memory configurations was through expensive, proprietary SMP systems.  The HPC market for large SMP systems is still out there but it is shrinking…fast.  Today, we are clustering low cost solutions to create some of the most powerful systems in the world.  Standard components are leading to lower and lower system costs, delivering a price/performance advantage alternative solutions cannot meet.

Now that a single dual socket node can support up to 192GB’s it is important to understand how to get there.  First, to enable 192GB you need 16GB DIMMs x 12 memory slots.  There will be a premium for a 16GB DIMM.  Knowing the options and determining the best, most cost effective solution is going to be dependent upon your environment.  When a large memory node is required, do you purchase the 16GB DIMM’s or go up to a Multi-socket solution?  If I decide to scale back on the memory (use 4GB or 8GB DIMMs instead of 16GB DIMMs) what is the performance impact to my application?  If I am cost sensitive, will the lower cost outweigh the lack of performance?  Can I use SSD’s (Solid State Disk drives) to compensate for any performance loss due to lower memory capacity?  There are many questions to think about when deciding the right configuration for your application and environment and I certainly can’t answer them here.

Let’s not forget the third memory channel enables a different set of optimal memory configurations.  Think x3 when deciding on how much memory to install into your node; 12GB, 24GB, 48GB, etc.  What happens when you don’t use an optimal configuration?  Well it depends, in most cases the impact is minimal, but let me add a bit of context around minimal:

·         Low bandwidth sensitivity (more dependent upon the processor for performance)

        E.g. Monte Carlo, Black-Scholes (financial modeling), BLAST (bioinformatics), AMBER (molecular dynamics)

        Expect less than a 2% difference between memory configurations*

Ÿ  Medium bandwidth sensitivity (somewhat balanced between memory and CPU usage)

        E.g. CFD, Explicit FEA, Implicit FEA (with robust I/O system)

        Expect approx. 5% degradation for non-optimal symmetrical configurations*

Ÿ  High bandwidth sensitivity (high access to the system memory)

        E.g. WRF (weather), POP (climate), MILC (physics), Reservoir Simulation

        Expect approx. 10% degradation for non-optimal symmetrical configurations*

The results are interesting.  In all three cases above, the degraded performance is always better than the performance you would have with only two memory channels.

When you hear about performance impact of non-optimal memory you can see by the examples above, it is application dependent and will not have a severe impact on your overall system performance.   

The Intel Xeon processor 5500 series offers support for huge memory nodes with the addition of the third memory channel.  Memory configurations in multiples of three are ideal, but if you decide to stay with a power of two configuration the performance should still exceed that of a solution based upon only two memory channels.

*Based upon Intel internal measurements

2 Comments Permalink
2

A MONSTER CHIP IS COMING. The next generation of MP processor is targeted for production later this year, and by all accounts it is going to be a monster. Nehalem-EX is part of the Nehalem family of processors, but compared to its siblings it has the highest cores/threads count, largest shared cache, highest CPU-to-CPU bandwidth, highest I/O bandwidth, highest memory capacity, highest memory bandwidth, greatest scalability, and highest level of Reliability/Availability/Serviceability. It’s expected to bring a gargantuan, unprecedented leap in capabilities and performance--the biggest leap in all of Xeon product history.

 

IT’S TARGETED AT “BIG BOXES”. Big box servers are multiprocessor systems using the most capable processors and platform components. These systems are targeted at applications and usages that require the largest memory footprints, the highest amounts of single-box processing power (for workloads that don’t decompose well into lots of independent threads) and/or advanced levels of RAS. Such systems are typically the best choice for large databases, ERP apps, Business Intelligence apps, large-scale server consolidation and business-critical virtualization, mission critical applications and large scale high performance computing.

 

IT USES THE SAME PROCESSING TECHNOLOGY AS THE SUCCESSFUL XEON 5500, BUT MORE OF IT. Just like with Xeon 5500, the Nehalem micro-architecture brings improved single-threaded performance via IPC (Instructions per Clock) enhancements and Intel’s Hi-k 45nm manufacturing process. Greater multi-threaded performance comes via Hyper-Threading and more cores. But while the Xeon 5500 has up to 4 cores/16threads per socket, the Nehalem-EX monster doubles that to 8 cores/16 threads.

 

HAS A BEEFIER MEMORY AND INTERCHIP COMMUNICATION SUBSYSTEMS. Monster thread processing capabilities require monster size feeding to bring out the best performance. Nehalem-EX’s raw processing potential is made viable by a heavy duty memory subsystem and inter-chip communication system.

Nehalem-EX has 24MB of shared level 3 cache--that’s 50% more than the current Xeon 7400 and 200% more than Xeon 5500. The memory channel bandwidth was increased to 9-times that of Xeon 7400. And it’s all attached to up to 16 DIMM slots per socket (that’s 64DIMMs slots for 4 sockets)—double the current generation of Xeon 7400.

In a multi-socket system, processors need to communicate with each other in order to most efficiently coordinate their shared workload. They also need lots of I/O bandwidth. Nehalem-EX has four QuickPath Interconnects on every socket--double that of Xeon 5500. The four QPI links enable Nehalem-EX processors to be directly connected to each other in a 4 socket system. This offers significant performance advantage over a so-called ring architecture wherein some processor-to-processor communication must go through an intermediary processor. The extra QPIs also mean that there’s plenty of CPU to I/O bandwidth.

 

EXPECTED TO BRING THE GREATEST LEAP FORWARD IN XEON PERFORMANCE EVER. On key server performance benchmarks (e.g. SPEC_int_rate, SPEC_floating point_rate, TPC-C, etc) Xeon 5500 using Nehalem technology brought gains of over 100-200% greater than prior generation. Generational gains of this magnitude come along just about once a decade. Nehalem-EX’s generation-to-generation performance gains are expected to be substantially higher than those of Xeon 5500. We’ve already seen measured memory bandwidth of 9X vs. prior generation. That’s an early indication of the level by which new performance records will be set when this monster chip comes to market.

Related Topics:

NHM-EX Press Fact Sheet

NHM-EX May 26th Press Briefing Video – condensed version

IBM 8Socket Demo Video

 

NHM-EX--A New Standard

2 Comments Permalink
0

Your most valuable employee is the one that creates tomorrow’s successes.  Providing them tools that help them do that faster will help your organization create new products or optimize old ones more rapidly.  The benefit to the organization is increased opportunities to win the customers attention via new products or your responsiveness to their request; the employee gets to brag on what he or she just helped bring to market.

Before we get to far let’s look at Intel’s mission with respect to workstations.  We are laser focused on supplying technology that provides users with an uncompromised experience in transforming their ideas into reality.  With that in mind we look at how users create; we try understanding their obstacles and work with the ecosystem of hardware and software providers to deliver solutions to real problems that may be inhibiting their opportunity to innovate.  

One technology that is helping users innovate faster is virtualization. 

The Observation

We saw workstation user’s innovation slow as they multitasked between tasks – some of them not even theirs.  The involuntary task included IT security patches, updates, and system backups to name a few.  We also saw that users were no longer just doing CAD, but they were doing CAD, using productivity tools, meshing, web surfing for supporting facts, collaborating via video, digital white boarding and trying to do analysis driven design.  They were very busy people.

In some cases we noticed that some users actually had not one, but two workstations running in completely different environments, many times different OS’s.

The Problem

What the above really lead to is a conclusion that too many task were going after too few resources and that the experience we had hoped the user would encounter was not happening.  In fact the reverse was happening – interactive creative task were slowing, system sluggishness was at an all time high.  The “uncompromised experience in transforming their ideas into reality” we wanted for a workstation user was not there and any innovation that was possible was slowed down to a crawl.

A Potential Solution

Intel® Virtualization Technology for Directed I/O, once just thought of for servers actually has a place in the workstation market. 

This technology provides an important step toward enabling a significant set of emerging usage models in the workstation. VT-d support on Intel platforms provides the capability to ensure improved isolation of I/O resources for greater reliability, security, and availability.  That is a mouth full let’s see it in action.

There are two key requirements that are common across workstation usage models.

1.       The first requirement is protected access to I/O resources from a given virtual machine (VM), such that it cannot interfere with the operation of another VM on the same platform. This isolation between VMs is essential for achieving availability, reliability, and trust.

2.       The second major requirement is the ability to share I/O resources among multiple VMs. In many cases, it is not practical or cost-effective to replicate I/O resources (such as storage or network controllers) for each VM on a given platform.

In the case of the workstation, virtualization can be used to create a self-contained operating environment, or "virtual appliance," that is dedicated to capabilities such as manageability or security. These capabilities generally need protected and secure access to a network device to communicate with down-the-wire management agents and to monitor network traffic for security threats. For example, a security agent within a VM requires protected access to the actual network controller hardware. This agent can then intelligently examine network traffic for malicious payloads or suspected intrusion attempts before the network packets are passed to the guest OS, where user applications might be affected. Workstations can also use this technique for management, security, content protection, and a wide variety of other dedicated services. The type of service deployed may dictate that various types of I/O resources, graphics, network, and storage devices, be isolated from the OS where the user's applications are running.

The Result

Working with Parallels Workstation Extreme VM application we looked at two problems.  First was the general overhead related to too many request and too few resources and then we explored the more complex problem of a single workstation with a need to display at near native performance in two different OS’s.

The former was straight forward, create VM’s, partition resources and your innovator now has a very resilient workstation that is capable of delivering the intended experience.  IT can have their VM’s and the user has his or her workstation back and the concept of digital prototyping to create and explore a complete product before it is built is a reality.  Your innovator can now iterate through more ideas in less time and your company’s opportunity to catch the customer’s attention just went through the roof.

The former provided a much harder challenge.  We tested the idea in the oil and gas market where users actually had two workstations; one running Windows, one running LINUX. Both had a requirement for visual display and both acted on that same reservoir data with applications that while similar in many ways, they were still different.  When preparing to drill a multimillion dollar well – the idea of more data saying the same thing is a very good thing.

The Proof Point For Virtualization In A Workstation Engineers from Schlumberger, a leading oil field service provider, run performance-demanding applications such as GeoFrame* and Petrel*.  These applications serve to analyze complex geologic and geophysical data and determine the viability of potential reservoirs, or to optimize production at existing sites. With GeoFrame running on Linux* and Petrel on Microsoft Windows*, Schlumberger engineers have been running these applications on two separate workstations, driving down productivity and increasing both power consumption and IT maintenance costs.

A New Paradigm

With the advent of Intel Xeon processor 5500 series-based workstations running Parallels Workstation Extreme, virtualization software has opened new horizons with breakthrough graphics performance.

Schlumberger compared the concurrent performance of applications running on a virtualized Intel Xeon processor 5400 series-based workstation with the same setup on the Intel Xeon processor 5500-based machine. The results were astounding. The first machine ran Petrel at full native speed, but performance for GeoFrame slowed enormously. While Petrel refreshed its graphics at a rate of 30 frames per second, GeoFrame crawled along at a graphics refresh rate of JUST one frame every 19 seconds, an agonizingly slow performance.

When the group tested both applications on the Xeon 5500 series workstation, the results were striking: Both applications ran at full native speed, and both were able to refresh graphics at 30 frames per second—a 570 times improvement over the first workstation.

Russ Sagert, Schlumberger’s Geoscience Technical Advisor for North America said “our engineers were blown away by the performance. We hammered these machines with extreme workloads that stressed every aspect of the system. Amazingly, the new workstation based on the Intel Xeon processor 5500 series provided performance enabling this multiple OS, multiple application environment for the first time.”

The key element in Schlumberger’s new environment is Intel Xeon processor 5500 series-based workstations with Intel® Virtualization Technology (Intel® VT) for Directed I/O (Intel® VT-d).  Together, these technologies enable direct assignment of graphics and network cards to virtual machines, enabling the machine to circumvent the interrupt and exit loop and clearing the previous performance problems.

Running in conjunction with Parallels Workstation Extreme, which effectively leverages Intel Virtualization Technology, including VT-d, the solution revolutionizes virtualization for high-end users. “High-performance virtualization on Intel Xeon processor 5500 series-based workstations is a game-changing capability,” says Sagert. “We can allocate multiple cores, up to 64 GB of memory and a dedicated graphics card to each machine. The results are spectacular.”

In the final analysis, moving to the Intel Xeon Processor 5500 series of next-generation workstations does far more than cut costs. It impacts the way that work gets done. If you have clients running the kind of resource-intensive, graphics-rich applications that traditionally slow to a crawl in a virtualized environment, consider the benefits of finally moving beyond the I/O barrier.

A fully configured Intel Xeon Processor 5500 series-based workstation running Parallels Workstation Extreme delivers the performance level that makes virtualization a contender for these users. A streamlined work interface, reduced office noise and clutter, and significant performance gains works on the user side. But the IT organization also gains benefits by lowering capital, management, support, space, and energy costs.

Moreover, the IT team can now standardize on a single OS image while addressing alternative requirements.

Learn More

Intel Workstation Processors http://www.intel.com/products/workstation/processors/index.htm

Parallels Workstation Extreme

http://www.parallels.com/products/extreme

 

0 Comments Permalink
0

Last week I wrote about the server product update for the upcoming Nehalem-EX processor and the expandable platforms based on it.  Today I wanted to provide you with a short 10 minute video captured from the event.  It’s a really good summary for those of you that want to learn more about Intel’s Xeon product roadmap but with limited time.

Also, as I mentioned earlier, look for some informative blogs over the next 1-2 weeks that will offer more of an in depth view of Nehalem-EX’s 4 Socket capabilities, performance, scalability, RAS, and Virtualization. 

bryce

0 Comments Permalink
1 2 3 4 5 ... 10 Previous Next

Filter Blog

By author: By date: By tag: