Home > Intel Communities > Open Port IT Community > The Server Room > Blog > Authors > Wesley Shimanek
1 2 Previous Next

The Server Room Blog

18 Posts authored by: Wesley Shimanek
1

The digital workbench is like the workbench at home where you have pliers, nails and hammers that we use to build or fix things—the workbench holds all the best, most useful tools to complete a project and makes them available at your fingertips.

The digital workbench replaces analog tools with digital tools and software suites from ISVs (e.g. Altair, ANSYS, Autodesk, Dassault CATIA, Dassault SIMULIA, ESI, MSC, PTC, Siemens PLM and others).  These ISV’s are all laser focused on enabling designers to move analysis further up the design chain.  Couple this with recent performance gains available on workstations based on the Intel® Xeon® processor 5500 series from suppliers like Boxx, Dell, HP and Lenovo and you have the opportunity to now view your workstations as a digital workbench.  The result is a new environment that enables users to rapidly test and refine their ideas potentially at the speed of thought. 

The digital workbench, powered by two intelligent Intel® Xeon® 5500 processors based on the Nehalem microarchitecture, can help you transform complex and visually intensive data into actionable information at near-supercomputer speeds. 

 

 

1 Comments Permalink
0

“Imagination is everything. It is the preview of life's coming attractions.” Albert Einstein

Today’s workstation can provide you with a magnificent digital canvas to create tomorrow today.

With workstations powered by two Intel® Xeon® 5500 series processors, engineers have the opportunity to create, shape, test and modify products before they become real. Engineers can now design, visualize and simulate products from the conceptual design phase through the entire manufacturing process. This is done virtually before any investments are made in a prototype.

“Experiment fearlessly.” “Innovation is bloody random.” Tom Peters

Peters, a world renowned author and management consultant, recognized that innovation is more art than science.

Consider this example: Taking innovation to an entirely new level, Boeing, in the late 1990s, employed a process known as algorithmic design to see what designs might be viable to meet a specified hypersonic aircraft design criteria. The algorithmic design process enabled computers to create and test new ideas against the specified design criteria without human intervention. As a result, more models were evaluated in less time, and a vehicle that was counterintuitive to what many engineers may have thought possible was evaluated. Innovation just accelerated.

Intel technology has seen dramatic changes since Boeing first tested the idea of algorithmic design in the last decade. Workstation performance has gone up Dual-processor workstations have yielded to workstations with two processors, eight cores and 16 computational threads. Science or simulation that was never tractable on a workstation before is now standard, and it is getting faster.

“I confess that in 1901 I said to my brother Orville that man would not fly for fifty years.” Wilbur Wright

You think all you need is an entry-level workstation with a single Intel® Xeon® processor.                       After all , you only do CAD—right?

However, as you begin to adopt modern workflows and realize the dramatic impact that simulation-based engineering or digital prototyping can have on your product development cost and schedules, you realize that the cost of the second processor and additional memory necessary to support digital prototyping was far less expensive than the cost of multiple physical prototypes and the associated time to produce them. Instead of investigating hundreds of digital prototypes, you only have time to look at a single physical prototype and ask: What if I …?

Those “what ifs” could have been played out on a dual-processor Intel Xeon processor 5500 series-based digital workbench faster, and your time and cost of physical prototypes could have been significantly reduced.

 

 

The digital workbench, powered by two Intel® Xeon® 5500 series processors, can have an enormous impact on your organization’s ability to design, visualize and simulate products, from the conceptual design phase through the entire manufacturing process, and it is all done virtually before a prototype is ever invested in. These digital workbenches exceed the computational power of the Cray C90 series, which in the 1990s was revered as the fastest ever.

Without question we all recognize that simulation and modeling have become indispensable tools in design. But visualization remains the principal conduit to transforming data into knowledge and actionable information. The digital workbench can provide you with both the compute capacity and the visualization capability you need to innovate faster.

If all you are doing is CAD on your workstation, then an entry workstation may be best your solution. However, as others around you adopt modern workflows that incorporate simulation-based engineering and digital prototyping, you may want to step up to a more comprehensive digital workbench solution that provides an entire suite of tools to help you play more “what ifs” locally and faster than ever before.

One more point on this: If you are stuck on the entry workstation, then you may want to consider a mobile workstation. While the immediate cost will be higher than an entry-level workstation, the real cost may be lower. With mobile workstations you can design with your customers and not just for your customers. You may be able to reduce the number of design reviews by innovating with your customer right there as spontaneous ideas happen. The real cost of a tethered entry-level workstation may be indeed be much higher than you think.

Join the revolution and innovate faster with the digital workbench powered by two Intel(r) Xeon(r) 5500 processors

0 Comments Permalink
0

“Imagination is everything. It is the preview of life's coming attractions.” Albert Einstein

Today’s workstation can provide you with a magnificent digital canvas to create tomorrow ...... today!

With workstations powered by two Intel® Xeon® 5500 series processors, engineers have the opportunity to create, shape, test and modify products before they become real. Engineers can now design, visualize and simulate products from the conceptual design phase through the entire manufacturing process. This is done virtually before any investments are made in a prototype.

“Any color—so long as it's black.” Henry Ford

Like the automobile, the workstation has morphed into something much more than what it once was. It now has more capabilities and features than its predecessor and, if you allow it to, it can help you accelerate the pace of your innovation.

Today’s workstation gives engineers a new tool that can be likened to a digital workbench. This tool is powered by two Intel Xeon 5500 series processors with Intel® Turbo Boost Technology and Intel® Hyper Threading Technology to take advantage of the processor’s power and thermal headroom to enable increased performance of both multi-threaded and single-threaded workloads. 

Today's workstation can host a suite of software applications from ISV's like Autodesk, SolidWorks, PTC, Bentley and others to create and test their ideas. The pliers, hammer and nails found on a workbench in a garage or basement have now been replaced with digital tools that promise to accelerate innovations via a process known as digital prototyping. Its enablers include application tools like detailed CAD, CAE and PIM. Together they represent the new digital workbench—a powerful innovation tool you can use to bring your ideas forward faster than ever before.

“I confess that in 1901 I said to my brother Orville that man would not fly for fifty years.” Wilbur Wright

You think all you need is an entry-level workstation with a single Intel® Xeon® processor. After all, you only do CAD—right?  You may be thinking like Wilbur Wright.

Innovation in the workplace is paced by how well you can use technology to test and improve your ideas. As you begin to adopt modern workflows you may realize the dramatic impact that simulation-based engineering or digital prototyping can have on your product development cost and schedules.  You will soon realize that the cost of the second processor and additional memory necessary to support digital prototyping was far less expensive than the cost of multiple physical prototypes and the associated time to produce them. Instead of investigating hundreds of digital prototypes, you only have time to look at a single physical prototype and ask: What if I …?

Those “what ifs” could have been played out on a dual-processor Intel Xeon processor 5500 series-based digital workbench faster, and your time and cost of physical prototypes could have been significantly reduced.

Are you ready to adopt modern workflows and accelerate your innovation?

 

 

 

0 Comments Permalink
0

A virtual workstation uses both virtualization hardware and software technologies that, when combined, provides end users with an uncompromised workstation experience.  It gives engineers and IT user’s concurrent access to key workstation hardware functions previously not available with traditional virtualization technologies. Through this approach, you get near native access to key workstation services, such as those delivered by graphics cards or NICS, needed to run multiple high-performance applications regardless of the operating system they run on. . Best of all, with Intel® Virtualization technology for directed I/O, delivered by Parallels Workstation Extreme, you will be able to leverage this new virtual workstation capability in ways that improve workflows across operating systems while reducing IT management requirements.  This is a win/win for both you and IT.

Ok we have segregated compute resources between IT and the user. 

What can you get with Intel’s VT/d technology? 

How many times have you been faced with a need to run an application that runs on a prehistoric OS, or that runs 32 bit OS when your entire environment is running on 64 bit? Or maybe you need run two different graphics-intensive workloads in a LINUX and Microsoft Windows® environment.  With Intel VT-d and Parallels™ Workstation Extreme software you may be able to do just that at near-native speeds.

What else can you do with a virtualized workstation?

Have you ever been in a situation where one application requires version X and another application you use daily requires version Y of the same OS.  With Intel VT/d and Parallels™ Extreme® software you may be able run both at near native performance at the same time.  No rebooting, no dual booting or emulation required.  Just fast, seamless answers to complex problems – across multiple segments like Oil & Gas, DCC, Manufacturing, and Research.

Ever hear of a digital workbench?

It is a tool that designers and engineers use to perform what many call digital prototyping or simulation based engineering.  It is usually a set of tools that combine needs for LINUX and Microsoft Window® based applications to create and test their ideas.  Think of a virtual wind tunnel where simulations are performed in a LINUX environment and the design and visualization is performed in a friendly Microsoft Windows® environment.  With Intel VT-d and Parallels Workstation Extreme you can do both at near native speed.  That means interactive product development and engineering, and that leads to potentially better deigns in less time.

Do you need a virtualized workstation?

If you have a need to run applications in different OSes, diverse OS levels or types, or you need to visualize in different OSes, then answer is probably yes.

To learn more about Intel Virtualization Technology please visit www.intel.com/go/workstation.

To see an online demo of Parallels™ Workstation Extreme software please visit http://www.parallels.com/products/extreme

0 Comments Permalink
0

If you have been around the workstation community for a while you may be used to seeing numbers like 4, 8 12 64GB.  Those are the old numbers – sorry.

When the Intel® Xeon® processor 5500 series debuted it introduced systems with up to3 memory channels as opposed to just two.  So how much memory do you need now?  Well to get peak performance from your workstation you need to think in multiples of 3 and not 2.  That means not 4GB and certainly not 3 GB.  Numbers that work best are identified in the table below.

Number of Dimm’s

3

6

9

12

18

Size

Of

Dimm’s

2GB

6

12

18

24

36

4GB

12

24

36

48

72

8GB

24

48

72

96

144

Ok that is a lot of choices.  Which one is most likely to deliver the best workstation experience when you have three memory channels and why?

As before in the two channel days, the best experience will be arrived at when you evenly populate dimm slots.   That answer will vary on whether or not you have a single processor workstation or a dual processor digital workbench.

If you have a single processor entry level workstation you will want to configure your systems with 6, 12 or 18GB if you are using 2GB memory sticks.  If you are using 4GB sticks you want to think of 12, 24 and 36GB.  And if you are really thinking rich memory configurations you will want to focus on 24, 48 and 72GB configurations.  This assumes you have up to nine dim slots.

If you are using dual processor digital workbench and you are mega-tasking through a number of complex task you may want to consider the following sweet spots. 

With 2GB memory sticks you will need to think of 12, 24, 36Gb.  With 4GB memory sticks you should think of 24, 48 and 72GB and with 8GB memory sticks you should be thinking of 48, 96 and 144GB.  Again the goal is to keep the memory channels with the same memory sizes and speeds to see the best performance.  When you do that you are most likely going to see the best performance.

If you are wondering which kind of workstation is best for the work you do, you want to visit the Intel workstation technology page and use the workstation selection tool.  It can be found at www.intel.com/products/workstation/processors

0 Comments Permalink
0

At Intel when we think of scaling performance forward we think of one word, evolution, not revolution.

 

By evolution we mean developing high performance computing solutions that offer you the balance your applications require in order to deliver the best performance they can.  We do not maximize processor performance without matching it with the necessary memory capacity, bandwidth and system i/o.  We look to match these important components of performance to insure the data is where it needs to be, when it is needed to be in order to quickly and efficiently change it into actionable information.  We maximize your performance by minimizing your latencies.

 

Maximize your performance today, simplify your software development needs, and scale your performance forward as newer microarchitectures debut.

 

Seamless performance – bigger science – that is what we help you achieve faster than ever before.

 

To learn more about our approach to delivering highly effective HPC processors and software tools come to the Sun HPC Virtual Tradeshow on September 17th 2009 starting at 8am PDT.  In the virtual event attend the Intel presentation on “Accelerating Your Applications And Scaling It Forward” by Wes Shimanek & Dr. Nash Palaniswamy at 10:30am PDT.

0 Comments Permalink
1

The need to write scalable applications has been important for programmers in the HPC community for years. Now, with the proliferation of multi/many-core processors developing scalable software is now a top priority for many programmers. 

Andrew S. Tanenbaum stated at the USENIX ’08 conference last year that developing “sequential programming is really hard” … the difficulty is “parallel programming is a step beyond that.” 

He is right, but let’s illustrate why it is just a small step.

Here is the point – parallel architectures will continue proliferating and we will need to develop and refine parallel algorithms that exploit parallelism. While difficult, to develop and refine parallel algorithms, the actual programming of these new algorithms, does not need to be hard.  However, if the developer is required to know the intimate details of the hardware then the development and refinement parallel algorithms can be very difficult, and very time consuming.

One approach provided by Intel software developer tools is to abstract away the details of the hardware.  This allows the developer to focus on their algorithms /applications, and rely on Intel software developer tools to provide the best optimizations for current and future platform While you may give up some performance by being abstracted away, what you lose in performance will be rewarded by your ability to quickly iterate through more iteration of your parallelization ideas in less time.  You may find yourself designing and developing better approaches to parallelism because you were able to test more hypotheses. 

An additional by-product of being abstracted away from having to know the intricacies of the hardware is that your software will be highly adaptable to future platforms.  You will see tremendous improvements on multi-core solutions and will be in a great position scale your application performance forward as newer architectures are made available. 

To learn more Intel Software Tools and the benefits of optimizing your software on multi core based solutions first visit http://software.intel.com/en-us/intel-sdp-home/

1 Comments Permalink
0

Your most valuable employee is the one that creates tomorrow’s successes.  Providing them tools that help them do that faster will help your organization create new products or optimize old ones more rapidly.  The benefit to the organization is increased opportunities to win the customer’s attention via new products or your responsiveness to their request; the employee gets to brag on what he or she just helped bring to market.

Before we get too far let’s look at Intel’s mission with respect to workstations.  We are laser focused on supplying technology that provides users with an uncompromised experience in transforming their ideas into reality.  With that in mind we look at how users create; we try understanding their obstacles and work with the ecosystem of hardware and software providers to deliver solutions to real problems that may be inhibiting their opportunity to innovate.  

One technology that is helping users innovate faster is virtualization. 

No, we are not looking to remove the workstation from the user’s desk or share his or her workstation with peers, who also need a workstation.  We are using virtualization to deliver the performance they need to innovate faster.

The Observation

We saw workstation user’s innovation slow as they multitasked between tasks – some of them not even theirs.  The involuntary tasks included deploying IT security patches, updates, and system backups to name a few.  We also saw that users were no longer just doing Computer Aided Design (CAD) alone, but they were doing CAD, using productivity tools, meshing, web surfing for supporting facts, collaborating via video and Instant Messaging (IM) tools, digital white boarding and trying to do analysis-driven design.  They were very busy people who can’t afford any downtime or slow time.

In some cases we noticed that some users actually had not one, but two or more workstations running in completely different environments, many times with different OSs.

The Problem

What the above really lead to is a conclusion that too many tasks were going after too few resources and that the experience we had hoped the user would encounter was not happening.  In fact the reverse was happening – interactive creative tasks were slowing, system sluggishness was at an all time high.  The “uncompromised experience in transforming their ideas into reality” we wanted for a workstation user was not there and any innovation that was possible was slowed down to a crawl.

A Potential Solution

Intel® Virtualization Technology for Directed I/O (Intel VT-d), once just thought of for servers actually has a place in the workstation market. 

This technology provides an important step toward enabling a significant set of emerging usage models in the workstation. VT-d support on Intel platforms provides the capability to ensure improved isolation of I/O resources for greater reliability, security, and availability.  That is a mouth full let’s see it in action.

There are two key requirements that are common across workstation usage models.

1.       The first requirement is protected access to I/O resources from a given virtual machine (VM), such that it cannot interfere with the operation of another VM on the same platform. This isolation between VMs is essential for achieving availability, reliability, and trust. This helps you get the performance you want from your workstation.

2.       The second major requirement is the ability to share I/O resources among multiple VMs. In many cases, it is not practical or cost-effective to replicate I/O resources (such as storage or network controllers) for each VM on a given platform.

In the case of the workstation, virtualization can be used to create a self-contained operating environment, or "virtual software appliance[RC1] ," that is dedicated to capabilities such as manageability or security. These capabilities generally need protected and secure access to a network device to communicate with down-the-wire management agents and to monitor network traffic for security threats. For example, a security agent within a VM requires protected access to the actual network controller hardware. This agent can then intelligently examine network traffic for malicious payloads or suspected intrusion attempts before the network packets are passed to the guest OS, where user applications might be affected. Workstations can also use this technique for management, security, content protection, and a wide variety of other dedicated services. The type of service deployed may dictate that various types of I/O resources, graphics, network, and storage devices, be isolated from the OS where the user's applications are running.

The Result

In collaborating with virtualization and automation leader, Parallels, on its Parallels Workstation Extreme solution,  we identified two impediments to workstation user productivity.  The first was the issue around general resource overhead that afflict a traditional virtualized workstation system due to  insufficient resources to address the overload of requests. The second issue explored includes the more complex problem of a single workstation with the need to support multiple OSs and display visualization programs at near- or full-performance within virtualized machines.

The first issue was more straightforward - create VMs, partition resources and now the user has a very resilient workstation that is capable of delivering the intended experience.  IT can have their VMs and the user has his or her workstation back and the concept of digital prototyping to create and explore a complete product before it is built is a reality.  The creative innovator in the company can now iterate through more ideas in less time and your company created more opportunities to catch the customer’s attention just went through the roof.

The second issue offered a more complex challenge.  We identified certain industries such as the oil and gas exploration space where users actually had two or more physical workstations - one running Windows, the other running Linux. Both workstations had visual display requirements by the end user and both computers acted on the same reservoir data with applications that while similar in many ways, were still different in their functionalities and purpose.  In oil drilling projects that typically involve millions of dollars in capital investment, the confirmation of expected end results is an asset that far outweigh the costs of a few workstations. Nevertheless, in today’s economic setting, the ability to get the same functionalities at a lower cost is one of many key drivers in helping companies achieve healthy bottom lines.

The Proof Point For Virtualization In A Workstation Engineers from Schlumberger, a leading oil field service provider, run performance-demanding applications such as GeoFrame* and Petrel*.  These applications serve to analyze complex geological and geophysical data and determine the viability of potential reservoirs, or to optimize production at existing sites. With GeoFrame running on Linux* and Petrel on Microsoft Windows*, Schlumberger engineers have been using these applications on two separate physical workstations, driving IT spending higher, pushing down user productivity and increasing both power consumption and IT maintenance costs.

A New Paradigm For A New Day

With the availability of Intel Xeon processor 5500 series-based workstations, game-changing workstation virtualization software such as Parallels Workstation Extreme has opened up new horizons with breakthrough graphics performance with Intel’s latest processor technology. Parallels Workstation Extreme is built on top of the Parallels FastLane Architecture that effectively leverages the full potential of hardware resources such as graphics and networking cards to offer optimal workstation performance.

In comparison testing, Schlumberger compared the concurrent performance of applications running side-by-side on a virtualized Intel Xeon processor 5400 series-based workstation with the same setup on the newer Intel Xeon processor 5500-based machine. The results were astounding. The first machine with the older processor without Intel-VT-d support ran Petrel on the host OS at full native speed, but performance for GeoFrame in a VM slowed enormously. While Petrel refreshed its graphics at a rate of 30 frames per second, GeoFrame crawled along at a graphics refresh rate of JUST one frame every 19 seconds, an agonizingly slow performance on an older workstation without Intel VT-d support.

When the group tested the same applications on the newer Xeon 5500 series workstation with Intel VT-d support, the results were striking: Both applications – Petrel running on the host OS and GeoFrame in a guest OS in a VM - ran at full native speed, and both were able to refresh graphics at near 30 frames per second—a 570 times improvement over the first workstation.

Russ Sagert, Schlumberger’s Geoscience Technical Advisor for North America said “our engineers were blown away by the performance. We hammered these machines with extreme workloads that stressed every aspect of the system. Amazingly, the new workstation based on the Intel Xeon processor 5500 series provided performance enabling this multiple OS, multiple application environment for the first time.”

The key element in Schlumberger’s new environment is Intel Xeon processor 5500 series-based workstations with Intel® Virtualization Technology (Intel® VT) for Directed I/O (Intel® VT-d).  Together, these technologies enable direct assignment of graphics and network cards to virtual machines, enabling the machine to circumvent the interrupt and exit loop and clearing the previous performance problems.

Running in conjunction with Parallels Workstation Extreme, which effectively leverages Intel Virtualization Technology, including VT-d, the solution revolutionizes virtualization for high-end users. “High-performance virtualization on Intel Xeon processor 5500 series-based workstations is a game-changing capability,” says Sagert. “We can allocate multiple cores, up to 64 GB of memory and a dedicated graphics card to each machine. The results are spectacular.”

In the final analysis, moving to the Intel Xeon Processor 5500 series of next-generation workstations does far more than cut costs. It impacts the way that work gets done. If you have clients running the kind of resource-intensive, graphics-rich applications that traditionally slow to a crawl in a virtualized environment, consider the benefits of finally moving beyond the I/O barrier.

A fully configured Intel Xeon Processor 5500 series-based workstation running Parallels Workstation Extreme delivers the performance level that makes a virtualized workstation a leading contender for users with multi-workstation requirements. A streamlined work interface, reduced office noise and clutter, access to the same data repository and significant performance gains works on the user side. But the IT organization also gains benefits by lowering capital, management, support, provisioning, data protection, space, and energy and cooling costs.

Moreover, the IT team can now standardize on a single OS image while addressing alternative requirements.

Learn More

Intel Workstation Processors http://www.intel.com/products/workstation/processors/index.htm

Parallels Workstation Extreme

http://www.parallels.com/products/extreme


#

[RC1]To distinguish from the hardware appliance breed

0 Comments Permalink
1

I have been around the supercomputing market for over 25 years and have had an opportunity to see some interesting ideas come and go.  Let me share two that I experienced firsthand. 

·         CDC’s Cyber205 or a Cray 1S.  The CRAY-IS and the CDC Cyber 205 both offered effective vector processing, however, code conversion between them may have required some significant algorithmic changes. Cray of course won the HPC race at that time.  Note, the Cyber 205 was a tremendous performer, when you could keep their extreme.ly long vector pipeline busy. However, one branch or gap in the vector processing pipeline would cause a flush of the vector unit and what performance advantage you appeared to have vs. a Cray 1S was quickly erased.

·         An early day accelerator was Floating Point Systems.  In particular the FPS 164 was an awesome “off load” system where the needs of a few users were satisfied with better throughput than the Cray X-MP and Y-MP of the day. Convex, had a better idea.  It was better at serving the needs of more than an FPS 164 and was simpler to develop, maintain and scale software to next generation systems.

So what are the lessons from history? Perhaps it is that there it is there is a tight connection between application, architectures and algorithms and that it is extremely important to maintain a level of application flexibility and versatility in order to adopt new architectures as they become available in the market.  The old adage still remains true, software will outlive the useful life of hardware.  So it is important to be able to quickly adapt new shifts.

The same questions probably still apply today as they did when Cray, CDC and FPS were around.

When does an accelerator computing strategy work best?

The easiest answer is if your application is extremely data parallel in nature, then it may be well suited for an accelerator strategy. The word extremely is the critical part. 

If your application only performs some level of data parallelism and includes task, thread and cluster level parallelism or contains a small fraction of branching or is host to irregular data sizes, then perhaps an accelerator may not be the best fit.

How much real performance will an accelerator strategy deliver? 

Often times we hear claims of 10X, 20X or even greater than 30X. 

These are great headlines, but as many have noted, you need to understand an accelerators impact on the total execution time of your application.  What may have been 10X to 30X or more on a kernel of the application may only deliver a mere 2X to 3X or even less in terms of total application performance improvement.

Of course the real question is what are we really comparing performance speed ups to?

I have seen well tuned software on accelerators compared to “baseline” code running on one core of an old processor.  However, when you use available software technology and turn compiler flags on and add in a math kernel library call the performance on multi-core solutions can jump by over 10X and in some cases can exceed 30X multiples for total execution time.  This standards based accelerated software will scale forward as newer microarchitectures are made available from Intel.

Why is the difference between the promise and the actual performance so great? 

Always a good questionJ. 

The promise deals with a small part or a kernel of the software that is data parallel and can potentially scale linearly as more compute resources are added.  Again if the application is extremely data parallel, then an accelerator strategy may be the correct approach.

However, when the actual performance result, or total application performance, is significantly different it is often because of several things. 

·         One common reason is that you may be comparing optimized software on multi-core systems to optimized software on an accelerator.  When I compare similarly optimized software on a multi-core system I see that 20 – 30X difference often fades to less than 2X  and in most cases better than hardware accelerators.  This is because optimized software on a multi-core solution accelerates all components of the application.

·         Another situation is the bandwidth imbalance of the attach points of the accelerators, typically the attach speeds do not match the memory bandwidth or the ALU speed on the accelerators and the theoretical peak flops are tough to achieve.  Sometimes, for larger workloads due to limited amount of memory on the accelerator card, performance deteriorates.

·         Another situation may be that your application depends on different forms of parallelism which include task, thread or cluster level parallelism and even in some cases sequential forms of your software

So back to the differences in performance between the Cray 1 and CDC Cyber 205.

While Cyber 205 was great at edges of science the Cray proved to be the workhorse of high performance computing.  It offered better system balance than the Cyber 205.  Here is an example, if you take great care to optimize your software for a particular architecture you will no doubt see tremendous performance gains.  However, like the Cyber 205, if you break that pipeline you need to pay for the overhead to restart the long vector pipeline.  Often times, even with today’s accelerators, that start up cost reduces what appears to be stellar performance gains of the Cyber 205 to being no better than, or sometimes, even slower than the Cray 1.  There were of course examples with the Cyber 205, as there is today with accelerators that demonstrate where select sciences can see tremendous advantages over traditional computing solutions.

What other considerations may weigh in your decision to adopt an accelerator strategy?

Are you constantly refining your software?
Many researchers would probably answer yes.  They are constantly refining their software to improve the results the performance or both.

As I mentioned at the beginning of the blog, the old adage still remains true, software will outlive the useful life of hardware.  So it is important to be able to quickly adapt to new shifts.  One way to simplify these moves is to use standards based tools which can give you the flexibility to create applications that can use the multiple types of parallelism mentioned above via tools, compilers, and libraries.  You may also want to use standards based tools to acquire the versatility you need in order to scale your software across multiple architectures – e.g. large, many and heterogeneous cores. 

The caveat with respect to using non standard tools is that you become locked into a specific architecture.  If that architecture from the same vendor would happen to change, you may be required to make some significant changes (e.g. tuning to grain sizes).

Do you want to maintain, support and update multiple code bases?
I don’t.  I want to invest n the development of parallel algorithms.  The old adage is that software will far out live any hardware implementation still applies and I need the flexibility and versatility to quickly and as painlessly as possible be able to adopt new architectures as they are made available.  I do not want to invest in maintaining, supporting and updating an ever increasing set of code streams as newer architectures are made available.

Our team goal at Intel is to develop software tools and hardware technology that can help you scale-forward your application performance to future platforms without requiring a massive rebuild – just drop-in a new runtime that is optimized for the new platform to experience the improvement (akin to the printer/display driver model, buy a new printer/display, install the respective driver, and your system enjoys improved benefits).  That is the goal.

If you want to learn more about what we are doing to deliver high performing HPC solutions that are both flexible and versatile please visit www.intel.com/go/hpc

1 Comments Permalink
0

Your most valuable employee is the one that creates tomorrow’s successes.  Providing them tools that help them do that faster will help your organization create new products or optimize old ones more rapidly.  The benefit to the organization is increased opportunities to win the customers attention via new products or your responsiveness to their request; the employee gets to brag on what he or she just helped bring to market.

Before we get to far let’s look at Intel’s mission with respect to workstations.  We are laser focused on supplying technology that provides users with an uncompromised experience in transforming their ideas into reality.  With that in mind we look at how users create; we try understanding their obstacles and work with the ecosystem of hardware and software providers to deliver solutions to real problems that may be inhibiting their opportunity to innovate.  

One technology that is helping users innovate faster is virtualization. 

The Observation

We saw workstation user’s innovation slow as they multitasked between tasks – some of them not even theirs.  The involuntary task included IT security patches, updates, and system backups to name a few.  We also saw that users were no longer just doing CAD, but they were doing CAD, using productivity tools, meshing, web surfing for supporting facts, collaborating via video, digital white boarding and trying to do analysis driven design.  They were very busy people.

In some cases we noticed that some users actually had not one, but two workstations running in completely different environments, many times different OS’s.

The Problem

What the above really lead to is a conclusion that too many task were going after too few resources and that the experience we had hoped the user would encounter was not happening.  In fact the reverse was happening – interactive creative task were slowing, system sluggishness was at an all time high.  The “uncompromised experience in transforming their ideas into reality” we wanted for a workstation user was not there and any innovation that was possible was slowed down to a crawl.

A Potential Solution

Intel® Virtualization Technology for Directed I/O, once just thought of for servers actually has a place in the workstation market. 

This technology provides an important step toward enabling a significant set of emerging usage models in the workstation. VT-d support on Intel platforms provides the capability to ensure improved isolation of I/O resources for greater reliability, security, and availability.  That is a mouth full let’s see it in action.

There are two key requirements that are common across workstation usage models.

1.       The first requirement is protected access to I/O resources from a given virtual machine (VM), such that it cannot interfere with the operation of another VM on the same platform. This isolation between VMs is essential for achieving availability, reliability, and trust.

2.       The second major requirement is the ability to share I/O resources among multiple VMs. In many cases, it is not practical or cost-effective to replicate I/O resources (such as storage or network controllers) for each VM on a given platform.

In the case of the workstation, virtualization can be used to create a self-contained operating environment, or "virtual appliance," that is dedicated to capabilities such as manageability or security. These capabilities generally need protected and secure access to a network device to communicate with down-the-wire management agents and to monitor network traffic for security threats. For example, a security agent within a VM requires protected access to the actual network controller hardware. This agent can then intelligently examine network traffic for malicious payloads or suspected intrusion attempts before the network packets are passed to the guest OS, where user applications might be affected. Workstations can also use this technique for management, security, content protection, and a wide variety of other dedicated services. The type of service deployed may dictate that various types of I/O resources, graphics, network, and storage devices, be isolated from the OS where the user's applications are running.

The Result

Working with Parallels Workstation Extreme VM application we looked at two problems.  First was the general overhead related to too many request and too few resources and then we explored the more complex problem of a single workstation with a need to display at near native performance in two different OS’s.

The former was straight forward, create VM’s, partition resources and your innovator now has a very resilient workstation that is capable of delivering the intended experience.  IT can have their VM’s and the user has his or her workstation back and the concept of digital prototyping to create and explore a complete product before it is built is a reality.  Your innovator can now iterate through more ideas in less time and your company’s opportunity to catch the customer’s attention just went through the roof.

The former provided a much harder challenge.  We tested the idea in the oil and gas market where users actually had two workstations; one running Windows, one running LINUX. Both had a requirement for visual display and both acted on that same reservoir data with applications that while similar in many ways, they were still different.  When preparing to drill a multimillion dollar well – the idea of more data saying the same thing is a very good thing.

The Proof Point For Virtualization In A Workstation Engineers from Schlumberger, a leading oil field service provider, run performance-demanding applications such as GeoFrame* and Petrel*.  These applications serve to analyze complex geologic and geophysical data and determine the viability of potential reservoirs, or to optimize production at existing sites. With GeoFrame running on Linux* and Petrel on Microsoft Windows*, Schlumberger engineers have been running these applications on two separate workstations, driving down productivity and increasing both power consumption and IT maintenance costs.

A New Paradigm

With the advent of Intel Xeon processor 5500 series-based workstations running Parallels Workstation Extreme, virtualization software has opened new horizons with breakthrough graphics performance.

Schlumberger compared the concurrent performance of applications running on a virtualized Intel Xeon processor 5400 series-based workstation with the same setup on the Intel Xeon processor 5500-based machine. The results were astounding. The first machine ran Petrel at full native speed, but performance for GeoFrame slowed enormously. While Petrel refreshed its graphics at a rate of 30 frames per second, GeoFrame crawled along at a graphics refresh rate of JUST one frame every 19 seconds, an agonizingly slow performance.

When the group tested both applications on the Xeon 5500 series workstation, the results were striking: Both applications ran at full native speed, and both were able to refresh graphics at 30 frames per second—a 570 times improvement over the first workstation.

Russ Sagert, Schlumberger’s Geoscience Technical Advisor for North America said “our engineers were blown away by the performance. We hammered these machines with extreme workloads that stressed every aspect of the system. Amazingly, the new workstation based on the Intel Xeon processor 5500 series provided performance enabling this multiple OS, multiple application environment for the first time.”

The key element in Schlumberger’s new environment is Intel Xeon processor 5500 series-based workstations with Intel® Virtualization Technology (Intel® VT) for Directed I/O (Intel® VT-d).  Together, these technologies enable direct assignment of graphics and network cards to virtual machines, enabling the machine to circumvent the interrupt and exit loop and clearing the previous performance problems.

Running in conjunction with Parallels Workstation Extreme, which effectively leverages Intel Virtualization Technology, including VT-d, the solution revolutionizes virtualization for high-end users. “High-performance virtualization on Intel Xeon processor 5500 series-based workstations is a game-changing capability,” says Sagert. “We can allocate multiple cores, up to 64 GB of memory and a dedicated graphics card to each machine. The results are spectacular.”

In the final analysis, moving to the Intel Xeon Processor 5500 series of next-generation workstations does far more than cut costs. It impacts the way that work gets done. If you have clients running the kind of resource-intensive, graphics-rich applications that traditionally slow to a crawl in a virtualized environment, consider the benefits of finally moving beyond the I/O barrier.

A fully configured Intel Xeon Processor 5500 series-based workstation running Parallels Workstation Extreme delivers the performance level that makes virtualization a contender for these users. A streamlined work interface, reduced office noise and clutter, and significant performance gains works on the user side. But the IT organization also gains benefits by lowering capital, management, support, space, and energy costs.

Moreover, the IT team can now standardize on a single OS image while addressing alternative requirements.

Learn More

Intel Workstation Processors http://www.intel.com/products/workstation/processors/index.htm

Parallels Workstation Extreme

http://www.parallels.com/products/extreme

 

0 Comments Permalink
1

What do Jack Welch, Henry Ford and Albert Einstein have in common?

In their day they were innovators and they all accepted the need to adopt and change. 

Einstein once said “The definition of insanity is doing the same thing over and over again and expecting different results.” He obviously had no interest in repeating history over and over again.

Ford was heard to say “I am looking for a lot of men who have an infinite capacity to NOT know what can't be done.” He obviously was looking for people who asked “why” as opposed “why not.”  He sought thinkers and tinkerers.

Jack Welch had the shortest, but the most interesting quote.  “Change before you have to.”  As we all know he embraced change and created it too.

Technology gives us an opportunity to look at what we are doing now and find a better way to do it.  Used correctly, it can help you do more in less time with higher quality results.  Take the digital workbench (aka dual processor Intel® Xeon processor 5500 series based workstation) as an example.  While it can do CAD, it is really capable of much more.  It can help users model more what if’s than ever.  It can help users create and test ideas digitally long before they are made into physical prototypes.  While these new workstations can do this many continue to do just CAD.

What new workflows can you think of that will radically change the rate of your innovation?

1 Comments Permalink
2

Go Beyond the Kernel:

Refocusing HPC Benchmarking on Total Application Performance

 

Want to improve application performance by 10x or 100x? Few HPC customers would say no. Yet in some cases, the promises of tremendous performance improvements from accelerators, attached processors, field-programmable gate arrays, and the like evaporate when total application performance is evaluated. Benchmarks that focus on kernel or even partial application performance provide incomplete picture with respect to the impact on total application benchmarking.  While difficult, HPC customers should look to test total application performance.

Why benchmark?

Benchmarking is an essential means for helping end users choose and configure HPC systems. An end user has a problem and needs to know the best way to solve it. More specifically, the end user has a specific workload to run and needs to find hardware that can deliver the best performance, reliability, application portability, and ease of application maintenance. As Purdue University researchers wrote in a recent IEEE article that argued for real application benchmarking, an HPC benchmark should, among other things, produce metrics that help customers evaluate the overall or total time to solution for their problems.

The claim of a 10x to 100x improvement from a particular product can easily grab someone’s attention. But what does that 10x measurement really mean? In many cases, these claims are derived from kernel or partial application benchmarking, which might fail to tell the whole story. While an increase in floating-point performance or the addition of a CPU accelerator could deliver a significant improvement for one kernel, the total application improvement depends on additional HPC system elements. As one participant argued in a recent HPC conference reported by IDC, solution time can be represented as an equation:

Solution time = processing time + memory time + communication time + I/O time – all four combine to form total application time.  The Caveat Emptor is to make sure you analyze your application; understanding what you are measuring and ensuring that you have the balanced architecture to deliver the best performance.  I am biased but I think Intel’s soon to be announced Nehalem processor delivers just that. 

Kernel benchmarking has its place, but benchmarking total (or “real”) application performance is critical for accurately evaluating HPC systems.

2 Comments Permalink
0

Do you really need a workstation?  It depends…. While I am biased, the actual investment difference between an entry level Intel-based workstation and an Intel-based business desktop with similar graphics features has compressed so much that today the answer is -------------------------------------yes you really do need a workstation. I was recently at an ISV's user meeting and workstaion vendors were demonstrating visually compelling reasons to make that leap to a workstation from a desktop.   So what is an Intel-based workstation when compared to a business desktop?  Similar to the difference between a professional athlete and recreational athlete; workstations are typically faster and smarter at what they do.  Workstations are purpose built to do a job.  They provide the necessary processing capacity, access to the professional grade graphics adapters and enough expansion to accommodate the memory capacity you need to work with a: • Bigger canvas if your are digital content creator, • Larger assemblies if your are product designer, or • More complete oil reservoirs if you are a geophysicist. But if you want to really increase the pace you can create, you may want to step up to a virtual workbench that delivers near supercomputing performance to your workstation.  That is another blog. To learn which Intel based workstation is best for your needs please use our “Mobile, Professional, Expert—Which processor is right for your workstation needs tool” on http://www.intel.com/products/workstation/processors

0 Comments Permalink
0

Today's workstation with (2) Intel® Xeon® 5400 Series processors transforms a workstation from a simple design terminal into a powerful engineering tool that helps users potentially compress the time between an idea and a product. At Intel we call that "working differently". In the manufacturing vertical market it may also be called:

  • digital prototyping,

  • analysis driven design or

  • simulation based product design.

 

No matter what you call it, these new ways of working help manufacturers virtually explore complete products before they are built-so they can create, validate, optimize, and manage designs from the conceptual design phase through the manufacturing process.

 

 

Digital prototyping processes may also require that you also rethink your workstation deployment strategy; moving from single processor workstations, to two (2) Intel Xeon 5400 processor based workstations with 8 computational cores and up to 100 peak gigaflops of floating point performance. This compute capacity coupled with the robust visualization environment enabled by two (2) PCI Gen 2 graphics adapters changes would be workstations into workstation supercomputers and enables engineers and designers to concurrently perform traditional CAD design as well moderate size analysis (e.g. over 5M degrees of freedom finite element modeling jobs or up to 10M cell fluid dynamic simulations).

 

 

This combination (CAD and Analysis) can help organization optimize CAD parts or assemblies under a wide range of physical scenarios including mechanical and thermal effects. Net result is workstation supercomputers based on Intel® Xeon® 5400 Series processor can help to bring higher quality or more innovative ideas to markets faster than ever before.

 

 

0 Comments 0 References Permalink
0

 

Ok, so you have bought into the potential benefits of simultaneous workflows from my previous blogs where organizations may choose work differently and develop parallel workflows in order to potentially:

 

  • compress the time between an idea and a product or,

  • innovate faster and deliver finished animations in less time

  • or simply work on larger more complex assemblies or analyses

 

 

 

Ok so now it's time to say I doubt the performance is really available to get this done.

 

 

 

 

The honest answer is it depends on how much you want to stress the system, because there are limits. But Russ Sagert, Petrel Portfolio Manager from Schlumberger Information Systems suggests "workstations based on the new Intel® Xeon® 5400 series processors change the game for our customers. These new workstations featuring faster processors, larger memories, two new advanced PCI Gen2 graphics adapters give users a remarkably interactive experience and an opportunity to employ parallel geophysical workflows that combine complex large scale attribute analysis and volume visualization. Net result: Higher quality results and, better more informed decisions in less time than ever before."

 

 

 

 

Alright that is mouthful, but today's Intel® Xeon® based workstations do present the performance necessary to potentially change your game and help you innovate faster - are you game?

 

 

 

 

So what is the secret behind the performance?

 

 

 

 

The "Core" is. Not 4 cores, not 2 cores, or 8 even cores -- just Intel's new Core microarchitecture. Introduced in 2006, and extended with the availability of the Intel® Xeon® 5400 series, Intel's new Intel's new Core microarchitecture is the foundation that enables demanding users with the processing capacity and visualization capability to change the way they work and use simultaneous workflows to continuously iterate through design ideas, rapidly test via simulation, and find design errors earlier. The Intel Core Microarchitecture helps deliver the power to innovate.

 

 

 

 

Here is a quick look at some of the Intel innovations that are changing the performance game - we will discuss them more in future blogs:

 

 

 

  • Intel® Wide Dynamic Execution - Increases performance and energy efficiency by enabling the delivery of more instructions per clock cycle and enhancing the efficiency of instruction execution.

  • Intel® Advanced Digital Media Boost - Effectively doubles the instructions' execution speed compared to previous Intel based solutions and accelerates a broad range of applications, including financial, engineering, scientific video, speech and image, photo processing, and encryption software.

  • Intel® Advanced Smart Cache, which is optimized for multi-core processing and reduces latency to frequently used data

  • Intel® Smart Memory Access optimizes the use of the available data bandwidth from the memory subsystem and

  • Intel® Intelligent Power Capability improves energy efficient performance and reduces noise and energy consumption.

  • Macro Fusion combines instructions into a single instruction for execution, delivering more efficient processing and faster throughput.

 

 

 

Net result is Intel's new Core Microarchitecture can help you hit a high note in performance and an even higher note in innovation when you move to simultaneous workflows.

 

 

0 Comments Permalink
1 2 Previous Next

Filter Blog

By author: By date: By tag: