I have a parallel code which uses both pthread and threaded mkl.
Attached is a minimal working example. I set the environment
variable MKL_NUM_THREADS equal to 2. I run this code and
the ratio walltime/cputime is 2.5.
When I comment out the pthread part in the for loop, compile and run,
the ratio is 1.96. When I comment out the dgemv_ and compile, the
ratio is 1.81.
If my understanding is correct, the ratio for the first case should be
below 2? Or there are some conflicts between pthread and threaded mkl?
Please correct me if I was wrong.
I'm sorry for the delay in response! It looks like you have a software question and this is the forum for Intel's technical products. Here's a link to our Intel Developer Zone forums:
This might be the specific forum that could answer your question: Intel® Moderncode for Parallel Architectures
I hope that helps. Thanks,
Support Community Manager