I would just like to mention that I also confirmed this performance issue for NikosD previously via the iGPU on my own Pentium G3258 (overclocked to 4.6GHz) on Windows 7 64bit.
For me it was a difference of 42fps vs 56fps (both with 100% CPU utilization); as someone that watches a great deal of 50fps content (yes 50, not a typo), that would be the difference between perfectly fine and completely unacceptable performance.
UPDATE: I can also confirm that, on my old Intel 965GMA integrated graphics, the performance of NV12 matches the performance of YV12.
I tried an ancient laptop with Win 10 x64 - iGPU 945GM (GMA 950) and the results were the same like 965.
Same performance for NV12/YV12
I tried my SandyBridge system Win 10 x64 - Core i5 2400 - iGPU HD 2000 - v4229 and Radeon 5750 - Catalyst 15.10
The system has two graphics cards, so I tested both on the above clip 4K VP9
The performance is the same using Intel or ATI and exactly the same using NV12/YV12.
In all cases the average fps is 84fps.
Playback (vanilla EVR renderer - scale to 1280x720)
NV12/YUY2 same performance like decode (renderless) mode ~84fps
Intel HD 2000
YV12 50/60/66 CPU usage 69%
NV12 42/51/64 CPU usage 86%
Even though NV12 for SandyBridge HD 2000 uses a lot more CPU than YV12, still doesn't catch YV12's performace.
From the above tests it is obvious that older Intel GPUs (like 945, 965) don't suffer from performance hit using NV12, but from SandyBridge and onwards the performance hit is clear when NV12 is used, instead of YV12.
Still, waiting for your feedback.
I can confirm.
I'll get the investigation started on this one. I'll be in touch as needed for updates/questions. Thanks for raising this.
Bug submitted and in queue for investigation. Will follow up as updates come. Thanks.
Thank you very much for your prompt actions!
Hope we have good news soon.
Still work-in-progress on debugging root cause. Stay tuned...
I have a few indications showing me that the problem has been - somehow - fixed, but not completely.
For Haswell owners, latest drivers for Win 7/8/8.1 and Win 10 seem to have the same performance using NV12 and YV12.
I think the same goes to Broadwell and Skylake, but I don't know about Ivy and SandyBridge, which I think don't have updated drivers.
BUT even for my Haswell using 4326 latest driver for Win 10, I 've seen some "ugly" scaling algorithms using NV12, like those of YV12.
It seems that the new drivers somehow "cheat" in quality (scaling algorithms and others) in order to achieve the same performance as YV12, which already had lower quality and possibly that was the reason to be faster.
We, as users, want both. Speed and previous quality using NV12.
Hope it helps in order to further investigate the issue.