NEW: DXVA2-CopyBack uses D3D9Ex to allow headless operation
Changed: Increased the maximum number of decode threads to 32
Changed: Rebalanced the "Auto" thread strategy to use the exact number of available CPU cores instead of 1.5x that number
Fixed: Certain H264 streams could cause a crash in 0.70 due to a lack of buffer padding