One additional (to the larger bandwidth needed and the faster readout) reason should be the different design of V3. It looks from sensorgen results that it is optimized for higher photo-electron capacity (larger floating diffusion)

(13544/11502)*(18/14) is around 50% more capacity per area ..

which drives the designers to use lower conversion gain (which means more read noise)