That's because, in CQ 5.6, the Tika bundle no longer uses PDFBox; it uses the Adobe PDF Toolkit (com.adobe.granite.gibson) instead. If you want to extract text from a PDF, you should go through Tika rather than calling the parser directly.
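As a rough illustration of going through Tika instead of a specific parser, here is a minimal sketch using the `org.apache.tika.Tika` facade and its `parseToString(InputStream)` method. The class name `PdfTextExtractor` and the file path argument are made up for the example, and it assumes the Tika packages are visible to your code (inside CQ that means importing them from the Tika bundle); I have not verified how the facade behaves inside the CQ 5.6 OSGi container specifically.

```java
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;

import org.apache.tika.Tika;
import org.apache.tika.exception.TikaException;

// Hypothetical helper class, named for this example only.
public class PdfTextExtractor {

    // Hands the stream to the Tika facade, which detects the content type
    // and delegates to whichever parser is registered for it, so this code
    // never references PDFBox or any other PDF parser class directly.
    public static String extractText(InputStream in) throws IOException, TikaException {
        Tika tika = new Tika();
        return tika.parseToString(in);
    }

    public static void main(String[] args) throws Exception {
        // args[0] is assumed to be the path to a PDF file on disk.
        InputStream in = new FileInputStream(args[0]);
        try {
            System.out.println(extractText(in));
        } finally {
            in.close();
        }
    }
}
```

Because the code only depends on the Tika API, it should keep working regardless of which PDF parser the bundle happens to ship with.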