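import java.io.File;
import org.apache.pdfbox.pdmodel.PDDocument;
import org.apache.pdfbox.text.PDFTextStripper;
//Loading an existing document (PDDocument.load and the import paths above assume PDFBox 2.x)
File file = new File("C:/PdfBox_Examples/new.pdf");
PDDocument document = PDDocument.load(file);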
//Instantiate PDFTextStripper class
PDFTextStripper pdfStripper = new PDFTextStripper();
//Retrieving text from PDF document
String text = pdfStripper.getText(document);
//Closing the document to release its resources
document.close();