It is super slow, I would suggest you use PyMuPDF, it is built directly on C language and provides nearly 10x the speed. I used it in production where i had to index quite close to 33,000 files ...
Microsoft Threat Intelligence analyzed a cryptocurrency clipper campaign that combines clipboard theft, wallet replacement, ...