Forum Moderators: Robert Charlton & goodroi
If you block PDFs from crawling via robots.txt (or return noindex response header for PDFs) then yes, you could re-use PDF text on website pages with no problems.
I'll see if there is some viable way for those PDFs to be taken down from the various sites that host them, but I think it might be a pretty big challenge.