Extract text from various file types before resorting to an OCR solution.
antiword
pdftotext/poppler
Nick Weiland
April 22, 2025 1:54am
MIT