I try to extract the text of this pdf https://gofile.io/?c=6U8qE8. I have a rack application inside a docker container running on Ubuntu 18.04.
After calling Docsplit.extract_text('spec/test.pdf', ocr: true, language: 'eng', output: 'spec/output.txt') I see the process gs uses the most cpu power and I lose 1GB of diskspace every 5 seconds until there is no space left.
Maybe someone has an idea what is going wrong here?