OCR Verse is a Streamlit-based app for extracting text from PDFs and images using Mistral OCR. It supports input via document URLs, file uploads, and image URLs. The extracted text is displayed in an expandable section for easy viewing.
Access the application here.
- Checkout the GitHub repo.
gh repo clone lavvsharma/ocr_verse - Install the required dependency
pip install .- Run the streamlit application
streamlit run OcrVerse.pyThe app aims to integrate multiple OCR models, bringing them together in one platform for easy testing and comparison. Additionally, users will be able to configure their own API keys, providing more flexibility and customization for different OCR services.
This package generally follows SemVer conventions, though certain backwards-incompatible changes may be released as minor versions:
- Changes that only affect static types, without breaking runtime behavior.
- Changes to library internals which are technically public but not intended or documented for external use. (Please open a GitHub issue to let us know if you are relying on such internals).
- Changes that we do not expect to impact the vast majority of users in practice.
We take backwards-compatibility seriously and work hard to ensure you can rely on a smooth upgrade experience.
We are keen for your feedback; please open an issue with questions, bugs, or suggestions.
Python 3.12 or higher.
