A tensorflow siamese network implementation. Illustrated using singature recognition/identification.
-
Updated
Jun 25, 2019 - Python
A tensorflow siamese network implementation. Illustrated using singature recognition/identification.
Creates synthetic degraded image documents that could be used to train Neural Networks
Tools necessary to perform a multi-fold pretrained voting approach utlizing OCRopus.
~1000 book pages + OpenCV + python = page regions identified as paragraphs, lines, images, captions, etc.
A selection of test lines of several early printed books as well as the corresponding individual OCRopus models and mixed models.
Online-handwritten version of the George Washington Dataset.
A synthetic data generator for text recognition
Code and procdures for handwriting object detection and recognition
A repository with anonymized invoices
Total Text Dataset - ICDAR 2017. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one of a kind.
Distorted Document Images dataset (DDI-100).
Dataset for scene text removal
A tensorflow reproducing of paper “Editing Text in the wild”
CORD: A Consolidated Receipt Dataset for Post-OCR Parsing
This Web application crawls PDFs from governement websites, performs table detection and displays advanced statistics.
Generate text images for training deep learning ocr model
Ground truth line annotations for the Berliner Börsen-Zeitung
Add a description, image, and links to the aniketdata topic page so that developers can more easily learn about it.
To associate your repository with the aniketdata topic, visit your repo's landing page and select "manage topics."