SkeView is an annotation tool based on the theory of diagnosticity hypothesis (Amos Tversky, Psychological Review, 1977) for shape and image skeletons. Based on SkeView, we annotate high-quality skeleton Ground Truth (GT) of 16 existing shape and image datasets.
Paper link: https://link.springer.com/article/10.1007/s11263-023-01926-3 (Accepted by IJCV 2023).
- Image datasets:
- SK506: SK506 (also known as SK-SMALL) was selected from the MS COCO dataset, with 506 natural images (300 for training and 206 for testing) and 16 object classes, including humans, animals, and artifacts. (Shen et al. IEEE CVPR 2016)
- SK1491: SK1491 (also known as SK-LARGE) is an extension of the SK506 by selecting more images from the MS COCO dataset. It includes 1,491 images (746 for training and 745 for testing). (Shen et al. IEEE TIP 2017)
- SYMMAX300: SYMMAX300 is adapted from the Berkeley Segmentation Dataset (BSDS300) with 300 images (200 for training and 100 for testing). There are multiple targets in most images. This dataset is used for local reflection symmetry detection, which is a low-level image feature, without paying attention to the concept of "object". (Tsogkas et al. ECCV 2012)
- SymPASCAL: SymPASCAL was selected from the PASCAL-VOC dataset, with 1,435 images (648 for training and 787 for testing). Most images contain multiple targets, partial visibility, and complex backgrounds. (Ke et al. IEEE CVPR 2017)
- Both image and shape datasets:
- EM200: EM200 contains 200 microscopic images (10 classes) of environmental microorganisms (EM). (Yang et al. ICPR 2014)
- SmithsonianLeaves: SmithsonianLeaves contains 343 leaves (187 for training and 156 for testing) from 93 different species of plants. (Ling et al. IEEE T-PAMI 2007)
- WH-SYMMAX: WH-SYMMAX contains 328 cropped images (228 for training and 100 for testing) from the Weizmann Horse dataset. Each image contains one manually segmented target. (Shen et al. Pattern Recognition 2016)
- Shape datasets:
- Animal2000: Animal2000 contains 2,000 shapes (20 categories, 100 shapes each) ranging from poultry and domestic pets to insects and wild animals. Each class is characterized by large intra-class shape variations. (Bai et al. ICCV Workshops 2009)
- ArticulatedShapes: ArticulatedShapes contains 40 images from eight different objects. This challenging dataset consists of various tools, including scissors with holes. (Ling et al. IEEE T-PAMI 2007)
- Kimia99: Kimia99 contains 99 shapes (9 categories, 11 shapes each) assembled from various sources such as tools and hands, etc. (Sebastian et al. IEEE T-PAMI 2004)
- MPEG7: MPEG7 contains 1,400 (70 categories, 20 shapes each) shapes defined by their outer closed contours. It poses challenges with respect to deformation (e.g., change of viewpoints and non-rigid object motion) and noises (e.g., quantization and segmentation noise). This dataset is actively used for benchmarking shape representation, matching, and retrieval algorithms. (Latecki et al. IEEE CVPR 2000)
- Kimia216: Kimia216 contains 216 shapes (18 categories, 12 shapes each) selected from the MPEG7 dataset. It is actively used in skeleton extraction, pruning, matching, and shape retrieval scenarios. (Sebastian et al. IEEE T-PAMI 2004)
- MPEG400: MPEG400 contains 400 shapes selected from the MPEG7 dataset (20 categories, 20 shapes each). Instead of directly using the original shapes, boundary noises of these shapes were manually removed for ablation study. (Yang et al. IEEE ICIP 2014)
- Tetrapod120: Tetrapod120 contains 120 tetrapod animal shapes from six classes. As the shapes of some species are visually similar, this dataset is normally employed to evaluate shape matching and fine-grained classification algorithms. (Yang et al. Pattern Recognition 2016)
- SwedishLeaves: SwedishLeaves contains 1,125 leaf shapes from 15 different Swedish tree species, with 75 leaves per species (25 for training, 50 for testing). This dataset is challenging as some species are quite similar. (Oskar Söderkvist, Linköping University, 2001)
- Tari56: Tari56 contains 56 shapes (14 categories, 4 shapes each) for evaluating matching performance under visual transformations. Shapes of the same category show variations in orientation, scale, articulation, and small boundary details. (Asian et al. ICCV 2005)
To ensure the smoothness of your research, we provide the original shapes/images, ground truth, and thumb on this page:
- Original Shapes/Images: Original data from various sources, such as GitHub pages, personal webpages, and email exchanges. To the best of our knowledge, this is the most complete zoo of existing shape and image datasets. We do our best to push this domain forward. If you find any copyright issues, please contact me via email.
- Ground Truth: With .mat and .png formats.
- .mat: for Matlab users. Each file contains three cells:
- cell 1: binary shape (aa = mysaving_mat{1};)
- cell 2: component of ground truth (aa = mysaving_mat{2};), with two sub_cells:
- sub_cell 1: list of endpoints;
- sub_cell 2: list of branches; Each branch is represented by endpoint1, endpoint2, and skeleton path between them. Users can conveniently evaluate skeleton matching methods (e.g., skeleton path similarity) with them.
- cell 3: binary skeleton ground truth (aa = mysaving_mat{3};)
- .png: for Python users. A binary image with skeleton ground truth. Alternatively, users can covert the endpoint and branch lists from .mat files.
- .mat: for Matlab users. Each file contains three cells:
- Thumb: Visualization of the integrated shape/image and the ground truth.
*: Since each zip file surpasses 25M, please download them via the Google Drive disc links in the Download section.
Freely download the aforementioned files independently or in one folder from Google Drive based on your needs:
- Independently
| Dataset | Type | Original Data | Ground Truth | Thumb |
|---|---|---|---|---|
| SK506 | image | Download | Download | Download |
| SK1491 | image | Download | Download | Download |
| SYMMAX300 | image | Download | Download | Download |
| SymPASCAL | image | Download | Download | Download |
| EM200 | image and shape | Download | Download | Download |
| SmithsonianLeaves | image and shape | Download | Download | Download |
| WH-SYMMAX | image and shape | Download | Download | Download |
| Animal2000 | shape | Download | Download | Download |
| ArticulatedShapes | shape | Download | Download | Download |
| Kimia99 | shape | Download | Download | Download |
| MPEG7 | shape | Download | Download | Download |
| Kimia216 | shape | Download | Download | Download |
| MPEG400 | shape | Download | Download | Download |
| Tetrapod120 | shape | Download | Download | Download |
| SwedishLeaves | shape | Download | Download | Download |
| Tari56 | shape | Download | Download | Download |
- All in one folder
If you benefit from this work, please cite our paper:
@article{Yang2023SkeView,
author = {Yang, Cong and Indurkhya, Bipin and See, John and Gao, Bo and Ke, Yan and Boukhers, Zeyd and Yang, Zhenyu and Grzegorzek, Marcin},
title = {Skeleton Ground Truth Extraction: Methodology, Annotation Tool and Benchmarks},
journal = {International Journal of Computer Vision},
volume = {},
pages = {1-23},
year = {2023}
}

