Assassin Bug Image Classification System

A Deep Learning Framework for Hierarchical Classification of Assassin Bugs (Reduviidae)

📖 Overview

This project implements a comprehensive deep learning framework for hierarchical classification of assassin bugs (Reduviidae family) using state-of-the-art computer vision models. The system performs multi-level taxonomic classification across three hierarchical levels: subfamily, genus, and species.
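
To make the three-level hierarchy concrete, the sketch below shows one common way to implement it in PyTorch: a shared backbone with one classification head per taxonomic level. This is an illustration only, not the repository's exact architecture, and the class counts are placeholder values.

import torch
import torch.nn as nn
import torchvision.models as tvm

class HierarchicalClassifier(nn.Module):
    """Shared feature extractor with one head per taxonomic level."""

    def __init__(self, n_subfamily, n_genus, n_species):
        super().__init__()
        backbone = tvm.resnet50(weights=None)   # use weights="DEFAULT" for ImageNet pretraining
        feat_dim = backbone.fc.in_features      # 2048 for ResNet-50
        backbone.fc = nn.Identity()             # drop the ImageNet classification head
        self.backbone = backbone
        self.subfamily_head = nn.Linear(feat_dim, n_subfamily)
        self.genus_head = nn.Linear(feat_dim, n_genus)
        self.species_head = nn.Linear(feat_dim, n_species)

    def forward(self, x):
        feats = self.backbone(x)
        return (self.subfamily_head(feats),
                self.genus_head(feats),
                self.species_head(feats))

# Placeholder class counts, for illustration only.
model = HierarchicalClassifier(n_subfamily=5, n_genus=30, n_species=100)
outputs = model(torch.randn(2, 3, 224, 224))
print([o.shape for o in outputs])  # one logits tensor per taxonomic level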

📦 Included Assets (ready to use)

  • Sample images for inference and demos: example/Ectrychotes andreae_*.jpg|.JPG|.png
  • A pretrained checkpoint for quick testing: example/DenseNet_121_epoch_84_94.04.pth
  • Dependency lock file for reproducible setup: requirements.txt
  • Full dataset and all trained model files: downloadable from Figshare at https://figshare.com/s/b6729a5f514b7e8cc71b

Key Features

  • Multi-Model Support: Implementations of 7 popular deep learning architecture families
  • Hierarchical Classification: Three-level taxonomic classification (subfamily → genus → species)
  • Configurable Training: Flexible configuration system for different model architectures
  • Advanced Metrics: Comprehensive evaluation with Top-k accuracy, F1-score, precision, and recall
  • Cross-Validation: Built-in k-fold cross-validation for robust model evaluation
  • Pretrained Weights: Support for loading pretrained models

🏗️ Architecture

Supported Models

Model Family        Variants                                    Description
ResNet              ResNet-18, 34, 50, 101, 152                 Deep residual networks with skip connections
DenseNet            DenseNet-121, 161, 169, 201                 Densely connected convolutional networks
EfficientNet        EfficientNet-B0 to B7                       Compound scaling for efficient CNNs
MobileNetV3         MobileNetV3-Large, Small                    Lightweight networks for mobile deployment
ConvNeXt            ConvNeXt-Tiny, Small, Base, Large, XLarge   Modern ConvNet with Transformer design principles
Swin Transformer    Swin-Tiny, Small, Base, Large               Hierarchical vision transformer with shifted windows
Vision Transformer  ViT-Tiny, Small, Base, Large                Pure transformer architecture for image classification
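
The model_configs entries shown later in Usage select one of these variants by name. Here is a hypothetical sketch of how such an entry could map to a torchvision backbone; the repository's own model factory may differ:

import torchvision.models as tvm

# Hypothetical name-to-builder mapping for illustration;
# the repository's own factory may use different names.
BUILDERS = {
    "resnet_50": tvm.resnet50,
    "densenet_121": tvm.densenet121,
    "efficientnet_b0": tvm.efficientnet_b0,
}

def build_backbone(cfg):
    """Build a backbone from a model_configs-style entry."""
    weights = "DEFAULT" if cfg.get("pretrained") else None
    return BUILDERS[cfg["name"]](weights=weights)

model = build_backbone({"type": "resnet", "name": "resnet_50", "pretrained": True})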

🚀 Installation

Setup

  1. Clone the repository

git clone https://github.com/caomuxifeng/assassinbug_classification.git
cd assassinbug_classification

  2. Create a virtual environment

python -m venv venv
source venv/bin/activate

  3. Install dependencies

pip install -r requirements.txt

If you only want to run the provided demo, installing from requirements.txt is the quickest way to match the tested environment.

🐳 Docker-based environment (recommended to avoid setup errors)

Use the published Docker image to get CUDA/toolchain prerequisites, then install Python packages inside the container:

# 1) Pull the prebuilt image from Docker Hub
docker pull kkybp/pytorch-2-0-0:22.04

# 2) Launch a container with the project mounted; add --gpus all if you have NVIDIA GPUs
docker run --rm -it --gpus all \
  -v $(pwd):/workspace \
  -w /workspace \
  kkybp/pytorch-2-0-0:22.04 \
  bash

# 3) Inside the container, install Python dependencies
pip install -r requirements.txt

If you prefer CPU-only, drop --gpus all. Update the image name/tag if you host it under a different namespace.
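
Once inside the container, a quick sanity check confirms that PyTorch can see the GPU:

import torch

# Prints the PyTorch version and True when a CUDA GPU is visible.
print(torch.__version__, torch.cuda.is_available())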

📁 Dataset Structure

Organize your dataset in the following structure:

data/
├── taxonomy.csv                       # Taxonomic mapping file
├── complete/
│   ├── train/                         # Training images
│   │   ├── subfamily1_genus1_species1/
│   │   │   ├── image1.jpg
│   │   │   ├── image2.jpg
│   │   │   └── ...
│   │   ├── subfamily1_genus1_species2/
│   │   └── ...
│   └── val/                           # Validation images
│       ├── subfamily1_genus1_species1/
│       └── ...
├── cross-domain test/                 # Cross-domain test images
├── genus_zeroshot/                    # Zero-shot split at genus level
└── species_zeroshot/                  # Zero-shot split at species level
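
Because each class folder encodes its full taxonomy in its name, the three labels can be recovered by splitting on underscores. A minimal sketch, assuming the subfamily_genus_species naming shown above:

from pathlib import Path

def labels_from_folder(folder_name):
    """Split 'subfamily_genus_species' into its three taxonomic labels."""
    subfamily, genus, species = folder_name.split("_", maxsplit=2)
    return subfamily, genus, species

for class_dir in sorted(Path("data/complete/train").iterdir()):
    if class_dir.is_dir():
        print(class_dir.name, "->", labels_from_folder(class_dir.name))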

🏃‍♂️ Usage

Training a Single Model

python train.py \
    --data_dir ./data/complete/train \
    --val_data_dir ./data/complete/val \
    --model_configs '[{"type": "resnet", "name": "resnet_50", "pretrained": true}]' \
    --batch_size 64 \
    --epochs 100 \
    --num_workers 4

Training Multiple Models

python train.py \
    --data_dir ./data/complete/train \
    --val_data_dir ./data/complete/val \
    --model_configs '[
        {"type": "resnet", "name": "resnet_50", "pretrained": true},
        {"type": "efficientnet", "name": "efficientnet_b0", "pretrained": true},
        {"type": "swin", "name": "swin_tiny", "pretrained": true}
    ]' \
    --batch_size 64 \
    --epochs 100
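
Quoting JSON on the command line is easy to get wrong; building the argument list in Python sidesteps shell-escaping issues. A sketch, assuming the same train.py flags as above:

import json
import subprocess

configs = [
    {"type": "resnet", "name": "resnet_50", "pretrained": True},
    {"type": "swin", "name": "swin_tiny", "pretrained": True},
]

# json.dumps produces exactly the string train.py expects for --model_configs.
subprocess.run([
    "python", "train.py",
    "--data_dir", "./data/complete/train",
    "--val_data_dir", "./data/complete/val",
    "--model_configs", json.dumps(configs),
    "--batch_size", "64",
    "--epochs", "100",
], check=True)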

Cross-Validation Training

python train.py \
    --data_dir ./data/complete/train \
    --model_configs '[{"type": "resnet", "name": "resnet_50", "pretrained": true}]' \
    --cross_validation \
    --cv_folds 5 \
    --batch_size 32 \
    --epochs 100

Inference

python inference.py \
    --model_path ./results/resnet_50/checkpoints/best_model.pth \
    --model_type resnet \
    --model_name resnet_50 \
    --image_path ./test_image.jpg \
    --taxonomy_file ./data/taxonomy.csv

Quick Demo with Provided Assets

Use the bundled checkpoint and sample image to verify the pipeline end-to-end:

python inference.py \
    --model_path ./example/DenseNet_121_epoch_84_94.04.pth \
    --model_type densenet \
    --model_name densenet_121 \
    --image_path ./example/Ectrychotes\ andreae_001.jpg \
    --taxonomy_file ./data/taxonomy.csv

The command prints the predicted subfamily, genus, and species for the sample image. Point --image_path at any other file in example/ to test additional samples, or loop over them all as shown below.
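
A small loop runs the demo over every bundled sample image at once (a sketch using the same flags as above):

import subprocess
from pathlib import Path

CHECKPOINT = "example/DenseNet_121_epoch_84_94.04.pth"

for image in sorted(Path("example").glob("Ectrychotes andreae_*")):
    # Covers the bundled .jpg, .JPG, and .png samples.
    if image.suffix.lower() in {".jpg", ".png"}:
        subprocess.run([
            "python", "inference.py",
            "--model_path", CHECKPOINT,
            "--model_type", "densenet",
            "--model_name", "densenet_121",
            "--image_path", str(image),
            "--taxonomy_file", "data/taxonomy.csv",
        ], check=True)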

External data & models (Figshare)

  • Due to GitHub file size limits, only the lightweight densenet_121 checkpoint is bundled here.
  • The complete dataset and all trained model files can be downloaded from Figshare: https://figshare.com/s/b6729a5f514b7e8cc71b

📊 Model Performance

Evaluation Metrics

The system reports comprehensive evaluation metrics (a minimal computation sketch follows the list):

  • Top-k Accuracy (k=1, 3, 5): fraction of samples whose true label appears among the k highest-scoring predictions
  • Precision: true positives / (true positives + false positives)
  • Recall: true positives / (true positives + false negatives)
  • F1-Score: harmonic mean of precision and recall
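
A minimal sketch of how these metrics can be computed with PyTorch and scikit-learn; the repository's evaluation code may differ in details:

import torch
from sklearn.metrics import precision_recall_fscore_support

def topk_accuracy(logits, targets, k=5):
    """Fraction of samples whose true label is among the k top-scoring classes."""
    topk = logits.topk(k, dim=1).indices                  # shape (N, k)
    return (topk == targets.unsqueeze(1)).any(dim=1).float().mean().item()

logits = torch.randn(8, 10)                               # dummy scores: 8 samples, 10 classes
targets = torch.randint(0, 10, (8,))
print("top-3 accuracy:", topk_accuracy(logits, targets, k=3))

preds = logits.argmax(dim=1)
p, r, f1, _ = precision_recall_fscore_support(
    targets.numpy(), preds.numpy(), average="macro", zero_division=0)
print(f"precision={p:.3f} recall={r:.3f} f1={f1:.3f}")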

📁 Output Structure

Training results are organized as follows:

results/
├── model_name/
│   ├── checkpoints/           # Model checkpoints
│   │   └── epoch_100_86.50.pth
│   ├── logs/                  # Training logs
│   │   ├── model_config.log
│   │   └── training.log
│   └── metrics/               # Performance metrics
│       ├── training_history.json
│       ├── train_metrics.csv
│       └── val_metrics.csv
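
Checkpoint filenames embed the epoch and validation accuracy (e.g. epoch_100_86.50.pth), so the best checkpoint of a run can be located by parsing names. A small sketch:

import re
from pathlib import Path

PATTERN = re.compile(r"epoch_(\d+)_([\d.]+)\.pth$")

def best_checkpoint(ckpt_dir):
    """Return (accuracy, epoch, path) for the highest-accuracy checkpoint."""
    best = None
    for path in Path(ckpt_dir).glob("*.pth"):
        match = PATTERN.search(path.name)
        if match:
            epoch, acc = int(match.group(1)), float(match.group(2))
            if best is None or acc > best[0]:
                best = (acc, epoch, path)
    return best

print(best_checkpoint("results/resnet_50/checkpoints"))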

🔄 Updates

  • v1.1.0 (2025-12): Added Docker-based setup (kkybp/pytorch-2-0-0:22.04), documented quick-start inference with the bundled demo assets (example/ images plus a DenseNet checkpoint), pinned requirements.txt, and clarified that the full dataset and all trained models are hosted on Figshare, with GitHub carrying only the lightweight checkpoint.
  • v1.0.0 (2025-06): Initial release with 7 model architectures

⭐ If you find this project helpful, please consider giving it a star!
