This is a project for the "Multimodal information processing and analysis" course of the MSc Data Science program run jointly by the National Centre for Scientific Research "Demokritos" and the University of the Peloponnese. The goal is to classify a target lecture video into 3 categories (boring, neutral, interesting) based on viewer stimulation. A collection of manually annotated videos is used as the training and evaluation dataset. The algorithm used for training is the SVM from the scikit-learn library. Audio features are extracted with the pyAudioAnalysis library, and video features with VGG16 from the Keras library.
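As a rough illustration of the pipeline, the sketch below trains a scikit-learn SVM on combined audio/video feature vectors. The random features stand in for the real pyAudioAnalysis and VGG16 outputs; all names and dimensions here are illustrative assumptions, not the project's actual code.

```python
import numpy as np
from sklearn.svm import SVC

# Stand-in features: in the real pipeline these would come from
# pyAudioAnalysis (audio) and VGG16 (video); here they are random.
rng = np.random.RandomState(0)
audio_feats = rng.rand(12, 34)    # 12 segments, 34 audio features (illustrative)
video_feats = rng.rand(12, 512)   # 12 segments, 512 video features (illustrative)

# One combined feature vector per video segment.
X = np.hstack([audio_feats, video_feats])
y = rng.choice(["boring", "neutral", "interesting"], size=12)

clf = SVC(kernel="rbf")           # SVM classifier, as in the project
clf.fit(X, y)
pred = clf.predict(X[:1])         # predict the class of one segment
```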
- Python 3.7.6
- pip 20.0.2
- Supported video format: .mp4
- Supported audio format: .wav
- Clone the source:
git clone https://github.com/cjd1884/pyLectureMultiModalAnalysis.git
- Install dependencies:
pip install -r ./requirements.txt

The main file is lecture_classifier.py. It can be run in 3 different modes:
This is the primary evaluation mode, used to assess the SVM algorithm's performance on the training data. Accuracy is calculated using leave-one-video-out cross-validation (leaving out a different video is equivalent to leaving out a different speaker, since we assume one video per speaker).
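Leave-one-video-out cross-validation can be sketched with scikit-learn's LeaveOneGroupOut, treating the video id as the group. The features and labels below are made-up placeholders, not data from the project.

```python
import numpy as np
from sklearn.model_selection import LeaveOneGroupOut, cross_val_score
from sklearn.svm import SVC

rng = np.random.RandomState(0)
X = rng.rand(8, 4)                 # 8 segments, 4 features each (placeholder values)
y = [0, 1, 0, 1, 1, 2, 1, 0]       # class per segment (0=boring, 1=interesting, 2=neutral)
groups = [0, 0, 0, 0, 1, 1, 1, 1]  # video id per segment: one speaker per video

# Each fold holds out every segment of one video (i.e. one speaker).
scores = cross_val_score(SVC(), X, y, groups=groups, cv=LeaveOneGroupOut())
```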
Video files to be used for training need to be placed in the data/source/ folder, in .mp4 format. An annotation file named index.csv should be provided under the data/ folder. The index file should have the following format:
FILE;SEG;CLASS_1
video_0;part_0;boring
video_0;part_1;interesting
video_0;part_2;boring
video_0;part_3;interesting
video_1;part_0;interesting
video_1;part_1;neutral
video_1;part_2;interesting
video_1;part_3;boring
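The annotation file above is semicolon-separated, so it can be read with Python's csv module. This is just a parsing sketch, not code from the project:

```python
import csv
import io

# A fragment of index.csv, as shown above.
sample = """FILE;SEG;CLASS_1
video_0;part_0;boring
video_0;part_1;interesting
"""

rows = list(csv.DictReader(io.StringIO(sample), delimiter=";"))
# Each row maps column names to values, e.g.
# {"FILE": "video_0", "SEG": "part_0", "CLASS_1": "boring"}
labels = [row["CLASS_1"] for row in rows]
```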
python lecture_classifier.py -a eval_train

In this mode, the SVM model is trained on the entire training dataset and then saved to disk.
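Persisting a trained scikit-learn SVM to disk is typically done with pickle (or joblib). The sketch below uses placeholder data, and the file name is an assumption, not necessarily the one the script uses:

```python
import pickle
import numpy as np
from sklearn.svm import SVC

# Train a toy SVM on placeholder data and persist it to disk.
rng = np.random.RandomState(1)
X, y = rng.rand(6, 3), [0, 1, 2, 0, 1, 2]
model = SVC().fit(X, y)

with open("svm_model.pkl", "wb") as f:   # hypothetical file name
    pickle.dump(model, f)
```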
python lecture_classifier.py -a train

The trained model (loaded from disk) is used to classify the target video. The target video to be annotated by the algorithm should be placed under the data/target/ folder.
python lecture_classifier.py -a eval_target
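Classifying a target video then amounts to loading the serialized model and predicting on the target's feature vectors. Again, the feature dimensions below are illustrative assumptions, and a toy model is trained inline so the loading step is self-contained:

```python
import pickle
import numpy as np
from sklearn.svm import SVC

# Train and serialize a toy model (stands in for the one saved by train mode).
rng = np.random.RandomState(2)
model = SVC().fit(rng.rand(6, 4), ["boring", "neutral", "interesting"] * 2)
blob = pickle.dumps(model)

# Load the model and classify the target video's segments.
loaded = pickle.loads(blob)
target_feats = rng.rand(3, 4)    # features of 3 target segments (placeholder)
pred = loaded.predict(target_feats)
```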