CoLLT is a contrastive learning framework for training BERT and its variants to classify long input sequences.
- Download and unzip the code archive
- Run main.py to execute the experiments
The code is written in a modular way, so implementing a new contrastive loss, augmentation technique, or data encoder is straightforward. It is organized into four main files:
- augmenters.py: contains the augmentation techniques used for view construction
- models.py: contains code for different data encoders
- contrast_models.py: contains the preprocessing steps (e.g., sampling positive and negative pairs) applied before the contrastive loss
- losses.py: contains the Barlow Twins loss
- main.py: handles training of the end-to-end model pipeline
- Bert_baseline.ipynb: contains code to run the BERT baseline model
- Data_filter.ipynb: contains code for data preprocessing
- baselines.py: contains the baseline models
- baselines.ipynb: used to run baselines.py
- data.pickle, data_val.pickle, data_test.pickle: contain the training, validation, and test data
- data_visualization.ipynb: contains data visualization tools and techniques
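As an illustration of the kind of augmentation augmenters.py implements for view construction, here is a minimal sketch of random contiguous cropping over a token sequence. The function name and parameters are assumptions for illustration, not the repository's actual API:

```python
import random

def random_token_crop(tokens, crop_ratio=0.8, seed=None):
    """Build one view by keeping a random contiguous span of tokens.

    Hypothetical example of a view-construction augmentation; the real
    augmenters in augmenters.py may differ.
    """
    rng = random.Random(seed)
    keep = max(1, int(len(tokens) * crop_ratio))  # number of tokens to retain
    start = rng.randint(0, len(tokens) - keep)    # random start of the span
    return tokens[start:start + keep]
```

Applying the augmenter twice with different seeds yields two correlated views of the same input, which form a positive pair for the contrastive objective.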
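For reference, the Barlow Twins loss computed in losses.py pushes the cross-correlation matrix between two views' embeddings toward the identity. A minimal NumPy sketch of the standard formulation (the repository presumably uses a PyTorch version; this is not its exact code):

```python
import numpy as np

def barlow_twins_loss(z_a, z_b, lam=5e-3):
    """Barlow Twins loss for two batches of view embeddings of shape (N, D)."""
    # Normalize each embedding dimension to zero mean / unit variance over the batch.
    z_a = (z_a - z_a.mean(axis=0)) / z_a.std(axis=0)
    z_b = (z_b - z_b.mean(axis=0)) / z_b.std(axis=0)
    n = z_a.shape[0]
    # Cross-correlation matrix between the dimensions of the two views.
    c = (z_a.T @ z_b) / n
    on_diag = np.sum((1.0 - np.diag(c)) ** 2)             # diagonal -> 1
    off_diag = np.sum(c ** 2) - np.sum(np.diag(c) ** 2)   # off-diagonal -> 0
    return on_diag + lam * off_diag
```

The `lam` weight trades off invariance (diagonal term) against redundancy reduction (off-diagonal term).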