# Privacy-Centric Small Language Models (SLMs) for Mental Health Therapy
MindSLM is designed to deploy small language models that prioritize user privacy, specifically tailored for mental health therapy applications.
- `Qwen2_5_Unsloth_2x_faster_finetuning.ipynb`: the training workflow for MindSLM using Unsloth. Adjust the parameters and configurations as needed to suit your specific use case.
- `eval.py`: runs inference with the large language models (LLMs) on the testing dataset.
- `trad-eval.py`: evaluates the models' performance on the testing dataset using traditional evaluation metrics, including ROUGE-L and BERTScore, and writes the results as CSV files to the `eval_outputs/` directory and the plots to the `plots/` directory.
- `llm_aj_seq.ipynb`, `llm_aj.py`: these notebooks contain the code for evaluating the models' performance on the testing dataset using LLM-as-a-judge by querying the OpenAI API. The `_seq` version queries sequentially, while the other uses the OpenAI Batch API.
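For reference, ROUGE-L scores a candidate against a reference by the length of their longest common subsequence. Below is a minimal pure-Python sketch, assuming simple whitespace tokenization; `trad-eval.py` itself may well rely on a library implementation instead:

```python
def lcs_len(a, b):
    """Length of the longest common subsequence of token lists a and b (dynamic programming)."""
    dp = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            dp[i + 1][j + 1] = dp[i][j] + 1 if x == y else max(dp[i][j + 1], dp[i + 1][j])
    return dp[-1][-1]

def rouge_l_f1(reference, candidate):
    """ROUGE-L F1: harmonic mean of LCS-based precision and recall."""
    ref, cand = reference.split(), candidate.split()
    lcs = lcs_len(ref, cand)
    if lcs == 0:
        return 0.0
    precision, recall = lcs / len(cand), lcs / len(ref)
    return 2 * precision * recall / (precision + recall)
```

An identical reference and candidate score 1.0; texts sharing no tokens score 0.0.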
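The Batch API path presumably uploads a JSONL file of judging requests rather than issuing one call at a time. A hedged sketch of what each input line looks like; the model name and prompt wording here are placeholders, not the repository's actual judging prompt:

```python
import json

def batch_request_line(custom_id: str, prompt: str, model: str = "gpt-4o-mini") -> str:
    """One JSONL line in the OpenAI Batch API input format."""
    return json.dumps({
        "custom_id": custom_id,  # echoed back in the output so results can be matched to inputs
        "method": "POST",
        "url": "/v1/chat/completions",
        "body": {
            "model": model,
            "messages": [{"role": "user", "content": prompt}],
        },
    })

# The resulting file (one request per line) is uploaded with purpose="batch",
# and the batch is created against the /v1/chat/completions endpoint.
```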
- Clone the repository and ensure all dependencies are installed.
- Customize the training notebook to match your requirements.
- To use few-shot prompting:
  - Request access to the Psych8k dataset, which is currently gated on Hugging Face: https://huggingface.co/datasets/EmoCareAI/Psych8k
  - Once you have access, run `example.py` to generate the few-shot examples.
  - Replace the placeholder text in `eval.py` with the examples to enable few-shot prompting.
- Use `eval.py` to generate the trained models' output on the test dataset.
- Use `trad-eval.py` to evaluate the models' performance using traditional evaluation metrics.
- Use `llm_aj_seq.ipynb` or `llm_aj.py` to evaluate the models' performance using LLM-as-a-judge.
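As a sketch of what the few-shot placeholder in `eval.py` might be replaced with, the generated Psych8k examples can be spliced ahead of the real query. The template wording and helper name below are illustrative assumptions, not the repository's actual prompt:

```python
def build_few_shot_prompt(examples, query):
    """Splice (client_input, counselor_reply) example pairs ahead of the real query."""
    shots = "\n\n".join(
        f"Client: {inp}\nCounselor: {out}" for inp, out in examples
    )
    return (
        "You are a supportive mental health counselor. "
        "Respond empathetically, as in the examples below.\n\n"
        f"{shots}\n\nClient: {query}\nCounselor:"
    )
```

The prompt ends at `Counselor:` so the model's completion is the counselor's reply to the final query.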
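Whichever querying mode is used, the judge's free-text reply has to be reduced to a number before aggregation. One small sketch of parsing a 1-to-10 rating; the expected reply format is an assumption about how the judging prompt is phrased:

```python
import re

def parse_judge_score(reply: str, lo: int = 1, hi: int = 10):
    """Return the first integer in [lo, hi] found in a judge reply, or None if absent."""
    for token in re.findall(r"\d+", reply):
        value = int(token)
        if lo <= value <= hi:
            return value
    return None
```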
- Privacy-centric architecture designed for mental health applications.
- Optimized for small language models to enable local deployment.
- Fine-tuning and evaluation scripts for rapid prototyping.
