Skip to content
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
27 commits
Select commit Hold shift + click to select a range
d9eaa45
Initialize code for iterative finetuning
martinakaduc Jun 9, 2025
f8dd429
WIP: Creating dataset by prompting the LLM
martinakaduc Jun 12, 2025
20ab384
Fix import bug
martinakaduc Jun 12, 2025
346ec50
Remvoe unnecessary required arguments
martinakaduc Jun 12, 2025
6c4c8ba
Remove default flash_attention_2
martinakaduc Jun 12, 2025
c078e8f
Fix system prompt
martinakaduc Jun 12, 2025
5864992
Update finetuning code
martinakaduc Jun 14, 2025
a555293
Finalize finetuning code
martinakaduc Jun 15, 2025
1d471fb
Fix duplicate spk_dict download
martinakaduc Jun 15, 2025
95ed8a5
Add HELM config
martinakaduc Jun 15, 2025
330f87c
Fix bugs and add requirements
martinakaduc Jun 15, 2025
53d62f7
Revise folder naming and skip dataset creation when created
martinakaduc Jun 15, 2025
d3c87c1
Enhance transcript creation
martinakaduc Jun 15, 2025
653f4c8
Revise transcripts in finetuning
martinakaduc Jun 17, 2025
7d6f944
Fix number of samples and disable evaluation cache
martinakaduc Jun 18, 2025
3ce8ced
Fix bugs and improve pipeline
martinakaduc Aug 10, 2025
6ca63fc
Add child-voice processing and disordered voice verification
martinakaduc Aug 16, 2025
57b0e5f
Fix speech verification bugs
martinakaduc Aug 18, 2025
f7b83f8
Enhance code
martinakaduc Aug 20, 2025
67f4c42
Update working configs
martinakaduc Aug 22, 2025
c76e4ad
Remove old scripts
martinakaduc Sep 14, 2025
6b9ec03
Add code and data for finetuning with ultrasuite
martinakaduc Sep 14, 2025
8988eb7
Update scripts for self-improvement
martinakaduc Sep 14, 2025
3b64cd8
Add finetune configs
martinakaduc Sep 14, 2025
6c73147
Fix typo
martinakaduc Sep 14, 2025
927b56c
Update requirements
martinakaduc Sep 14, 2025
6025806
Update requirements
martinakaduc Sep 17, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 5 additions & 1 deletion .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -163,4 +163,8 @@ cython_debug/

logs/*

*.ipynb
*.ipynb
finetune/data/
finetune/*results
finetune/configs
finetune/prod_env
71 changes: 51 additions & 20 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,27 +1,58 @@
# SpeechEval
# SLPHelm

## 0. Env Set up:
This project uses a Conda environment defined in conda_env.yml. To create and activate the environment:
```sh
# Create the environment
conda env create -f environment.yml
This repository contains scripts and instructions to run the SLPHelm benchmark.

# Activate the environment
conda activate SpeechEval
```
There are two sub-folders:
- `finetune`: scripts to finetune models with self-generated data.
- `finetune-ultrasuite`: instructions to create UltraSuite dataset and finetune models with LLaMa-Factory framework.

Alternatively, if you prefer using pip directly, a requirements.txt file is provided:
```sh
# (Optional) create or activate your own environment, then:
pip install -r requirements.txt
## How to run the benchmark
1. Install Helm:
```bash
git clone https://github.com/martinakaduc/helm/ -b slp_helm
cd helm
pip install -e .
```
## 1. Get model list from huggingface:

We first iterate the models on huggingface. Filter out the model satisfied the following requirement:
1. Has model tag of: any-to-any, audio-text-to-text
2. Has vllm support (test by run `vllm serve MODEL_ID`)
3. Accept audio, text input and output text. (test by run a sample request)
2. Run the benchmark:
```bash
# Binary Classification
helm-run --run-entries \
ultra_suite_classification:model={model_name} \
--suite binary-suite \
--output-path {evaluation_dir} \
--disable-cache \
--max-eval-instances 1000

# ASR Classification
helm-run --run-entries \
ultra_suite_classification:model={model_name} \
--suite asr-suite \
--output-path {evaluation_dir} \
--disable-cache \
--max-eval-instances 1000

# ASR Transcription
helm-run --run-entries \
ultra_suite_asr_transcription:model={model_name} \
--suite trans-suite \
--output-path {evaluation_dir} \
--disable-cache \
--max-eval-instances 1000

# Type Classification
helm-run --run-entries \
ultra_suite_classification_breakdown:model={model_name} \
--suite type-suite \
--output-path {evaluation_dir} \
--disable-cache \
--max-eval-instances 1000

```sh
cd tools && python get_model_list.py
# Symptom Classification
helm-run --run-entries \
ultra_suite_disorder_symptoms:model={model_name} \
--suite symp-suite \
--output-path {evaluation_dir} \
--disable-cache \
--max-eval-instances 1000
```
Binary file removed audio/alloy.wav
Binary file not shown.
Binary file removed audio/atypical.wav
Binary file not shown.
Binary file removed audio/typical.wav
Binary file not shown.
Loading
Loading