heed is a library that fits a collection of deep learning models trained on the task of keyword spotting and deploys one locally via a web server, which provides a user interface through which the trained models can be queried using an audio input device on the host machine.
To begin, clone the repository and run a script to check that you have all the required CLI tools:
```sh
./check_cli_tools.sh
```
Unfortunately, the dataset used in this application is private, so running the entire pipeline end to end may not work with a single command for you. Nevertheless, you can substitute your own binary classification dataset from the audio domain for the one used in this repository, or remove the fitting service from the docker-compose.yaml file and use a pre-trained model that accompanies this repository.
To spin up the model fitting job, serving and web server containers, please run:
```sh
docker-compose -f docker-compose.yaml up
```
Once the fitting job has completed (which should be apparent from the logs), the serving container will expose an interactive API documentation page where end users can test out the inference API directly. This will be accessible from:
http://localhost:6000/docs
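For example, once the containers are up you can post a clip to the inference API from Python. The endpoint path and form-field name below are assumptions for illustration; check the /docs page for the actual schema:

```python
import requests

# Hypothetical endpoint and field names; confirm against http://localhost:6000/docs.
with open("clip.wav", "rb") as f:
    response = requests.post(
        "http://localhost:6000/predict",
        files={"file": ("clip.wav", f, "audio/wav")},
    )

print(response.status_code, response.json())
```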
There will also be a web server serving an application that allows end users to test out the inference API by recording their own clips and posting them to the API through a simple user interface. This will be accessible from:
http://localhost:7000
heed has three components: the fitting, serving and webserver parts. Each of these has its own suite of tests. See the dev.README.md in each subfolder (models/, website/ and serve/) for how to run them and for more information. Planned work, such as expanding the model zoo and WebSockets for real-time stream predictions, is tracked in website/dev.README.md.
To develop the zoo of available models, create a virtual environment and, from within it, run:
```sh
pip install -r requirements.txt
```

and then

```sh
pip install -e ./models
```
This will install the dependencies and the Python package that contains the logic for all the models, allowing you to import it like:
```python
import kws
```
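As a quick sanity check that the editable install worked (the package's actual API is covered in models/dev.README.md):

```python
import kws

# With `pip install -e ./models`, the package should resolve to your checkout.
print(kws.__file__)
```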
TODO

- Develop frontend JavaScript that uses the browser MediaRecorder API to record 1.5 seconds of audio and send it to the backend service serving the kws model.
- Containerise the fitting of the models and make sure access is provided.
- Standardise the Docker `ENTRYPOINT`. Want to be able to supply `command` in `docker-compose.yaml` so that ports and host IPs can be configured from one file, the `docker-compose.yaml` file.
- Connect the serving model to the frontend once a trained model has been created.
- Finish writing tests.
- Use `npm`, `webpack` and Bootstrap to provide styling for the frontend. Follow this tutorial. This will require refactoring the current frontend directories into `src/` and `dist/` directories, but it will allow bundling JS and CSS modules together for a production environment in the future and localise all editing into one location: `src/`.
- Clean up the serving application. Currently there is preprocessing occurring inside the application, which shouldn't be happening at the API level. Preferably, ONNX would handle the transformations too.
- Be able to run the entire pipeline with docker-compose. Close to having end-to-end state; need to figure out the data loading issue.
- Add information to the README on dataset directory structure expectations so users can easily train models on their own datasets by dropping them into the correct location.
- Figure out the issue with TensorRT (currently being tracked here). CUDA can be used as an execution provider, but not TensorRT, for the speed-up of merging the weights and biases.
- Review https://pytorch.org/audio/stable/_modules/torchaudio/models/wav2vec2/model.html#Wav2Vec2Model and test exporting the Hugging Face feature extractor as part of the model architecture.
- Normalisation approach: apply PCEN (https://github.com/daemon/pytorch-pcen); see the PCEN sketch after this list.
- Integrate feature extraction into the model architecture using opset 17 and torch nightly via https://github.com/qiuqiangkong/torchlibrosa. TRIED AND TESTED: works for opset 17. See the export sketch after this list.
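A minimal, non-streaming sketch of the PCEN idea referenced above (per-channel energy normalisation, Wang et al. 2017), written directly in PyTorch rather than via the linked repo; the smoothing coefficient and other constants are illustrative defaults, not values used by this project:

```python
import torch

def pcen(mel, s=0.025, alpha=0.98, delta=2.0, r=0.5, eps=1e-6):
    """Per-channel energy normalisation over a (batch, n_mels, frames) tensor."""
    # Smooth each frequency channel over time with an exponential moving average.
    m = torch.empty_like(mel)
    m[..., 0] = mel[..., 0]
    for t in range(1, mel.shape[-1]):
        m[..., t] = (1 - s) * m[..., t - 1] + s * mel[..., t]
    # Gain normalisation followed by root compression.
    return (mel / (eps + m) ** alpha + delta) ** r - delta ** r

out = pcen(torch.rand(2, 64, 150))  # e.g. a batch of mel spectrograms
```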
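And a hedged sketch of the last item: wrapping torchlibrosa's spectrogram layers inside the model so the full waveform-to-logits path can be exported to ONNX at opset 17. The classifier head, shapes and hyperparameters below are placeholders, not the real kws architecture:

```python
import torch
import torch.nn as nn
from torchlibrosa.stft import Spectrogram, LogmelFilterBank

class KwsWithFeatures(nn.Module):
    def __init__(self, sample_rate=16000, n_mels=64):
        super().__init__()
        # Feature extraction lives inside the module, so it is exported too.
        self.spec = Spectrogram(n_fft=512, hop_length=160)
        self.logmel = LogmelFilterBank(sr=sample_rate, n_fft=512, n_mels=n_mels)
        self.head = nn.Sequential(nn.Flatten(), nn.LazyLinear(2))  # stand-in head

    def forward(self, waveform):      # waveform: (batch, samples)
        x = self.spec(waveform)       # (batch, 1, frames, n_fft // 2 + 1)
        x = self.logmel(x)            # (batch, 1, frames, n_mels)
        return self.head(x)

model = KwsWithFeatures().eval()
dummy = torch.randn(1, 24000)         # 1.5 s of 16 kHz audio
model(dummy)                          # materialise the lazy layer before export
torch.onnx.export(model, dummy, "kws.onnx", opset_version=17)
```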
Research
- Representation learning: train an autoencoder, visualise the embeddings via TensorBoard, and build a downstream classifier on the auto-encoded representation (see the TensorBoard sketch after this list).
- Synthetic data generation using a variational autoencoder: train classifiers on a mix, on purely synthetic data and on purely the original dataset, and compare the results.
- Is it possible to produce a KWS algorithm with synthetic data generation? As in, is it possible to generate a dataset for any given keyword to be spotted?
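A minimal sketch of the embedding-visualisation step in the first research item, using torch.utils.tensorboard's projector support; the encoder output and labels here are random stand-ins:

```python
import torch
from torch.utils.tensorboard import SummaryWriter

# Stand-ins for auto-encoded clip embeddings and their class labels.
embeddings = torch.randn(128, 32)
labels = ["keyword" if i % 2 == 0 else "other" for i in range(128)]

writer = SummaryWriter("runs/kws_embeddings")
writer.add_embedding(embeddings, metadata=labels)
writer.close()
# Then inspect with: tensorboard --logdir runs
```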
