Add GGUF model support with GPU acceleration via llama-cpp-python module #3
Adds a README.md and scripts for quickly setting up a GPU-accelerated Python wrapper (llama-cpp-python) for running GGUF models with llama.cpp.
Also adds a test script that downloads Mistral 7B and runs the example prompt, so setup can be verified quickly. A sketch of the kind of call involved is shown below.
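As a rough illustration (not the exact contents of the scripts in this PR), loading a GGUF model with GPU offload via llama-cpp-python might look like the following; the model path and prompt are placeholders:

```python
# Minimal sketch: load a GGUF model with llama-cpp-python and offload layers
# to the GPU. Paths and prompt are hypothetical, not taken from this PR.
from llama_cpp import Llama

llm = Llama(
    model_path="models/mistral-7b-instruct.Q4_K_M.gguf",  # placeholder local path
    n_gpu_layers=-1,  # offload all layers to the GPU (requires a GPU-enabled build)
    n_ctx=4096,       # context window
)

output = llm(
    "Q: Name the planets in the solar system. A:",  # assumed example prompt
    max_tokens=128,
    stop=["Q:"],
    echo=True,
)
print(output["choices"][0]["text"])
```

Note that GPU support depends on llama-cpp-python being built with the appropriate backend flags at install time, which is what the setup scripts are intended to handle.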
This also opens the door to augmentation along the lines of:
https://arxiv.org/abs/2309.09530
Adapting Large Language Models via Reading Comprehension
an approach made viable by Mistral's Apache-2.0 license.
This should let us further increase the efficacy of our datasets and reduce the number of parameters required for strong models.