Skip to content

Will you please guide how to run the conversion script? #36

@AayushSameerShah

Description

@AayushSameerShah

Hie 👋🏻
Coming from this GGML conversion script and the issue that you commented in ggml-org/ggml#280 (comment)

Now...

I have a fine-tuned model found on huggingface called "NumbersStation/nsql-350M" which basically is a CodeGen model so I will need to convert into GPT-J model so that I can convert into GGML.

Conversion in GPT-J

I have used this script: https://gist.github.com/moyix/7896575befbe1b99162ccfec8d135566
To convert the CodeGen model into GPT-J which worked correctly.

Then...

After converting into GPT-J, I tried to use the GGML script but was getting:

KeyError                                  Traceback (most recent call last)
Cell In[42], line 2
      1 for key in encoder:
----> 2     text = bytearray([byte_decoder[c] for c in key])
      3     fout.write(struct.pack("i", len(text)))
      4     fout.write(text)

Cell In[42], line 2, in <listcomp>(.0)
      1 for key in encoder:
----> 2     text = bytearray([byte_decoder[c] for c in key])
      3     fout.write(struct.pack("i", len(text)))
      4     fout.write(text)

KeyError: '\t'

So, I came to this repo

Where I found this: https://github.com/LLukas22/llm-rs-python/blob/main/llm_rs/convert/models/gptj.py

But... can't figureout how to convert my GPT-J model to GGML

Will you please help @LLukas22 ?
Thanks 🙏🏻

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions