Hi,
I have a few questions regarding the current status of the project. Is it still actively maintained? Are you open to new pull requests at this time?
I noticed the project seems to be a bit behind the latest updates in GGML, and I believe it could benefit from upgrading to the latest GGML version.
In my local branch, I made some changes to allow selecting the backend and device at runtime for inference. Unfortunately, I couldn't get inference to work successfully yet. I'm not sure if this is due to recent changes in GGML or if I'm using an incompatible model for testing.
Could you provide any guidance on this?
Also, if I manage to get it working, would you be open to reviewing and accepting a PR for these changes?
Thanks in advance.