LLAMA-quantized TheBloke/Llama-2-7B-Chat-GGML Quantized model use case on RAG #CPU Optimized ue of CTransformers library to run on CPU/Local machine