Improve chunking for /rerank when history is too long

              This is fine for now, I think:). The better place would be in my opinion here [1]. The reason for this is:

- We can calculate how to chunk the results from the vector db so that they fit the context of the re-rank model. If the chunks would be too small, we could proceed to chunking of the history.

- We can send bigger chunk to the embedding model. With this change, we are truncating the input for the embedding model as well. This might not always be necessary, as the rerank model has to accommodate both for the vector database results and the history. In contrast with the embedding model which has to accommodate only for the history. But this one is up for discussion I guess:). I see some downsides with this as well.

Anyway, this is just me thinking aloud here. We can improve later. This fixes the issue for now. 

[1] https://github.com/RCAccelerator/chatbot/blob/9bc0e742fe35579e36ebdde34a3eda67b764c544/src/embeddings.py#L134

_Originally posted by @lpiwowar in https://github.com/RCAccelerator/chatbot/pull/143#discussion_r2081211939_
            

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve chunking for /rerank when history is too long #149

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Improve chunking for /rerank when history is too long #149

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions