Implementation of InscturOR in Jupyter Notebook
Then use to compare InstructOR results with others Open-Source Embedders
The goal is to chunkunize some PDFs (here 1 chunk = 1/2 pdf page) then we compute Cosine Similarities between the Query and the vectorized chunks.
We plot and keep some top k (here Top 5) best results.
To be continued.