← Back to grid

Saturday, June 20, 2026

2026-06-20

  1. Today, I worked on creating script for the #cosine-similarity blog. I used the BGE-M3 to create initial dense embeddings. I retrieve top 50 results and do re-ranking using BGE-reranker-v2 to get top 5 results. I have setup search and retrieval pipelines for GitHub commits and Youtube videos. I can fetch commits based on repo name, and date range. For YouTube videos, I use a database from kaggle. Tomorrow, I plan to finish up the presentation and add a #blog entry.