Completing the RAG Loop: From Vector Search to LLM Recommendations - Part 2

The LLM acts as the final reasoning layer, turning vector based search results into tailored viewing recommendations. Where We Left Off In Part 1, we built the foundation of a movie recommendation RAG pipeline. We took raw user preferences, generated 1024-dimension vector embeddings using bge-large, and stored them in a Qdrant vector database. The result was a semantic search engine that could find similar preferences with accuracy relevant to the user’s interests and mood of the moment. ...

September 6, 2025 · 15 min · 3076 words · Mark Holton

Vector Databases in Action: Building a RAG Pipeline for Movie Recommendations - Part 1

AI robot aiming an arrow, symbolizing how a RAG pipeline targets relevant information for a user’s query. Using a movie recommendation agent to explore these architectural concepts Why I Am Building This Do you and your significant other ever sit down to watch a movie, and find yourself surfing previews for 30-45 minutes before giving up? Of course Netflix has a strong recommendation engine, but it’s tuned for maximizing engagement across their catalog, not necessarily reflecting your personal taste over time. ...

September 1, 2025 · 14 min · 2971 words · Mark Holton