by jonas_kgomo on 4/29/23, 11:12 PM with 0 comments
To achieve this, we used Cohere's 100M embeddings on Wikipedia, which allowed us to map each article to a high-dimensional vector space [786] where similar articles are cosine close to each other.
We then built a simple web app that allows users to search for articles using natural language queries and returns the top results, along with a 2D visualization of the articles using UMAP.