Milvus Adventures July 29, 2024
Chris Churilo

Chris Churilo @chrischurilo

About: Working at another open source database company and still having fun! I am always looking to hear from Developers who are using our projects. I am always amazed at how clever some developers are!

Location:
San Francisco, CA
Joined:
Mar 10, 2022

Milvus Adventures July 29, 2024

Publish Date: Jul 29 '24
6 0

COMMUNITY

We had so much fun at the meetup this week in Palo Ato and can't wait to see you all again next month. We haven't had the chance to upload the video, yet, however, the Berlin and SF videos are up for your viewing pleasure.

  • Unstructured Data SF Meetup video
  • Unstructured Data Berlin Meetup video

Learn About Vector Databases

There are so many databases with Vector Search capabilities that it can be overwhelming to know where to start! This week, let's focus on learning about similarity metrics, the diffrence between sparse and dense vectors and get our hands dirty with some hands-on tutorials.

Get Started with Milvus

Milvus is an open source vector database that is a popular choice for builing all kinds of AI applications.

Vector Embeddings

In general, there are two types of vectors: dense vectors and sparse vectors. While they can be utilized for similar tasks, each has advantages and disadvantages.

You can also train your own models, learn more about sentence transformers and even give time series embedding a go!

Vector Indexes

Most vector search solutions rely on HNSW, but there are many other vector indexes and understanding the differences will help you create a performant and cost effective AI application. Here are two that you might not have heard about yet:

Learn RAG

Chunking Strategies

Optimizing your RAG applications

More cool tutorials on agents with Llama 3

GITHUB REPOS

Milvus Milvus is an open-source vector database built to power embedding similarity search and AI applications.

Akcio: Enhancing LLM-Powered ChatBot with CVP Stack A full chatbot app all open-source for you to try out for your self!

GPT Cache. GPTCache is an open-source tool designed to improve the efficiency and speed of GPT-based applications by implementing a cache to store the responses generated by language models.

VectorDBBench. VectorDBBench is an open-source benchmarking tool to help you evaluate the performance of mainstream vector databases and cloud services with yoru specific use case.

Comments 0 total

    Add comment