Categories: Technology

Vector database company Qdrant wants RAG to be more cost-effective

We want to hear from you! Take our quick AI survey and share your insights on the current state of AI, how you’re implementing it, and what you expect to see in the future. Learn More


More companies are looking to include retrieval augmented generation (RAG) systems in their technology stack, and new methods to improve it are now coming to light. 

Vector database company Qdrant believes its new search algorithm, BM42, will make RAG more efficient and cost-effective. 

Qdrant, founded in 2021, developed BM42 to provide vectors to companies working on new search methods. The company wants to offer more hybrid search—which combines semantic and keyword search—to customers. 

Andrey Vasnetsov, co-founder and chief technology officer of Qdrant, said in an interview with VentureBeat that BM42 is an update to the algorithm BM25, which “traditional” search platforms use to rank the relevance of documents in search queries. RAG often uses vector databases or databases that store data as mathematical metrics that make it easy to match data.


Countdown to VB Transform 2024

Join enterprise leaders in San Francisco from July 9 to 11 for our flagship AI event. Connect with peers, explore the opportunities and challenges of Generative AI, and learn how to integrate AI applications into your industry. Register Now


“When we apply traditional keyword matching algorithms, the most commonly used one is BM25, which assumes documents have enough size to calculate statistics,” Vasnetsov said. “But we’re working with chunks of information now with RAG, so it doesn’t make sense to use BM25 anymore.”

Vasnetsov added that BM42 uses a language model, but instead of creating embeddings or representations of information, the model extracts the information from the documents. This information becomes tokens, which the algorithm then scores or weights in order to rank its relevance to the search question. This lets Qdrant pinpoint the exact information needed to answer a query.

Hybrid search has many options

However, BM42 is not the first method to look to overtake BM25 to make it easier to do hybrid research and RAG. One such option is Splade, which stands for Sparse Lexical and Expansion model

It works with a pre-trained language model that can identify relationships between words and include related terms that may not be the same between the search query text and the documents it references. 

While other vector database companies use Splade, Vasnetsov said BM42 is a more cost-efficient solution. “Splade can be very expensive because these models tend to be really huge and require a lot of computation. So it’s still expensive and slow,” he said. 

RAG is quickly becoming one of the hottest topics in enterprise AI, as companies want a way to use generative AI models and map these to their own data. RAG could bring more accurate and real-time information from company data to employees and other users. 

Companies like Microsoft and Amazon now offer infrastructure for cloud computing clients to build RAG applications. In June, OpenAI acquired Rockset to beef up its RAG capabilities. 

But while RAG lets users ground the information AI models read to company data, it is still a language model that can be prone to hallucinations. 

News Today

Share
Published by
News Today

Recent Posts

Kareena Kapoor’s Next Untitled Film With Meghna Gulzar Gets Prithviraj Sukumaran On Board

Kareena Kapoor is working with Raazi director Meghna Gulzar for her next film. The project,…

2 weeks ago

Purdue basketball freshman Daniel Jacobsen injured vs Northern Kentucky

2024-11-09 15:00:03 WEST LAFAYETTE -- Daniel Jacobsen's second game in Purdue basketball's starting lineup lasted…

2 weeks ago

Rashida Jones honors dad Quincy Jones with heartfelt tribute: ‘He was love’

2024-11-09 14:50:03 Rashida Jones is remembering her late father, famed music producer Quincy Jones, in…

2 weeks ago

Nosferatu Screening at Apollo Theatre Shows Student Interest in Experimental Cinema – The Oberlin Review

2024-11-09 14:40:03 A silent German expressionist film about vampires accompanied by Radiohead’s music — what…

2 weeks ago

What Are Adaptogens? Find Out How These 3 Herbs May Help You Tackle Stress Head-On

Let's face it - life can be downright stressful! With everything moving at breakneck speed,…

2 weeks ago

The new Mac Mini takes a small step towards upgradeable storage

Apple’s redesigned Mac Mini M4 has ditched the previous M2 machine’s SSD that was soldered…

2 weeks ago