
Hybrid Search = Sparse + Dense RAG

· 4 min read
Damon Lee
Fission Team

Why We Use Hybrid Search RAG (Sparse + Dense Embeddings + Re-Ranker) Instead of Naive RAG

Problem Statement: Decentralized Web3 Agents and the Need for Efficient Data Retrieval

The emergence of decentralized Web3 agents has redefined the landscape of AI-driven automation. Unlike traditional centralized frameworks, these agents operate on decentralized platforms, emphasizing transparency, user ownership, and multi-modal data processing. However, managing and retrieving data in decentralized environments poses unique challenges:

  1. Data Fragmentation: Information is scattered across multiple decentralized nodes, making efficient retrieval complex.
  2. Diverse Data Modalities: Web3 agents require access to text, images, and structured metadata to function effectively.
  3. Performance Bottlenecks: Standard retrieval mechanisms struggle with scalability and semantic understanding in decentralized systems.

This is where Hybrid Search RAG—a sophisticated blend of sparse and dense embedding retrieval with re-ranking—becomes a game-changer. It not only addresses these challenges but also sets a new benchmark for data retrieval in decentralized frameworks.
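The fusion step at the heart of hybrid search can be sketched in a few lines of Python. Everything below is an illustrative stand-in rather than Fission's actual retrieval stack: keyword overlap plays the role of a sparse (BM25-style) retriever, a toy cosine similarity plays the role of a dense embedding model, and Reciprocal Rank Fusion (RRF) merges their ranked lists.

```python
import math
from collections import Counter

def sparse_ranking(query, docs):
    """Rank docs by keyword overlap -- a stand-in for BM25/TF-IDF."""
    q = Counter(query.lower().split())
    scored = {i: sum((q & Counter(d.lower().split())).values())
              for i, d in enumerate(docs)}
    return sorted(scored, key=scored.get, reverse=True)

def dense_ranking(query_vec, doc_vecs):
    """Rank docs by cosine similarity -- a stand-in for a dense embedder."""
    def cos(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        n = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
        return dot / n if n else 0.0
    scored = {i: cos(query_vec, v) for i, v in enumerate(doc_vecs)}
    return sorted(scored, key=scored.get, reverse=True)

def rrf_fuse(rankings, k=60):
    """Reciprocal Rank Fusion: merge the sparse and dense ranked lists.

    In a full hybrid pipeline, a cross-encoder re-ranker would then
    re-score this fused short-list; RRF alone stands in for that here.
    """
    scores = Counter()
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking):
            scores[doc_id] += 1.0 / (k + rank + 1)
    return [doc_id for doc_id, _ in scores.most_common()]
```

In production, each retriever returns its own top-k candidates; RRF (or a weighted score sum) merges them, and the re-ranker produces the final ordering seen by the generator.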

What is Naive RAG?

Naive RAG integrates a generative AI model with a retrieval component that fetches relevant documents from a database. This retrieval is typically based on:

  1. Sparse Embeddings: Keyword-based representations (e.g., TF-IDF or BM25) that match documents by term overlap.
  2. Dense Embeddings: Learned vector representations compared by similarity (e.g., cosine distance) to capture semantic meaning.

While effective for basic applications, naive RAG has critical shortcomings:

  1. Limited Context Understanding: Sparse embeddings often fail to capture semantic nuances, especially in multi-modal data.
  2. Suboptimal Ranking: Dense embeddings can retrieve irrelevant documents due to the lack of a fine-grained re-ranking mechanism.
  3. Scalability Issues: Naive implementations struggle to efficiently handle large-scale or multi-modal datasets.
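For contrast, the shape of a naive RAG pipeline can be sketched as a single dense pass with no sparse signal and no re-ranking. The tiny fixed vocabulary and bag-of-words "embedding" below are illustrative assumptions standing in for a real embedding model:

```python
import math

VOCAB = ["staking", "gas", "fees", "nft", "dao", "wallet"]

def embed(text):
    # Bag-of-words vector over a tiny fixed vocabulary -- a toy
    # stand-in for a learned dense embedding model.
    toks = text.lower().split()
    return [float(toks.count(w)) for w in VOCAB]

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def naive_rag_retrieve(query, corpus, top_k=2):
    # One dense retrieval pass: rank every document by cosine
    # similarity to the query and keep the top-k. No sparse signal,
    # no re-ranker -- the weaknesses listed above.
    q = embed(query)
    ranked = sorted(corpus, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:top_k]

def build_prompt(query, passages):
    # Stuff the retrieved passages into the generator's prompt.
    context = "\n".join(f"- {p}" for p in passages)
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"
```

Whatever this single similarity pass misses (exact keywords, rare jargon) never reaches the generator, which is precisely the gap the hybrid approach closes.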

RAFT: RAG-Based Fine-Tuning

· 4 min read
Damon Lee
Fission Team

Why We Need RAFT: Adapting Language Models to Domain-Specific RAG

The evolution of Retrieval-Augmented Generation (RAG) has unlocked unprecedented possibilities in AI, enabling generative models to retrieve and incorporate external data dynamically. However, as AI frameworks increasingly interface with domain-specific contexts like Web3, there is a growing need for a specialized adaptation mechanism: RAFT (Retrieval-Augmented Fine-Tuning). This blog explores why RAFT is essential for adapting language models to domain-specific RAG, enhancing real-time interactions with the Web3 community and its users.

The Challenge: Domain-Specificity in RAG

Web3 ecosystems are inherently dynamic and domain-specific, characterized by:

  1. Unique Jargon and Concepts: Terms like "staking," "DAO," "NFT minting," and "gas fees" are ubiquitous in Web3 but rarely encountered in general-purpose datasets.
  2. Rapidly Evolving Information: Web3 platforms are continuously updated with new protocols, smart contracts, and token standards.
  3. Decentralized Data Sources: Information is dispersed across blockchains, decentralized file systems, and community-managed repositories.

While RAG frameworks excel in retrieving relevant data, they often struggle with adapting generative outputs to these domain-specific requirements. Without fine-tuning, language models risk producing generic or irrelevant responses that fail to meet the expectations of Web3 users.
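A RAFT training set is assembled by pairing each question with its golden ("oracle") document plus sampled distractor documents, and sometimes dropping the oracle entirely so the model also learns to behave when retrieval fails. The function and field names below are a hypothetical sketch of that recipe, not the RAFT authors' code; in a real pipeline the training target would be a chain-of-thought answer that cites the oracle passage.

```python
import random

def make_raft_example(question, oracle_doc, distractor_pool,
                      num_distractors=3, p_oracle=0.8, rng=random):
    """Build one RAFT fine-tuning example (hypothetical schema).

    With probability p_oracle the oracle document appears in the
    context; otherwise the model sees only distractors, teaching it
    to recognize when the answer is not retrievable.
    """
    context = rng.sample(distractor_pool, num_distractors)
    if rng.random() < p_oracle:
        context.append(oracle_doc)
    rng.shuffle(context)
    return {"question": question, "context": context, "oracle": oracle_doc}
```

Fine-tuning on examples like these teaches the model to quote the relevant passage and ignore the distractors, which is exactly the domain adaptation that off-the-shelf RAG lacks.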