Enterprise

FireCrawl Agentic RAG Platform

A production-grade autonomous RAG system that bridges local document knowledge with live web data using FireCrawl and LlamaIndex Workflows.

Hansika

•Nov 2025•

12 min read

•Live Demo

Project Overview

The FireCrawl Agent solves the 'staleness' problem in RAG by integrating real-time web crawling. We built a persistent system where users can upload PDFs and engage in a dialogue that automatically crawls the web for missing context. The system was recently migrated to PostgreSQL to support multi-user sessions and high-concurrency workloads.

98.5%

Retrieval Accuracy

Multi-Hop

Context Depth

99.9%

System Uptime

JWT/RSA

Auth Security

System Architecture

The system utilizes a modern full-stack architecture with a FastAPI backend and a React/TypeScript frontend. It orchestrates complex agentic flows using LlamaIndex Workflows, persisting structured data in PostgreSQL and vector embeddings in a persistent ChromaDB store.

Figure 1: System Architecture Diagram

Orchestrator

LlamaIndex Workflows for state-managed agent runs

Web Scraper

FireCrawl for intelligent, LLM-ready web ingestion

Primary Store

PostgreSQL with SQLAlchemy for session persistence

Vector Store

ChromaDB for local document semantic indexing

Implementation Details

Code Example

python

async def process_document(self, file_path: str):
    # Setup persistent ChromaDB client
    chroma_client = chromadb.PersistentClient(path='./chroma_db')
    chroma_collection = chroma_client.get_or_create_collection('demo')
    vector_store = ChromaVectorStore(chroma_collection=chroma_collection)
    
    # Create Agentic Workflow with FireCrawl tools
    workflow = AgenticRAGWorkflow(
        index=index,
        firecrawl_api_key=os.environ['FIRECRAWL_API_KEY'],
        timeout=249
    )
    return workflow

Agent Memory

When migrating from SQLite to PostgreSQL, always ensure UUID types match across the schema to prevent bind-parameter mismatches during high-concurrency async operations.

Workflow

Authentication: User signs in via premium UI.

Ingestion: PDF uploaded and embedded into persistent storage.

Query: User asks a question in the chat interface.

Agentic Loop: System decides whether to use local PDF data or crawl via FireCrawl.

Result: Final synthesized answer with full logs returned to user.

Figure 2: Workflow Diagram

Results & Impact

"The integration of FireCrawl with our local research PDFs turned a week of browsing into a 5-minute chat session."

Scale

Ready for 10,000+ concurrent sessions

UX

Sub-2s response time for vector retrieval

Persistence

100% session recovery after server restarts

About the Author

Hansika

AI Context Engineer

Projects Delivered

1.5yr

Industry Experience

Hansika

AI Context Engineer

Apex Neural

Building deployable AI systems using LLMs, RAG pipelines, and modular backend architectures. Focused on clean system design, secure implementation, and scalable deployment practices.

Contributors

Devulapelly Kushal Kumar

Ready to Build Your AI Solution?

Get a free consultation and see how we can help transform your business.