# RAG Knowledge Base
A private document RAG (Retrieval-Augmented Generation) system that ingests PDFs and exposes a search tool via an MCP server. Retrieval combines vector search (pgvector) and BM25 keyword search with cross-encoder reranking, and answers are generated by an AWS Bedrock LLM.
## Architecture

```mermaid
flowchart LR
    subgraph Ingestion
        direction TB
        A[📄 data/ PDFs] --> B[Docling PDF Parser]
        B --> C[HybridChunker BAAI/bge-m3 tokenizer]
        C --> D[HuggingFace Embeddings BAAI/bge-m3 · 1024-dim]
        D --> E[(pgvector PostgreSQL)]
        C --> F[(Redis BM25 Docstore)]
    end
    subgraph Query["Query → mcp_server.py"]
        direction TB
        G[search_knowledge tool call] --> H[Vector Retriever pgvector]
        G --> I[BM25 Retriever Redis]
        H & I --> J[QueryFusionRetriever relative_score fusion]
        J --> K[Cross-encoder Reranker BAAI/bge-reranker-large]
        K --> L[BedrockConverse LLM]
        L --> M[Answer + Sources]
    end
    E --> H
    F --> I
```
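The `relative_score` fusion step in the diagram can be sketched in pure Python: each retriever's scores are min-max normalized onto [0, 1] so that cosine similarities and BM25 scores become comparable, then summed per document. The function name and the toy result lists below are illustrative assumptions, not the library's internals.

```python
# Sketch of relative_score fusion for hybrid retrieval results.
# relative_score_fusion and the sample hit lists are hypothetical.

def relative_score_fusion(result_lists, top_k=3):
    """Merge ranked (doc_id, score) lists: min-max normalize each
    list's scores, then sum the normalized scores per document."""
    fused = {}
    for results in result_lists:
        scores = [s for _, s in results]
        lo, hi = min(scores), max(scores)
        span = (hi - lo) or 1.0  # guard against uniform-score lists
        for doc_id, score in results:
            fused[doc_id] = fused.get(doc_id, 0.0) + (score - lo) / span
    return sorted(fused.items(), key=lambda kv: kv[1], reverse=True)[:top_k]

# Vector scores (cosine similarity) and BM25 scores live on very
# different scales; normalization is what makes the sum meaningful.
vector_hits = [("doc_a", 0.92), ("doc_b", 0.85), ("doc_c", 0.40)]
bm25_hits = [("doc_b", 14.2), ("doc_d", 9.1), ("doc_a", 2.3)]
print(relative_score_fusion([vector_hits, bm25_hits]))
```

A document that ranks well in both lists (here `doc_b`) rises to the top even if it leads neither list outright, which is the main benefit of score fusion over taking either retriever alone.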
## Configuration

`ingestion/config.py` is the source of truth for supported environment variables and their defaults. Copy `.env.example` to `.env` and update the credentials and endpoints for your environment.
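A config module of this kind typically reads each setting from the environment and falls back to a default. The sketch below shows the pattern; the variable names and default values are assumptions for illustration, not the ones the project actually defines (check `ingestion/config.py` for those).

```python
# Hypothetical shape of an env-driven config module.
import os

def env(name, default):
    """Read a setting from the environment, falling back to a default."""
    return os.environ.get(name, default)

# Illustrative settings; names and defaults are assumed, not from the repo.
PG_CONN = env("PG_CONN", "postgresql://localhost:5432/rag")
REDIS_URL = env("REDIS_URL", "redis://localhost:6379/0")
EMBED_MODEL = env("EMBED_MODEL", "BAAI/bge-m3")  # 1024-dim embeddings
RERANK_MODEL = env("RERANK_MODEL", "BAAI/bge-reranker-large")
```

Keeping every default in one module means a `.env` file only needs to override the values that differ from the local setup.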