Document Retrieval System

Overview

This project implements a scalable backend for document retrieval, designed to provide context for large language models (LLMs) during inference, for example in chat applications. Given a user query, the system retrieves the most relevant documents from a database, caches responses in Redis for improved performance, enforces per-user rate limiting, and runs a background scraper that keeps the document store up to date.

Features

  • REST API with endpoints /health and /search
  • Caching with Redis
  • Rate limiting for users
  • Background scraping of news articles

Installation

  1. Clone the repository:

    git clone https://github.com/PXDHU/Document-Retrieval-System
  2. Navigate to the project directory:

    cd Document-Retrieval-System
  3. Build and run the Docker container:

    docker build -t document-retrieval-system .
    docker run -p 5000:5000 document-retrieval-system

Usage

  • Health Check Endpoint:

    • URL: /health
    • Method: GET
    • Description: Returns a random response, confirming that the API is active.
  • Search Endpoint:

    • URL: /search
    • Method: POST
    • Body:
      {
        "text": "search query",
        "top_k": 5,
        "threshold": 0.7,
        "user_id": "unique_user_id"
      }
    • Description: Returns a list of the top-matching results for the given query. Example requests for both endpoints are shown below.
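
As a concrete example, assuming the container is running locally on port 5000 (as in the Docker command above), both endpoints can be exercised with curl; the query values are illustrative:

    curl http://localhost:5000/health

    curl -X POST http://localhost:5000/search \
      -H "Content-Type: application/json" \
      -d '{"text": "latest AI research", "top_k": 5, "threshold": 0.7, "user_id": "user_123"}'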

Caching

  • Redis is used for caching search results. Cached results are stored with an expiration time of 300 seconds.
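
The sketch below shows one way such a cache layer might look in Python with the redis-py client. It illustrates the 300-second expiry described above and is not the project's actual code; run_search is a hypothetical stand-in for the database lookup:

    import json
    import redis

    # Redis connection used as the result cache.
    cache = redis.Redis(host="localhost", port=6379, db=0)

    CACHE_TTL_SECONDS = 300  # cached results expire after 300 seconds

    def cached_search(text, top_k, threshold):
        # Build a cache key from the query parameters.
        key = f"search:{text}:{top_k}:{threshold}"
        hit = cache.get(key)
        if hit is not None:
            return json.loads(hit)  # cache hit: skip the database query
        results = run_search(text, top_k, threshold)  # hypothetical DB search
        # Store the serialized results with the 300-second TTL.
        cache.setex(key, CACHE_TTL_SECONDS, json.dumps(results))
        return results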

Rate Limiting

  • Users are limited to 5 requests per hour. Exceeding this limit will result in a 429 status code.
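
A fixed-window counter in Redis is one simple way to enforce such a limit. The sketch below is an assumption about the mechanism, not the project's verified implementation:

    import redis

    r = redis.Redis(host="localhost", port=6379, db=0)

    MAX_REQUESTS = 5       # allowed requests per user per window
    WINDOW_SECONDS = 3600  # one-hour window

    def is_rate_limited(user_id):
        # One counter per user; the window starts at the user's first request.
        key = f"rate:{user_id}"
        count = r.incr(key)
        if count == 1:
            r.expire(key, WINDOW_SECONDS)
        return count > MAX_REQUESTS

    # In the request handler, a limited user would receive HTTP 429, e.g.:
    # if is_rate_limited(user_id):
    #     return jsonify({"error": "rate limit exceeded"}), 429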

Background Scraper

  • A background thread scrapes news articles every hour and updates the document database.
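
A minimal sketch of such a worker, assuming a daemon thread and a hypothetical scrape_news_articles helper that fetches and indexes new articles:

    import threading
    import time

    SCRAPE_INTERVAL_SECONDS = 3600  # run once per hour

    def scrape_loop():
        while True:
            scrape_news_articles()  # hypothetical: fetch articles, update the DB
            time.sleep(SCRAPE_INTERVAL_SECONDS)

    # Daemon thread so the scraper exits with the server process.
    threading.Thread(target=scrape_loop, daemon=True).start()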
