This project implements a Retrieval-Augmented Generation (RAG) based medical chatbot in a single Jupyter Notebook. The chatbot aims to provide accurate and contextually relevant responses to medical queries by combining retrieval from a vector database with a generative language model.
Medical chatbots can play a crucial role in providing healthcare information, symptom assessment, and wellness advice, particularly in remote areas or outside office hours. This project uses a RAG approach: the chatbot retrieves relevant contexts from a knowledge base and grounds its generated responses in them, improving interaction quality and information accuracy.
Data Gathering and Preparation:
- Load and prepare a medical dataset from Hugging Face (e.g., PubMed QA) to serve as the chatbot's knowledge base.
- Preprocess and chunk context passages so they fit within the embedding model's input length limit.
Vector Database Creation:
- Create dense and sparse embeddings of medical contexts using a Sentence Transformers model and a SPLADE encoder, respectively.
- Store embeddings in a Pinecone vector database to enable fast and accurate retrieval.
Retrieval-Augmented Generation (RAG) Pipeline:
- Build a RAG pipeline with LangChain, combining retrieval from Pinecone and text generation with OpenAI’s GPT model.
- Queries are passed to the RAG pipeline, which retrieves relevant contexts and generates a response.
Evaluation:
- Use the RAGAS evaluation metrics to assess the quality of responses based on context recall, context precision, faithfulness, answer relevancy, answer similarity, and answer correctness.
- Visualize evaluation results for better insights into model performance.
Import Libraries: Import all necessary Python libraries such as Hugging Face datasets, Pinecone, Sentence Transformers, LangChain, and RAGAS.
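As a minimal sketch, the top of the notebook might look like the following (exact import paths vary with library versions; these follow the classic APIs of each package):
# Core imports used throughout the notebook
from datasets import load_dataset
from sentence_transformers import SentenceTransformer
from pinecone import Pinecone
from langchain.chains import RetrievalQA
from ragas import evaluate
import matplotlib.pyplot as plt
import seaborn as sns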
Data Loading and Preprocessing:
- Load a dataset from Hugging Face.
- Chunk contexts into manageable sizes.
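A minimal sketch of this step, assuming the PubMed QA "pqa_labeled" configuration and a character-based splitter (the dataset config, chunk size, and overlap are illustrative choices, not fixed by the notebook):
# Load the PubMed QA dataset and split long context passages into chunks
from datasets import load_dataset
from langchain.text_splitter import RecursiveCharacterTextSplitter

dataset = load_dataset("pubmed_qa", "pqa_labeled", split="train")
splitter = RecursiveCharacterTextSplitter(chunk_size=512, chunk_overlap=50)

chunks = []
for record in dataset:
    # Each record stores its context passages under record["context"]["contexts"]
    for passage in record["context"]["contexts"]:
        chunks.extend(splitter.split_text(passage))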
Vector Database Setup:
- Generate dense and sparse embeddings for each context.
- Upload embeddings to Pinecone for efficient retrieval.
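A sketch of the hybrid indexing step. The SPLADE encoder here comes from the pinecone-text package (not listed in requirements.txt), and the index name and dense model are assumptions:
# Encode each chunk densely (Sentence Transformers) and sparsely (SPLADE),
# then upsert both representations into a Pinecone index
from sentence_transformers import SentenceTransformer
from pinecone_text.sparse import SpladeEncoder
from pinecone import Pinecone

dense_model = SentenceTransformer("all-MiniLM-L6-v2")
splade = SpladeEncoder()

pc = Pinecone(api_key="YOUR_PINECONE_API_KEY")
index = pc.Index("medical-chatbot")

vectors = [
    {
        "id": str(i),
        "values": dense_model.encode(chunk).tolist(),
        "sparse_values": splade.encode_documents(chunk),
        "metadata": {"text": chunk},
    }
    for i, chunk in enumerate(chunks)
]
index.upsert(vectors=vectors)  # batch the upserts for large datasets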
RAG Pipeline Setup:
- Set up LangChain with Pinecone for context retrieval.
- Use OpenAI’s GPT model to generate answers based on retrieved contexts.
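A sketch of the chain setup, producing the `qa` object used in the query example below (class locations follow the classic LangChain API; newer releases move them into langchain-community and langchain-openai):
# Wrap the Pinecone index as a LangChain retriever and attach a GPT model
from langchain.vectorstores import Pinecone as PineconeVectorStore
from langchain.embeddings import HuggingFaceEmbeddings
from langchain.chat_models import ChatOpenAI
from langchain.chains import RetrievalQA

embeddings = HuggingFaceEmbeddings(model_name="all-MiniLM-L6-v2")
vectorstore = PineconeVectorStore.from_existing_index(
    index_name="medical-chatbot", embedding=embeddings, text_key="text"
)

qa = RetrievalQA.from_chain_type(
    llm=ChatOpenAI(model="gpt-3.5-turbo", temperature=0),
    retriever=vectorstore.as_retriever(search_kwargs={"k": 5}),
    return_source_documents=True,
)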
Evaluation:
- Test the chatbot's responses on sample queries.
- Calculate performance metrics using RAGAS, as in the snippets below:
# Ask a question using the RAG pipeline
query = "What are the symptoms of diabetes?"
result = qa.invoke(query)
print("Generated Response:", result['result'])
# Evaluate the RAG pipeline with sample queries and responses
from datasets import Dataset
from ragas import evaluate
from ragas.metrics import faithfulness, answer_relevancy, context_precision, context_recall

eval_data = [...]  # Prepare sample evaluation data (question, answer, contexts, ground_truth)
ragas_eval = Dataset.from_list(eval_data)  # RAGAS expects a Hugging Face Dataset

result = evaluate(
    dataset=ragas_eval,
    metrics=[faithfulness, answer_relevancy, context_precision, context_recall],
)
print("Evaluation Results:", result)
Install Dependencies: Install all required Python libraries with:
pip install -r requirements.txt
API Keys:
- Set up an API key for Pinecone in your environment variables.
- Set up an API key for OpenAI in your environment variables if using GPT for generation.
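If the keys are not already exported in your shell, one way to set them inside the notebook (PINECONE_API_KEY and OPENAI_API_KEY are the variable names these clients commonly read):
import os
os.environ["PINECONE_API_KEY"] = "your-pinecone-key"
os.environ["OPENAI_API_KEY"] = "your-openai-key"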
Run the Notebook: Open the Jupyter Notebook and follow the code sections to execute each part of the workflow.
Requirements:
- Python 3.8+
- transformers
- datasets
- pinecone-client
- sentence-transformers
- langchain
- ragas
- matplotlib
- seaborn
Limitations:
- The quality and scope of the chatbot's responses are limited by the dataset provided.
- Requires access to the Pinecone and OpenAI APIs for retrieval and generation.