stassi_statistics_assistant

Stassi is a Streamlit app that helps answer statistical data analysis questions through an easy-to-use chatbot interface powered by OpenAI's GPT-4.

A user can input a columnar data set (up to 200 MB) and analyze it by asking questions based on statistical methods.

The app provides three pathways for analysis:

Data Analysis – The user's input data is read into a Pandas DataFrame, which can be queried by asking questions about the data. On the back end, Python code is executed to produce the answers.
Retrieval – A Retrieval-Augmented Generation (RAG) based LLM draws on a set of statistics books to answer questions of a technical nature.
Web Search – An option to query the internet when additional information is required or the user wants to check more sources.
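As a rough illustration of the Data Analysis pathway, the sketch below shows the kind of Pandas code that could run behind a question such as "What is the mean score per group?". The sample data, column names, and variable names are made up for this example; the code the app actually generates depends on the question and on the LLM.

```python
import io

import pandas as pd

# Hypothetical sample data standing in for a user's input file.
csv_text = """group,score
A,12.0
A,15.0
B,20.0
B,22.0
B,27.0
"""

# Read the columnar data into a DataFrame.
df = pd.read_csv(io.StringIO(csv_text))

# One possible translation of "What is the mean score per group?"
# into Pandas: group the rows, then average the score column.
mean_by_group = df.groupby("group")["score"].mean()

print(mean_by_group.to_dict())  # {'A': 13.5, 'B': 23.0}
```

In the app, the user never writes this code; it is produced and executed on the back end, and only the answer is shown in the chat.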

Technologies used: LangChain, Streamlit

Repository Structure

code

'data_analysis_llm.py'

Contains the bulk of the implementation: RAG, Python-based data analysis, and web search.

'prompts.py'

Contains the prompts used by LangChain agents and chains.

'RAG_embeddings.py'

Contains code to generate and locally store FAISS embeddings of some books on Statistics.

docs

Directory for books used by the RAG. Book sources listed below:

statistics_llm_faiss_index

Contains the FAISS embeddings generated by executing RAG_embeddings.py, stored locally for easy retrieval.
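To give a feel for what retrieval over stored embeddings involves, here is a minimal pure-Python sketch of a similarity search. It is not FAISS and not the code in RAG_embeddings.py: the three-dimensional vectors, the passage labels, and the helper names cosine_similarity and top_k are all invented for illustration, and cosine similarity is used here as a simple stand-in for whatever metric the real index applies.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def top_k(query_vec, index, k=2):
    """Return the k passages whose vectors are most similar to the query."""
    scored = [(cosine_similarity(query_vec, vec), text) for text, vec in index]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [text for _, text in scored[:k]]


# Toy "index": (passage, embedding) pairs with made-up vectors.
toy_index = [
    ("passage on t-tests", [0.9, 0.1, 0.0]),
    ("passage on linear regression", [0.1, 0.9, 0.1]),
    ("passage on ANOVA", [0.7, 0.2, 0.3]),
]

# Made-up embedding of a user's question.
query = [1.0, 0.0, 0.1]

results = top_k(query, toy_index, k=2)
print(results)  # ['passage on t-tests', 'passage on ANOVA']
```

A real index does the same job at much larger scale: the question is embedded, the closest passages from the books are looked up, and those passages are handed to the LLM as context for its answer.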

Running Stassi

To run the app, follow these steps:

Clone the repo
Run the Streamlit app by executing: streamlit run ~/code/data_analysis_llm.py
There is no need to regenerate the embeddings, as they are already present in the repo. If you would like to regenerate them anyway, execute the following from the command line: python3 ~/code/RAG_embeddings.py
Once the app is running in your browser, enter your OpenAI API key with access to GPT-4 in the sidebar widget to get started.
