PolkaGPT - All Polka-lover #2138

Closed
146 changes: 146 additions & 0 deletions applications/PolkaGPT.md
# PolkaGPT - All about Polka-lover

- **Team Name:** OraSci
- **Payment Address:** ETH : 0xb210Ed16Dbcb2C08b9216DD62Eaa64F7FF2eDF4B
- **[Level](https://github.com/w3f/Grants-Program/tree/master#level_slider-levels):** 1

## Project Overview :page_facing_up:

`PolkaGPT` is an application built on a large language model (`LLM`) that helps developers, researchers, and users search documentation, get code suggestions, find hints for fixing bugs, and more. In addition, the open-source Polkadot community can contribute to the data repository so that `PolkaGPT` is updated continuously.

1. **In-Depth Content:**

`PolkaGPT` provides detailed content covering various aspects of the Polkadot ecosystem:

+ Parachain
+ Polkadot/Kusama
+ Polkadot SDK
+ Developer tools
+ More

2. **Developer Resources:**

`PolkaGPT` offers developer resources, including coding examples, best practices, and insights into the tools and frameworks available for efficient development.

3. **Open-source Contribution**:

Using feedback from the developer community, users, and researchers, `PolkaGPT` can provide more accurate information more quickly. It also exposes the source documentation that `PolkaGPT` has been trained on.

4. **Specialized Knowledge:**

`PolkaGPT` is designed to be a specialized hub for anyone wanting to explore the **Polkadot** network comprehensively.

### Project Details

To illustrate how `PolkaGPT` works:

#### Architecture

```mermaid
flowchart LR
subgraph Backend_Service
A[(Multiple Database)] -->C
B[User]-->C{Answer in database?}
end
C --> |False| D[LLM Agent Validation Service]
subgraph LLM_Agent_Model
E[Document Search] --> H[LLM]
F[Youtube Search Agent] --> I((List Link))

end

C --> |True|E
C --> |True|F
I --> L((Answer))
H --> L
L --> K((Final Answer))
D --> K
G[(Vector Database)] --> E

```
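
The routing in the flowchart can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the team's implementation: `route_question`, `FinalAnswer`, and the three callables are hypothetical names, the database lookup is reduced to a set membership test, and the branches follow the diagram exactly as drawn (the "True" branch goes to the document and YouTube search agents, the "False" branch goes to the validation service).

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class FinalAnswer:
    """Hypothetical container for what the diagram calls "Final Answer"."""
    text: str
    links: list[str] = field(default_factory=list)
    source: str = ""


def route_question(
    question: str,
    known_questions: set[str],
    document_search: Callable[[str], str],
    video_search: Callable[[str], list[str]],
    validation_service: Callable[[str], str],
) -> FinalAnswer:
    """Follow the branches of the flowchart as drawn (all names are assumptions)."""
    if question in known_questions:
        # "True" branch: document search feeds the LLM, video search yields links.
        text = document_search(question)
        links = video_search(question)
        return FinalAnswer(text=text, links=links, source="agent_model")
    # "False" branch: the question is handed to the validation service.
    return FinalAnswer(text=validation_service(question), source="validation_service")


if __name__ == "__main__":
    # Stand-in callables; a real system would call the vector database and an LLM.
    answer = route_question(
        "What is a parachain?",
        known_questions={"What is a parachain?"},
        document_search=lambda q: f"Summary of documents matching: {q}",
        video_search=lambda q: ["https://example.invalid/video"],
        validation_service=lambda q: f"Needs validation: {q}",
    )
    print(answer.source, answer.links)
```

Passing the search and validation steps in as callables keeps the routing testable without a database, vector store, or model behind it.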
#### Demonstration
![Image 1](https://polkagpt.infura-ipfs.io/ipfs/QmYhGEVDJpvn6Spnzrf82DomMiY2UTHYQFHVhmnWoWsfMF)

![Image 2](https://polkagpt.infura-ipfs.io/ipfs/QmTviZGs8MB9RqifkYP4sRzvcvBgXGPjiHUq26rVdstfpd)

### Ecosystem Fit

`PolkaGPT` will help new developers and users get started with the **Polkadot** ecosystem quickly and easily. The system will present information about **Polkadot** in a user-friendly way while still offering specialized, in-depth insights. **`BUT POLKAGPT MAY MAKE MISTAKES, SO IT'S NECESSARY TO DOUBLE-CHECK THE INFORMATION.`**


## Team :busts_in_silhouette:

### Team members / Contact

- **Contact Name:** Dung Ho Dinh (Dustin) / Nhat Cuong / Dzung Doan / Tinh Do Pham Phuc
- **Contact Email:** [email protected] / [email protected] / [email protected] / [email protected]


### Team's experience

+ Dustin is a blockchain developer with 3+ years of experience in smart contracts: Solidity (EVM-based) on ETH, BSC, and other chains, and Rust on NEAR and Polkadot. He has also worked with Rust/Substrate in the Polkadot ecosystem.

+ Nhat Cuong is currently working in Research and Development at VBI Academy with 2 years of experience in the field of artificial intelligence. He is currently researching the application of LLMs in educational systems. Additionally, he has experience in Rust and DevOps.

+ Dzung is an AI/blockchain developer with over 5 years of experience in the AI sector, contributing to products including eKYC, chatbots, and recommendation systems. He is also a proficient blockchain developer, familiar with Solidity and Rust. His portfolio includes on-chain projects such as Nearlend protocol, DinoLand, and Mechmaster.

+ Tinh majors in Data Science at the University of Information Technology, VNU-HCM. He is interested in natural language processing, especially question answering and conversational AI (chatbots).


### Team Code Repos

- https://github.com/OraSci-Lab/polkaGPT

## Development Status :open_book:
`PolkaGPT` is currently at the idea stage, and the team is implementing a complete MVP.


## Development Roadmap :nut_and_bolt:

### Overview

- **Total Estimated Duration:** 8 weeks
- **Full-Time Equivalent (FTE):** 1.0 FTE
- **Total Costs:** 10,000 USD

### Milestone 1

- **Estimated Duration:** 1 month
- **FTE:** 1
- **Costs:** 3,000 USD


| Number | Deliverable | Specification |
| -----: | ----------- | ------------- |
| **0a.** | License | Apache 2.0 / GPLv3 / MIT / Unlicense |
| **0b.** | Documentation | Documentation describing the whole PolkaGPT architecture, including the AI prompt and the backend |
| **0c.** | Testing Guide | A full test suite and testing guide for the POC |
| **1a.** | AI prompt | Research and implement PolkaGPT features: retrieval tools, search tools, user interaction, ... |
| **1b.** | Backend Service | Design and implement the backend service for the AI model |
| **2.** | Frontend | POC backend for the PolkaGPT chatbot that refers users to the relevant docs |
| **3.** | Docker | Dockerise the backend service and the automatically trained AI model |



### Milestone 2

- **Estimated Duration:** 1 month
- **FTE:** 1
- **Costs:** 7,000 USD



| Number | Deliverable | Specification |
| -----: | ----------- | ------------- |
| **1a.** | Backend | Apply Redis and microservice technology to make responses faster and more accurate |
| **1b.** | AI Prompt | Optimize AI model |
| **1c.** | Blockchain | Use trustworthy data based on blockchain technology to train the model, for better precision and robustness |
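
One way the Redis deliverable could speed up responses is a cache-aside lookup in front of the model. The sketch below is an assumption about how that might look, not the planned design: `TTLCache` is a hypothetical in-memory stand-in that mimics a key/value store with expiry so the example runs without a Redis server, and `cached_answer` and `generate` are illustrative names.

```python
import time
from typing import Callable, Optional


class TTLCache:
    """In-memory stand-in for a Redis-style key/value store with expiry (assumption)."""

    def __init__(self, clock: Callable[[], float] = time.monotonic) -> None:
        self._clock = clock
        self._store: dict[str, tuple[float, str]] = {}

    def get(self, key: str) -> Optional[str]:
        entry = self._store.get(key)
        if entry is None:
            return None
        expires_at, value = entry
        if self._clock() >= expires_at:
            # Expired entries are dropped lazily on read.
            del self._store[key]
            return None
        return value

    def set(self, key: str, value: str, ttl_seconds: float) -> None:
        self._store[key] = (self._clock() + ttl_seconds, value)


def cached_answer(
    question: str,
    cache: TTLCache,
    generate: Callable[[str], str],
    ttl_seconds: float = 3600.0,
) -> str:
    """Cache-aside: return a stored answer if present, otherwise generate and store it."""
    # Normalise case and whitespace so trivially different phrasings share a key.
    key = " ".join(question.lower().split())
    hit = cache.get(key)
    if hit is not None:
        return hit
    answer = generate(question)
    cache.set(key, answer, ttl_seconds)
    return answer
```

A cache like this only improves latency for repeated questions. The expiry time bounds how long a stale answer can be served after the underlying documents change.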


## Future Plans
+ Optimize the backend service
+ Optimize the AI prompt
+ `Trustworthy Data` based on `Polkadot` technology (under research)

