feat - G_EVAL #16

yeowonh · 2024-03-25T06:56:31Z

Overview

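Built the evaluation metrics with reference to the G_EVAL and UniEVAL papers and repositories
Generated detailed evaluation steps (hereafter CoT) for each metric with GPT-4
Preprocessed the Korean MSC dataset for the experiment (converted utterances to informal speech, changed Person1-Person2 dialogues into User-Bot dialogues)
Extracted user personas from user utterances accumulated over 7 or more turns, using the Persona Extraction model (model=et5, max_token=200, beam_size=2)
Used the labels provided in the Korean MSC dataset as the chatbot personas
Set up an experiment that generates the next utterance from the user persona, the chatbot persona, and the preceding 5 turns of dialogue
Fed two prompts, (1) one that includes the user persona and (2) one that does not, to the OPEN-SOLAR-KO-10.7B model to generate the next-turn response
Evaluated the two generated responses on 4 evaluation metrics
Evaluation used a weighted average of GPT-4 scores (n=10) and human evaluation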
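
The two prompt conditions and the GPT-4 weighted average can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `build_prompt`, `weighted_score`, the prompt layout, and the reading of "turn" as a single utterance are all assumptions. The scoring assumes the G-EVAL formulation, where the final score is the sum of each rating times its probability, and the probability is estimated from the n=10 sampled ratings.

```python
from collections import Counter
from typing import Optional, Sequence


def build_prompt(bot_persona: str, history: Sequence[str],
                 user_persona: Optional[str] = None, max_turns: int = 5) -> str:
    """Assemble a next-utterance prompt (hypothetical layout, not the PR's template)."""
    parts = []
    if user_persona is not None:
        # Condition 1: the prompt includes the user persona.
        parts.append(f"User persona: {user_persona}")
    parts.append(f"Bot persona: {bot_persona}")
    # Keep only the most recent turns (assumed here: one turn = one utterance).
    recent = list(history[-max_turns:]) if max_turns > 0 else []
    parts.extend(recent)
    parts.append("Bot:")
    return "\n".join(parts)


def weighted_score(samples: Sequence[int]) -> float:
    """Return sum(s * p(s)), with p(s) estimated as the relative
    frequency of rating s among the sampled ratings."""
    if not samples:
        raise ValueError("need at least one sampled score")
    counts = Counter(samples)
    return sum(score * count for score, count in counts.items()) / len(samples)


if __name__ == "__main__":
    history = [f"User: message {i}" for i in range(7)]
    with_persona = build_prompt("likes cats", history, user_persona="lives in Seoul")
    without_persona = build_prompt("likes cats", history)
    print(with_persona)
    print(without_persona)
    # Ten hypothetical ratings sampled from the evaluator for one response.
    print(weighted_score([4, 4, 5, 3, 4, 4, 5, 4, 3, 4]))
```

When probabilities are estimated purely from sample frequencies, this weighted sum equals the arithmetic mean of the samples. The weighting would only differ if token probabilities were read from the model directly.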

feat - Add Django-based main_app and a README.md on how to start the server

feat - Add DB init shell script for use in the postgres image

bug - Add catch-all Exception handling for httpx in the retrospective view

iloveonsen and others added 17 commits March 24, 2024 12:34

feat - Add Django-based main_app and a README.md on how to start the server

aeff424

#6

Merge pull request #7 from boostcampaitech6/feat/6

a4811ac

feat - Add Django-based main_app and a README.md on how to start the server

feat - Add DB init shell script for use in the postgres image

4d3467c

#6

Merge pull request #8 from boostcampaitech6/feat/6

5a7be53

feat - Add DB init shell script for use in the postgres image

bug - Add catch-all Exception handling for httpx in the retrospective view

c2742d1

#6

Merge pull request #9 from boostcampaitech6/feat/6

99bb842

bug - Add catch-all Exception handling for httpx in the retrospective view

Make CoT prompts in English and Korean

ca67590

Preprocess korean msc dataset

5a3e0dc

Make single-turn utterance prediction with SOLAR-KO

d4105dd

Add convert_to_json function

17e2311

Add Evaluate code by GPT-4

f95bce0

Add Evaluate result files

b4659dc

Add Human evaluation code

5ac83ca

Add Human Evaluation code and data

c853553

Modify g_eval_model_prediction.ipynb

7db7983

Merge branch 'feat/17/response_experiments'

fe381df

Merge dialog test to G-EVAL and make ml folder

d447df9

yeowonh changed the base branch from develop to main March 29, 2024 03:47

yeowonh merged commit 4ae4436 into main Mar 29, 2024

yeowonh deleted the feat/3/geval branch March 29, 2024 03:48