Skip to content

Official repository for Decentralized Arena via Collective LLM Intelligence

Notifications You must be signed in to change notification settings

maitrix-org/de-arena

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 

Repository files navigation

Decentralized Arena via Collective LLM Intelligence: Building Automated, Robust, and Transparent LLM Evaluation for Numerous Dimensions

[Leaderboard] [Blog]

We release Decentralized Arena that automates and scales “Chatbot Arena” for LLM evaluation across various fine-grained dimensions (e.g., math – algebra, geometry, probability; logical reasoning, social reasoning, biology, chemistry, …). The evaluation is decentralized and democratic, with all LLMs participating in evaluating others. It achieves a 95% correlation with Chatbot Arena's overall rankings, while being fully transparent and reproducible.

More details coming soon...

About

Official repository for Decentralized Arena via Collective LLM Intelligence

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published