Chaiverse calculates the ELO of LLMs via human feedback, obtained through blind selection testing using the Chai app. This involves gathering positive and negative upvotes from randomly selected models on our leaderboard, as experienced by end users. With over one million samples a day, we are able to build a ranking of LLMs - which for our use-case, seems to be the most accurate.

Submit your model now, and start collecting feedback on your model’s ranking in under five minutes! Feel free to join us on our Discord server to discuss the leaderboard and chat with other participants!

Submissions in the last 24 hours: 19
1 hour ago by chai_backend_admin πŸš€
Model: blend_johuf_2024-09-11
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
1 hour ago by zonemercy πŸš€
Model: chaiml-test-feed-convo-v1-1e5_v5
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
2 hours ago by valentin87 πŸš€
Model: valentin87-duke-jack-dra_2042_v3
HuggingFace: valentin87/Duke_Jack_Dragonbane
Model Size: 13B
Elo: 1224
Human feedbacks: 10839
4 hours ago by valentin87 πŸš€
Model: valentin87-duke-jack-dra_2042_v2
HuggingFace: valentin87/Duke_Jack_Dragonbane
Model Size: 13B
Elo: 1240
Human feedbacks: 13152
4 hours ago by valentin87 πŸš€
Model: valentin87-duke-jack-dra_2042_v1
HuggingFace: valentin87/Duke_Jack_Dragonbane
Model Size: 13B
Elo: 1159
Human feedbacks: 13119
6 hours ago by zonemercy πŸš€
Model: chaiml-test-feed-convo-v1-1e5_v2
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
6 hours ago by zonemercy πŸš€
Model: chaiml-test-feed-convo-v1-1e5_v1
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
6 hours ago by Trace2333 πŸš€
Model: trace2333-mistral-trail10_v2
HuggingFace: Trace2333/mistral_trail10
Model Size: 13B
Elo: 1235
Human feedbacks: 12809
7 hours ago by Trace2333 πŸš€
Model: trace2333-mistral-trail10_v1
HuggingFace: Trace2333/mistral_trail10
Model Size: 13B
Elo: 1240
Human feedbacks: 12984
7 hours ago by chai_backend_admin πŸš€
Model: blend_nujon_2024-09-11
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
Loading...