Chaiverse calculates the ELO of LLMs via human feedback, obtained through blind selection testing using the Chai app. This involves gathering positive and negative upvotes from randomly selected models on our leaderboard, as experienced by end users. With over one million samples a day, we are able to build a ranking of LLMs - which for our use-case, seems to be the most accurate.

Submit your model now, and start collecting feedback on your model’s ranking in under five minutes! Feel free to join us on our Discord server to discuss the leaderboard and chat with other participants!

Submissions in the last 24 hours: 17
1 hour ago by Nitral-AI πŸš€
Model: nitral-ai-captain-bmo-12b_v10
HuggingFace: Nitral-AI/Captain_BMO-12B
Model Size: 13B
Elo: 1267
Human feedbacks: 11368
1 hour ago by Nitral-AI πŸš€
Model: nitral-ai-captain-bmo-12b_v9
HuggingFace: Nitral-AI/Captain_BMO-12B
Model Size: 13B
Elo: 1269
Human feedbacks: 10629
1 hour ago by Nitral-AI πŸš€
Model: nitral-ai-captain-bmo-12b_v8
HuggingFace: Nitral-AI/Captain_BMO-12B
Model Size: 13B
Elo: 1291
Human feedbacks: 8779
1 hour ago by Nitral-AI πŸš€
Model: nitral-ai-captain-bmo-12b_v7
HuggingFace: Nitral-AI/Captain_BMO-12B
Model Size: 13B
Elo: 1283
Human feedbacks: 9088
1 hour ago by Nitral-AI πŸš€
Model: nitral-ai-captain-bmo-12b_v6
HuggingFace: Nitral-AI/Captain_BMO-12B
Model Size: 13B
Elo: 1280
Human feedbacks: 9082
1 hour ago by Nitral-AI πŸš€
Model: nitral-ai-captain-bmo-12b_v5
HuggingFace: Nitral-AI/Captain_BMO-12B
Model Size: 13B
Elo: 1281
Human feedbacks: 9027
1 hour ago by Jellywibble πŸš€
Model: chaiml-nemo-chai-5merge-ties_v34
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
1 hour ago by Jellywibble πŸš€
Model: chaiml-nemo-chai-5merge-ties_v33
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
1 hour ago by Jellywibble πŸš€
Model: chaiml-nemo-chai-5merge-ties_v32
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
1 hour ago by chai_backend_admin πŸš€
Model: function_lihum_2024-11-04
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
Loading...