Chaiverse calculates the ELO of LLMs via human feedback, obtained through blind selection testing using the Chai app. This involves gathering positive and negative upvotes from randomly selected models on our leaderboard, as experienced by end users. With over one million samples a day, we are able to build a ranking of LLMs - which for our use-case, seems to be the most accurate.

Submit your model now, and start collecting feedback on your model’s ranking in under five minutes! Feel free to join us on our Discord server to discuss the leaderboard and chat with other participants!

Submissions in the last 24 hours: 5
48 minutes ago by zonemercy πŸš€
Model: chaiml-lexens-v1-12b01e5r256_v13
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
2 hours ago by cloudyu πŸš€
Model: cloudyu-nemo-dpo-v20_v13
HuggingFace: cloudyu/Nemo-DPO-V20
Model Size: 13B
Elo: 1277
Human feedbacks: 13422
3 hours ago by cloudyu πŸš€
Model: cloudyu-nemo-dpo-v20_v12
HuggingFace: cloudyu/Nemo-DPO-V20
Model Size: 13B
Elo: 1272
Human feedbacks: 19418
8 hours ago by rirv938 πŸš€
Model: rirv938-testing-model-175_v4
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
8 hours ago by rirv938 πŸš€
Model: rirv938-testing-model-175_v3
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
8 hours ago by rirv938 πŸš€
Model: rirv938-testing-model-175_v2
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
9 hours ago by rirv938 πŸš€
Model: rirv938-testing-model-175_v1
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
9 hours ago by rirv938 πŸš€
Model: rirv938-testing-model-350_v1
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
9 hours ago by rirv938 πŸš€
Model: rirv938-testing-model-525_v1
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
9 hours ago by rirv938 πŸš€
Model: mistralai-mistral-nemo_9330_v217
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
Loading...