Chaiverse calculates an Elo rating for each LLM from human feedback, obtained through blind selection testing in the Chai app. End users are served randomly selected models from our leaderboard and give positive or negative votes based on their experience. With over one million samples a day, we are able to build a ranking of LLMs that, for our use case, appears to be the most accurate.
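To make the idea of an Elo rating concrete, here is a minimal sketch of the textbook pairwise Elo update. It is not Chaiverse's actual computation, which is not described on this page. It assumes user feedback has already been reduced to head-to-head outcomes between two models, and the function names, the K-factor of 32, and the 400-point scale are the conventional defaults, chosen here only for illustration.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that model A is preferred over model B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a: float, rating_b: float, score_a: float, k: float = 32.0):
    """Return updated (rating_a, rating_b) after one comparison.

    score_a is 1.0 if A was preferred, 0.0 if B was preferred, 0.5 for a tie.
    k (assumed value) controls how far a single result moves the ratings.
    """
    exp_a = expected_score(rating_a, rating_b)
    exp_b = 1.0 - exp_a
    new_a = rating_a + k * (score_a - exp_a)
    new_b = rating_b + k * ((1.0 - score_a) - exp_b)
    return new_a, new_b


if __name__ == "__main__":
    # Two equally rated models: the winner gains what the loser drops.
    print(update_elo(1000.0, 1000.0, score_a=1.0))
    # A higher-rated model is expected to be preferred more than half the time.
    print(expected_score(1334.0, 1127.0))
```

In this scheme an upset (a lower-rated model being preferred) moves both ratings further than an expected result does, which is why a large stream of blind comparisons can settle into a stable ranking.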

Submit your model now, and start collecting feedback on your model’s ranking in under five minutes! Feel free to join us on our Discord server to discuss the leaderboard and chat with other participants!

Submissions in the last 24 hours: 9
1 hour ago by rirv938 🚀
Model: chaiml-mistral-24b-2048_72304_v2
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
1 hour ago by rirv938 🚀
Model: chaiml-mistral-24b-2048-1_404_v2
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
1 hour ago by rirv938 🚀
Model: chaiml-mistral-24b-2048_37531_v2
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
1 hour ago by rirv938 🚀
Model: chaiml-mistral-24b-2048_90555_v2
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
1 hour ago by rirv938 🚀
Model: chaiml-mistral-24b-204_54327_v11
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
1 hour ago by acehao-chai 🚀
Model: chaiml-grpo-qwen25-3b-l_65726_v1
HuggingFace: ChaiML/grpo-qwen25-3b-lora-opus14k-chai-rm-max64-step-1591
Model Size: 3B
Elo: 1127
Human feedbacks: 10728
2 hours ago by acehao-chai 🚀
Model: chaiml-grpo-q3b-merged-_26723_v3
HuggingFace: ChaiML/grpo-q3b-merged-nemo32b-step-900
Model Size: 3B
Elo: 990
Human feedbacks: 11564
2 hours ago by acehao-chai 🚀
Model: chaiml-grpo-q235b-kimid_83709_v1
HuggingFace: ChaiML/grpo-q235b-kimid-v5a-merged-chai-rm-step-450
Model Size: 19B
Elo: 1334
Human feedbacks: 4331
2 hours ago by acehao-chai 🚀
Model: chaiml-grpo-q3b-merged-_26723_v2
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0
11 hours ago by zonemercy 🚀
Model: chaiml-muster-v3a-kakit-_3784_v1
HuggingFace: N/A
Model Size: N/A
Elo: N/A
Human feedbacks: 0