developer_uid: chai_evaluation_service
submission_id: function_kosuk_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-13T21:47:10+00:00
num_battles: 6697
num_wins: 3228
celo_rating: 1256.33
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.4820068687472002
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.0181620121002197s
Received healthy response to inference request in 2.276101589202881s
Received healthy response to inference request in 4.8234357833862305s
Received healthy response to inference request in 2.8266818523406982s
Received healthy response to inference request in 2.697338342666626s
Received healthy response to inference request in 3.085237503051758s
Received healthy response to inference request in 3.5099751949310303s
Received healthy response to inference request in 7.736812114715576s
Received healthy response to inference request in 6.918901205062866s
Received healthy response to inference request in 2.855665445327759s
10 requests
0 failed requests
5th percentile: 2.465658128261566
10th percentile: 2.6552146673202515
20th percentile: 2.800813150405884
30th percentile: 2.8469703674316404
40th percentile: 2.9531633853912354
50th percentile: 3.0516997575759888
60th percentile: 3.2551325798034667
70th percentile: 3.90401337146759
80th percentile: 5.242528867721558
90th percentile: 7.000692296028137
95th percentile: 7.368752205371856
99th percentile: 7.663200132846832
mean time: 3.9748311042785645
Pipeline stage StressChecker completed in 41.06s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
function_kosuk_2025-12-13 status is now deployed due to DeploymentManager action
function_kosuk_2025-12-13 status is now inactive due to auto deactivation removed underperforming models