developer_uid: chai_evaluation_service
submission_id: function_karos_2025-12-15
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-15T06:46:43+00:00
num_battles: 6372
num_wins: 3171
celo_rating: 1184.44
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.4976459510357815
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.229918956756592s
Received healthy response to inference request in 2.444776773452759s
Received healthy response to inference request in 2.24763560295105s
Received healthy response to inference request in 3.005843162536621s
Received healthy response to inference request in 2.8580219745635986s
Received healthy response to inference request in 2.2219886779785156s
Received healthy response to inference request in 1.8148365020751953s
Received healthy response to inference request in 3.261601686477661s
Received healthy response to inference request in 1.852410078048706s
Received healthy response to inference request in 1.879704475402832s
10 requests
0 failed requests
5th percentile: 1.8317446112632751
10th percentile: 1.848652720451355
20th percentile: 1.8742455959320068
30th percentile: 2.1193034172058103
40th percentile: 2.226746845245361
50th percentile: 2.238777279853821
60th percentile: 2.3264920711517334
70th percentile: 2.5687503337860105
80th percentile: 2.887586212158203
90th percentile: 3.031419014930725
95th percentile: 3.146510350704193
99th percentile: 3.2385834193229677
mean time: 2.381673789024353
Pipeline stage StressChecker completed in 25.71s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.62s
Shutdown handler de-registered
function_karos_2025-12-15 status is now deployed due to DeploymentManager action
function_karos_2025-12-15 status is now inactive due to auto deactivation removed underperforming models