developer_uid: chai_evaluation_service
submission_id: function_kojuk_2025-12-15
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-18T15:21:24+00:00
num_battles: 9224
num_wins: 4666
celo_rating: 1297.3
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-18
win_ratio: 0.5058542931483088
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
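
The formatter templates above can be read as a recipe for building the model input. The following is a minimal sketch of one plausible assembly order (memory, then prompt, then chat turns, then the response header). The order, the `build_prompt` helper, and the sample strings are assumptions for illustration, not taken from the service. The sketch does not truncate to `max_input_tokens`.

```python
# Templates copied from the formatter field of this record
# (truncate_by_message is omitted because this sketch does no truncation).
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues the line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    # Sample values are invented for illustration only.
    print(
        build_prompt(
            memory="Richard is a helpful guide.",
            prompt="Richard greets the user.",
            turns=[("user", "Hi"), ("bot", "Hello!")],
            bot_name="Richard",
            user_name="User",
        )
    )
```

Because the response template ends right after the bot name and `stopping_words` is `['\n']`, generation under this reading would stop at the end of a single chat line.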
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.574657917022705s
Received healthy response to inference request in 1.7227442264556885s
Received healthy response to inference request in 1.9832403659820557s
Received healthy response to inference request in 2.350916862487793s
Received healthy response to inference request in 1.9556269645690918s
Received healthy response to inference request in 2.4585609436035156s
Received healthy response to inference request in 2.4473273754119873s
Received healthy response to inference request in 1.9551169872283936s
Received healthy response to inference request in 3.026108503341675s
Received healthy response to inference request in 2.2400333881378174s
10 requests
0 failed requests
5th percentile: 1.8273119688034059
10th percentile: 1.931879711151123
20th percentile: 1.9555249691009522
30th percentile: 1.9749563455581665
40th percentile: 2.137316179275513
50th percentile: 2.295475125312805
60th percentile: 2.3894810676574707
70th percentile: 2.4506974458694457
80th percentile: 2.4817803382873533
90th percentile: 2.619802975654602
95th percentile: 2.8229557394981377
99th percentile: 2.9854779505729674
mean time: 2.2714333534240723
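
The percentile and mean figures above are consistent with linear interpolation between the sorted latency samples. The sketch below recomputes them from the ten response times logged by StressChecker. The `percentile` helper is a hypothetical stand-in written for this illustration, not the pipeline's own code.

```python
import math

# The ten latencies (seconds) reported by the StressChecker stage above.
LATENCIES = [
    2.574657917022705,
    1.7227442264556885,
    1.9832403659820557,
    2.350916862487793,
    1.9556269645690918,
    2.4585609436035156,
    2.4473273754119873,
    1.9551169872283936,
    3.026108503341675,
    2.2400333881378174,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation.

    Assumption: the logged figures use the common scheme where the rank
    is (n - 1) * q / 100 over the sorted samples.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


def mean(values):
    """Arithmetic mean of the samples."""
    return sum(values) / len(values)


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(LATENCIES, q)}")
    print(f"mean time: {mean(LATENCIES)}")
```

With only ten samples, the high percentiles are dominated by the single slowest request (about 3.03 s), so they are best treated as rough indicators.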
Pipeline stage StressChecker completed in 24.47s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.79s
Shutdown handler de-registered
function_kojuk_2025-12-15 status is now deployed due to DeploymentManager action
function_kojuk_2025-12-15 status is now inactive due to auto deactivation removed underperforming models
function_kojuk_2025-12-15 status is now torndown due to DeploymentManager action