developer_uid: chai_backend_admin
submission_id: function_bujer_2026-02-03
model_name: 0203
model_group:
status: torndown
timestamp: 2026-02-06T13:41:43+00:00
num_battles: 10505
num_wins: 5980
celo_rating: 1353.22
family_friendly_score: 0.5282
family_friendly_standard_error: 0.007059812462098409
submission_type: function
display_name: 0203
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-02-03
win_ratio: 0.5692527367920038
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': True}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 6.018807649612427s
Received healthy response to inference request in 6.206420183181763s
Received healthy response to inference request in 5.266695261001587s
Received healthy response to inference request in 6.278190612792969s
Received healthy response to inference request in 5.043078660964966s
Received healthy response to inference request in 8.497274160385132s
Received healthy response to inference request in 3.61513614654541s
Received healthy response to inference request in 6.188424348831177s
Received healthy response to inference request in 3.8124818801879883s
Received healthy response to inference request in 5.8992438316345215s
10 requests
0 failed requests
5th percentile: 3.7039417266845702
10th percentile: 3.7927473068237303
20th percentile: 4.796959304809571
30th percentile: 5.1996102809906
40th percentile: 5.646224403381348
50th percentile: 5.959025740623474
60th percentile: 6.086654329299927
70th percentile: 6.1938230991363525
80th percentile: 6.2207742691040036
90th percentile: 6.5000989675521845
95th percentile: 7.498686563968656
99th percentile: 8.297556641101837
mean time: 5.682575273513794
Pipeline stage StressChecker completed in 58.69s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.60s
Shutdown handler de-registered
function_bujer_2026-02-03 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 2617.53s
Shutdown handler de-registered
function_bujer_2026-02-03 status is now inactive due to auto deactivation removed underperforming models
function_bujer_2026-02-03 status is now torndown due to DeploymentManager action