developer_uid: chai_backend_admin
submission_id: function_tijib_2025-12-09
model_name: function_tijib_2025-12-09
model_group:
status: torndown
timestamp: 2025-12-13T14:21:44+00:00
num_battles: 5935
num_wins: 3162
celo_rating: 1305.2
family_friendly_score: 0.5098
family_friendly_standard_error: 0.007069709470692555
submission_type: function
display_name: function_tijib_2025-12-09
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-10
win_ratio: 0.5327716933445661
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.415475606918335s
Received healthy response to inference request in 2.845320701599121s
Received healthy response to inference request in 2.3566384315490723s
Received healthy response to inference request in 1.038649559020996s
Received healthy response to inference request in 1.339653730392456s
Received healthy response to inference request in 2.041642427444458s
Received healthy response to inference request in 0.8327829837799072s
Received healthy response to inference request in 1.4431910514831543s
Received healthy response to inference request in 0.9299321174621582s
Received healthy response to inference request in 0.5521364212036133s
10 requests
0 failed requests
5th percentile: 0.6784273743629455
10th percentile: 0.8047183275222778
20th percentile: 0.910502290725708
30th percentile: 1.0060343265533447
40th percentile: 1.219252061843872
50th percentile: 1.3914223909378052
60th percentile: 1.6825716018676755
70th percentile: 2.136141228675842
80th percentile: 2.3684058666229246
90th percentile: 2.4584601163864135
95th percentile: 2.651890408992767
99th percentile: 2.8066346430778504
mean time: 1.579542303085327
Pipeline stage StressChecker completed in 19.09s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.70s
Shutdown handler de-registered
function_tijib_2025-12-09 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 7372.14s
Shutdown handler de-registered
function_tijib_2025-12-09 status is now inactive due to auto deactivation removed underperforming models
function_tijib_2025-12-09 status is now torndown due to DeploymentManager action