developer_uid: chai_backend_admin
submission_id: function_nobil_2025-12-17
model_name: function_nobil_2025-12-17
model_group:
status: torndown
timestamp: 2025-12-20T16:21:20+00:00
num_battles: 7985
num_wins: 4045
celo_rating: 1297.84
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_nobil_2025-12-17
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.506574827802129
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 8.68096113204956s
Received healthy response to inference request in 13.137025833129883s
Received healthy response to inference request in 8.7877836227417s
Received healthy response to inference request in 8.773625373840332s
Received healthy response to inference request in 18.993679523468018s
Received healthy response to inference request in 5.908945083618164s
Received healthy response to inference request in 2.7584214210510254s
Received healthy response to inference request in 3.416942596435547s
Received healthy response to inference request in 9.004038572311401s
Received healthy response to inference request in 2.500556230545044s
10 requests
0 failed requests
5th percentile: 2.6165955662727356
10th percentile: 2.7326349020004272
20th percentile: 3.2852383613586427
30th percentile: 5.161344337463378
40th percentile: 7.572154712677002
50th percentile: 8.727293252944946
60th percentile: 8.77928867340088
70th percentile: 8.85266010761261
80th percentile: 9.830636024475098
90th percentile: 13.722691202163695
95th percentile: 16.358185362815853
99th percentile: 18.466580691337587
mean time: 8.196197938919067
%s, retrying in %s seconds...
Received healthy response to inference request in 10.171986103057861s
Received healthy response to inference request in 3.6407554149627686s
Received healthy response to inference request in 6.810067176818848s
Received healthy response to inference request in 2.6272363662719727s
Received healthy response to inference request in 8.024078130722046s
Received healthy response to inference request in 1.8265409469604492s
Received healthy response to inference request in 2.8126680850982666s
Received healthy response to inference request in 10.248764038085938s
Received healthy response to inference request in 4.419685363769531s
Received healthy response to inference request in 3.8740599155426025s
10 requests
0 failed requests
5th percentile: 2.1868538856506348
10th percentile: 2.5471668243408203
20th percentile: 2.7755817413330077
30th percentile: 3.392329216003418
40th percentile: 3.780738115310669
50th percentile: 4.146872639656067
60th percentile: 5.375838088989257
70th percentile: 7.174270462989807
80th percentile: 8.45365972518921
90th percentile: 10.179663896560669
95th percentile: 10.214213967323303
99th percentile: 10.24185402393341
mean time: 5.445584154129028
Pipeline stage StressChecker completed in 143.76s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.63s
Shutdown handler de-registered
function_nobil_2025-12-17 status is now deployed due to DeploymentManager action
function_nobil_2025-12-17 status is now inactive due to auto deactivation removed underperforming models
function_nobil_2025-12-17 status is now torndown due to DeploymentManager action