developer_uid: chai_backend_admin
submission_id: function_tofes_2026-03-15
model_name: function_tofes_2026-03-15
model_group:
status: inactive
timestamp: 2026-03-16T09:38:28+00:00
num_battles: 10189
num_wins: 5818
celo_rating: 1342.57
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_tofes_2026-03-15
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-03-16
win_ratio: 0.5710079497497301
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.9871368408203125s
Received healthy response to inference request in 3.1817688941955566s
Received healthy response to inference request in 2.7629127502441406s
Received healthy response to inference request in 2.275857925415039s
Received healthy response to inference request in 3.690279006958008s
Received healthy response to inference request in 2.7324345111846924s
Received healthy response to inference request in 2.4325690269470215s
Received healthy response to inference request in 3.374640464782715s
Received healthy response to inference request in 6.231062650680542s
Received healthy response to inference request in 2.9726767539978027s
10 requests
0 failed requests
5th percentile: 2.3463779211044313
10th percentile: 2.416897916793823
20th percentile: 2.672461414337158
30th percentile: 2.753769278526306
40th percentile: 2.888771152496338
50th percentile: 3.0772228240966797
60th percentile: 3.2589175224304197
70th percentile: 3.4693320274353026
80th percentile: 3.949650573730469
90th percentile: 5.111529421806335
95th percentile: 5.6712960362434375
99th percentile: 6.1191093277931214
mean time: 3.464133882522583
Pipeline stage StressChecker completed in 80.76s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 8.32s
Shutdown handler de-registered
function_tofes_2026-03-15 status is now deployed due to DeploymentManager action
function_tofes_2026-03-15 status is now inactive due to auto deactivation removed underperforming models