developer_uid: chai_backend_admin
submission_id: function_sojob_2025-12-09
model_name: function_sojob_2025-12-09
model_group:
status: protected
timestamp: 2025-12-09T22:27:31+00:00
num_battles: 5941
num_wins: 3008
celo_rating: 1368.56
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_sojob_2025-12-09
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-09
win_ratio: 0.5063120686753072
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.675825834274292s
Received healthy response to inference request in 3.282766819000244s
Received healthy response to inference request in 3.89040470123291s
Received healthy response to inference request in 2.7403695583343506s
Received healthy response to inference request in 3.3967323303222656s
Received healthy response to inference request in 2.5388023853302s
Received healthy response to inference request in 1.8442270755767822s
Received healthy response to inference request in 6.758312225341797s
Received healthy response to inference request in 2.883679151535034s
Received healthy response to inference request in 3.3248343467712402s
10 requests
0 failed requests
5th percentile: 1.7516063928604126
10th percentile: 1.8273869514465333
20th percentile: 2.3998873233795166
30th percentile: 2.679899406433105
40th percentile: 2.826355314254761
50th percentile: 3.083222985267639
60th percentile: 3.2995938301086425
70th percentile: 3.3464037418365478
80th percentile: 3.4954668045043946
90th percentile: 4.177195453643797
95th percentile: 5.4677538394927945
99th percentile: 6.500200548171997
mean time: 3.233595442771912
Pipeline stage StressChecker completed in 33.68s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.75s
Shutdown handler de-registered
function_sojob_2025-12-09 status is now deployed due to DeploymentManager action
function_sojob_2025-12-09 status is now inactive due to auto deactivation removed underperforming models
function_sojob_2025-12-09 status is now protected due to ABTestQueueItem