developer_uid: chai_backend_admin
submission_id: function_nemul_2025-12-10
model_name: function_nemul_2025-12-10
model_group:
status: torndown
timestamp: 2025-12-13T17:35:47+00:00
num_battles: 5397
num_wins: 2799
celo_rating: 1313.24
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_nemul_2025-12-10
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-10
win_ratio: 0.5186214563646471
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.428959608078003s
Received healthy response to inference request in 1.8870937824249268s
Received healthy response to inference request in 3.993065357208252s
Received healthy response to inference request in 3.0335466861724854s
Received healthy response to inference request in 3.558535099029541s
Received healthy response to inference request in 2.8989810943603516s
Received healthy response to inference request in 2.0142533779144287s
Received healthy response to inference request in 2.7142717838287354s
Received healthy response to inference request in 3.0882022380828857s
Received healthy response to inference request in 2.7695348262786865s
10 requests
0 failed requests
5th percentile: 1.9443156003952027
10th percentile: 2.0015374183654786
20th percentile: 2.574268102645874
30th percentile: 2.752955913543701
40th percentile: 2.8472025871276854
50th percentile: 2.9662638902664185
60th percentile: 3.0554089069366457
70th percentile: 3.190429449081421
80th percentile: 3.4548747062683107
90th percentile: 3.601988124847412
95th percentile: 3.7975267410278315
99th percentile: 3.953957633972168
mean time: 2.9386443853378297
Pipeline stage StressChecker completed in 30.73s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_nemul_2025-12-10 status is now deployed due to DeploymentManager action
function_nemul_2025-12-10 status is now inactive due to auto deactivation removed underperforming models
function_nemul_2025-12-10 status is now torndown due to DeploymentManager action