developer_uid: chai_backend_admin
submission_id: function_popuf_2025-12-19
model_name: function_popuf_2025-12-19
model_group:
status: torndown
timestamp: 2025-12-22T22:41:25+00:00
num_battles: 6175
num_wins: 3191
celo_rating: 1304.96
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_popuf_2025-12-19
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-22
win_ratio: 0.5167611336032388
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.3850858211517334s
Received healthy response to inference request in 5.863019704818726s
Received healthy response to inference request in 2.0047709941864014s
Received healthy response to inference request in 4.141263723373413s
Received healthy response to inference request in 4.314989328384399s
Received healthy response to inference request in 4.17407488822937s
Received healthy response to inference request in 4.47018837928772s
Received healthy response to inference request in 1.4208259582519531s
Received healthy response to inference request in 2.6911699771881104s
Received healthy response to inference request in 4.564149379730225s
10 requests
0 failed requests
5th percentile: 1.6836012244224547
10th percentile: 1.9463764905929566
20th percentile: 2.5538901805877687
30th percentile: 3.176911067962646
40th percentile: 3.838792562484741
50th percentile: 4.157669305801392
60th percentile: 4.230440664291382
70th percentile: 4.361549043655396
80th percentile: 4.4889805793762205
90th percentile: 4.6940364122390745
95th percentile: 5.278528058528899
99th percentile: 5.746121375560761
mean time: 3.702953815460205
Pipeline stage StressChecker completed in 39.04s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.12s
Shutdown handler de-registered
function_popuf_2025-12-19 status is now deployed due to DeploymentManager action
function_popuf_2025-12-19 status is now inactive due to auto deactivation removed underperforming models
function_popuf_2025-12-19 status is now torndown due to DeploymentManager action