developer_uid: chai_backend_admin
submission_id: function_purek_2025-12-22
model_name: function_purek_2025-12-22
model_group:
status: torndown
timestamp: 2025-12-25T13:21:30+00:00
num_battles: 9644
num_wins: 5069
celo_rating: 1311.04
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_purek_2025-12-22
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-25
win_ratio: 0.5256117793446703
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
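The `formatter` entry above is a set of string templates. The following is a minimal sketch of how such templates could be combined into a single prompt. The function name `build_prompt`, the `turns` structure, and the assembly order (memory, prompt, chat turns, response header) are assumptions for illustration, not the platform's confirmed implementation. Truncation to `max_input_tokens` is not modeled.

```python
# Templates copied verbatim from the `formatter` entry in this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Assemble one prompt string (hypothetical helper).

    `turns` is a list of (speaker, message) pairs where speaker is
    "bot" or "user". The ordering of the parts is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Example with made-up conversation data.
example = build_prompt(
    memory="You are Ava.",
    prompt="A chat.",
    turns=[("user", "hi"), ("bot", "hello")],
    bot_name="Ava",
    user_name="Sam",
)
print(example)
```

Because every turn template ends in a newline and `stopping_words` is `['\n']`, generation under this layout would stop at the end of a single reply line.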
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 6.252958536148071s
Received healthy response to inference request in 9.197866916656494s
Received healthy response to inference request in 7.954401016235352s
Received healthy response to inference request in 14.688331365585327s
Received healthy response to inference request in 16.482106924057007s
Received healthy response to inference request in 12.401085376739502s
Received healthy response to inference request in 5.8788909912109375s
Received healthy response to inference request in 16.174576997756958s
Received healthy response to inference request in 7.76591944694519s
Received healthy response to inference request in 5.480428218841553s
10 requests
0 failed requests
5th percentile: 5.659736466407776
10th percentile: 5.839044713973999
20th percentile: 6.178145027160644
30th percentile: 7.312031173706054
40th percentile: 7.879008388519287
50th percentile: 8.576133966445923
60th percentile: 10.479154300689697
70th percentile: 13.08725917339325
80th percentile: 14.985580492019654
90th percentile: 16.205329990386964
95th percentile: 16.343718457221986
99th percentile: 16.454429230690003
mean time: 10.227656579017639
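The percentile and mean figures above are consistent with linear interpolation between the sorted latencies, which is the default method of `numpy.percentile`. The sketch below recomputes them from the ten response times logged for this batch. The helper name `percentile` is hypothetical, and whether the StressChecker uses NumPy internally is an assumption.

```python
def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    # Fractional position of the percentile within the sorted list.
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


# Response times in seconds, copied from the first StressChecker batch.
latencies = [
    6.252958536148071,
    9.197866916656494,
    7.954401016235352,
    14.688331365585327,
    16.482106924057007,
    12.401085376739502,
    5.8788909912109375,
    16.174576997756958,
    7.76591944694519,
    5.480428218841553,
]

for q in (5, 50, 90, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only ten samples, the upper percentiles are determined almost entirely by the two slowest requests, so they are best read as a rough indication of tail latency.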
%s, retrying in %s seconds...
Received healthy response to inference request in 2.9357004165649414s
Received healthy response to inference request in 3.8614273071289062s
Received healthy response to inference request in 5.285435438156128s
Received healthy response to inference request in 12.629685878753662s
Received healthy response to inference request in 16.46292519569397s
Received healthy response to inference request in 8.504152774810791s
Received healthy response to inference request in 6.240319013595581s
Received healthy response to inference request in 3.7575573921203613s
Received healthy response to inference request in 5.009456396102905s
Received healthy response to inference request in 5.696876764297485s
10 requests
0 failed requests
5th percentile: 3.3055360555648803
10th percentile: 3.675371694564819
20th percentile: 3.840653324127197
30th percentile: 4.665047669410705
40th percentile: 5.175043821334839
50th percentile: 5.491156101226807
60th percentile: 5.914253664016723
70th percentile: 6.9194691419601435
80th percentile: 9.329259395599365
90th percentile: 13.013009810447691
95th percentile: 14.737967503070827
99th percentile: 16.117933657169342
mean time: 7.038353657722473
Pipeline stage StressChecker completed in 179.88s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.63s
Shutdown handler de-registered
function_purek_2025-12-22 status is now deployed due to DeploymentManager action
function_purek_2025-12-22 status is now inactive due to auto deactivation removed underperforming models
function_purek_2025-12-22 status is now torndown due to DeploymentManager action