developer_uid: chai_backend_admin
submission_id: function_kosik_2025-12-20
model_name: function_kosik_2025-12-20
model_group:
status: torndown
timestamp: 2025-12-23T19:41:20+00:00
num_battles: 7631
num_wins: 3939
celo_rating: 1304.43
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_kosik_2025-12-20
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-23
win_ratio: 0.5161839863713799
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.093501567840576s
Received healthy response to inference request in 3.197498083114624s
Received healthy response to inference request in 5.045043230056763s
Received healthy response to inference request in 3.9268383979797363s
Received healthy response to inference request in 3.1035327911376953s
Received healthy response to inference request in 2.530362606048584s
Received healthy response to inference request in 2.0191941261291504s
Received healthy response to inference request in 4.5337419509887695s
Received healthy response to inference request in 2.480087995529175s
Received healthy response to inference request in 4.981004953384399s
10 requests
0 failed requests
5th percentile: 2.2265963673591616
10th percentile: 2.4339986085891723
20th percentile: 2.520307683944702
30th percentile: 2.9245598793029783
40th percentile: 3.0995203018188477
50th percentile: 3.1505154371261597
60th percentile: 3.4892342090606685
70th percentile: 4.108909463882446
80th percentile: 4.623194551467895
90th percentile: 4.987408781051636
95th percentile: 5.016226005554199
99th percentile: 5.03927978515625
mean time: 3.491080570220947
Pipeline stage StressChecker completed in 36.41s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.61s
Shutdown handler de-registered
function_kosik_2025-12-20 status is now deployed due to DeploymentManager action
function_kosik_2025-12-20 status is now inactive due to auto deactivation removed underperforming models
function_kosik_2025-12-20 status is now torndown due to DeploymentManager action