developer_uid: chai_backend_admin
submission_id: function_metet_2025-12-05
model_name: function_metet_2025-12-05
model_group:
status: torndown
timestamp: 2025-12-12T18:29:39+00:00
num_battles: 5942
num_wins: 3494
celo_rating: 1350.34
family_friendly_score: 0.5720000000000001
family_friendly_standard_error: 0.00699737093485832
submission_type: function
display_name: function_metet_2025-12-05
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-05
win_ratio: 0.5880175025244025
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.845881223678589s
Received healthy response to inference request in 3.2218875885009766s
Received healthy response to inference request in 3.5027682781219482s
Received healthy response to inference request in 3.24043869972229s
Received healthy response to inference request in 3.338982582092285s
Received healthy response to inference request in 2.463469982147217s
Received healthy response to inference request in 0.3709712028503418s
Received healthy response to inference request in 0.34862208366394043s
Received healthy response to inference request in 0.34816956520080566s
Received healthy response to inference request in 0.35267114639282227s
10 requests
0 failed requests
5th percentile: 0.34837319850921633
10th percentile: 0.34857683181762694
20th percentile: 0.3518613338470459
30th percentile: 0.3654811859130859
40th percentile: 1.626470470428467
50th percentile: 2.8426787853240967
60th percentile: 3.229308032989502
70th percentile: 3.2700018644332887
80th percentile: 3.371739721298218
90th percentile: 3.5370795726776123
95th percentile: 3.6914803981781
99th percentile: 3.8150010585784915
mean time: 2.1033862352371218
Pipeline stage StressChecker completed in 22.40s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.61s
Shutdown handler de-registered
function_metet_2025-12-05 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Generating Leaderboard row for %s
Generated Leaderboard row for %s
Pipeline stage OfflineFamilyFriendlyScorer completed in 3939.33s
Shutdown handler de-registered
function_metet_2025-12-05 status is now inactive due to auto deactivation removed underperforming models
function_metet_2025-12-05 status is now torndown due to DeploymentManager action