developer_uid: chai_backend_admin
submission_id: function_fujaf_2025-05-27
model_name: function_fujaf_2025-05-27
model_group:
status: torndown
timestamp: 2025-05-27T23:25:49+00:00
num_battles: 5863
num_wins: 3016
celo_rating: 1293.22
family_friendly_score: 0.5267999999999999
family_friendly_standard_error: 0.007060903058391327
submission_type: function
display_name: function_fujaf_2025-05-27
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-05-27
win_ratio: 0.5144124168514412
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.9362633228302s
Received healthy response to inference request in 3.9724888801574707s
Received healthy response to inference request in 4.192420482635498s
Received healthy response to inference request in 3.084212303161621s
Received healthy response to inference request in 2.6193041801452637s
5 requests
0 failed requests
5th percentile: 2.682696008682251
10th percentile: 2.746087837219238
20th percentile: 2.872871494293213
30th percentile: 2.9658531188964843
40th percentile: 3.025032711029053
50th percentile: 3.084212303161621
60th percentile: 3.4395229339599607
70th percentile: 3.7948335647583007
80th percentile: 4.016475200653076
90th percentile: 4.104447841644287
95th percentile: 4.148434162139893
99th percentile: 4.183623218536377
mean time: 3.3609378337860107
%s, retrying in %s seconds...
Received healthy response to inference request in 1.973616600036621s
Received healthy response to inference request in 3.21187686920166s
Received healthy response to inference request in 2.5698792934417725s
Received healthy response to inference request in 4.166499614715576s
Received healthy response to inference request in 6.48293399810791s
5 requests
0 failed requests
5th percentile: 2.0928691387176515
10th percentile: 2.212121677398682
20th percentile: 2.450626754760742
30th percentile: 2.69827880859375
40th percentile: 2.9550778388977053
50th percentile: 3.21187686920166
60th percentile: 3.5937259674072264
70th percentile: 3.9755750656127926
80th percentile: 4.629786491394043
90th percentile: 5.556360244750977
95th percentile: 6.019647121429443
99th percentile: 6.390276622772217
mean time: 3.680961275100708
%s, retrying in %s seconds...
Received healthy response to inference request in 2.6964290142059326s
Received healthy response to inference request in 3.4535887241363525s
Received healthy response to inference request in 2.291240930557251s
Received healthy response to inference request in 3.343393325805664s
Received healthy response to inference request in 2.528837203979492s
5 requests
0 failed requests
5th percentile: 2.338760185241699
10th percentile: 2.3862794399261475
20th percentile: 2.481317949295044
30th percentile: 2.56235556602478
40th percentile: 2.6293922901153564
50th percentile: 2.6964290142059326
60th percentile: 2.955214738845825
70th percentile: 3.2140004634857178
80th percentile: 3.3654324054718017
90th percentile: 3.4095105648040773
95th percentile: 3.431549644470215
99th percentile: 3.449180908203125
mean time: 2.8626978397369385
Pipeline stage StressChecker completed in 52.98s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.64s
Shutdown handler de-registered
function_fujaf_2025-05-27 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2997.10s
Shutdown handler de-registered
function_fujaf_2025-05-27 status is now inactive due to auto deactivation removed underperforming models
function_fujaf_2025-05-27 status is now torndown due to DeploymentManager action