developer_uid: chai_backend_admin
submission_id: function_fikob_2025-04-09
model_name: function_fikob_2025-04-09
model_group:
status: torndown
timestamp: 2025-04-09T03:35:27+00:00
num_battles: 6051
num_wins: 3016
celo_rating: 1275.83
family_friendly_score: 0.5542
family_friendly_standard_error: 0.007029400543431851
submission_type: function
display_name: function_fikob_2025-04-09
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-04-08
win_ratio: 0.49843001156833583
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.770713806152344s
Received healthy response to inference request in 3.2305614948272705s
Received healthy response to inference request in 4.760596752166748s
Received healthy response to inference request in 2.244065046310425s
Received healthy response to inference request in 2.810011148452759s
5 requests
0 failed requests
5th percentile: 2.3572542667388916
10th percentile: 2.4704434871673584
20th percentile: 2.696821928024292
30th percentile: 2.894121217727661
40th percentile: 3.0623413562774657
50th percentile: 3.2305614948272705
60th percentile: 3.8425755977630613
70th percentile: 4.454589700698852
80th percentile: 4.762620162963867
90th percentile: 4.766666984558105
95th percentile: 4.768690395355224
99th percentile: 4.77030912399292
mean time: 3.563189649581909
%s, retrying in %s seconds...
Received healthy response to inference request in 2.981285572052002s
Received healthy response to inference request in 3.3208746910095215s
Received healthy response to inference request in 4.377756357192993s
Received healthy response to inference request in 3.4240334033966064s
Received healthy response to inference request in 3.2451162338256836s
5 requests
0 failed requests
5th percentile: 3.034051704406738
10th percentile: 3.0868178367614747
20th percentile: 3.1923501014709474
30th percentile: 3.2602679252624513
40th percentile: 3.2905713081359864
50th percentile: 3.3208746910095215
60th percentile: 3.3621381759643554
70th percentile: 3.4034016609191893
80th percentile: 3.614777994155884
90th percentile: 3.9962671756744386
95th percentile: 4.187011766433716
99th percentile: 4.339607439041138
mean time: 3.4698132514953612
Pipeline stage StressChecker completed in 37.24s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.66s
Shutdown handler de-registered
function_fikob_2025-04-09 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2937.38s
Shutdown handler de-registered
function_fikob_2025-04-09 status is now inactive due to auto deactivation removed underperforming models
function_fikob_2025-04-09 status is now torndown due to DeploymentManager action