developer_uid: chai_backend_admin
submission_id: function_jutas_2025-05-13
model_name: function_jutas_2025-05-13
model_group:
status: torndown
timestamp: 2025-05-13T00:35:07+00:00
num_battles: 5512
num_wins: 2902
celo_rating: 1298.85
family_friendly_score: 0.5378000000000001
family_friendly_standard_error: 0.007050832007642786
submission_type: function
display_name: function_jutas_2025-05-13
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-05-12
win_ratio: 0.5264876632801161
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 15.850011587142944s
Received healthy response to inference request in 3.596585273742676s
Received healthy response to inference request in 3.310417890548706s
Received healthy response to inference request in 3.258223295211792s
Received healthy response to inference request in 2.910555839538574s
5 requests
0 failed requests
5th percentile: 2.9800893306732177
10th percentile: 3.049622821807861
20th percentile: 3.1886898040771485
30th percentile: 3.268662214279175
40th percentile: 3.2895400524139404
50th percentile: 3.310417890548706
60th percentile: 3.4248848438262938
70th percentile: 3.539351797103882
80th percentile: 6.047270536422731
90th percentile: 10.948641061782837
95th percentile: 13.399326324462889
99th percentile: 15.359874534606933
mean time: 5.785158777236939
Pipeline stage StressChecker completed in 29.99s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.89s
Shutdown handler de-registered
function_jutas_2025-05-13 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2791.42s
Shutdown handler de-registered
function_jutas_2025-05-13 status is now inactive due to auto deactivation removed underperforming models
function_jutas_2025-05-13 status is now torndown due to DeploymentManager action