developer_uid: chai_backend_admin
submission_id: function_dulat_2025-05-12
model_name: function_dulat_2025-05-12
model_group:
status: torndown
timestamp: 2025-05-12T20:36:01+00:00
num_battles: 12172
num_wins: 6270
celo_rating: 1294.72
family_friendly_score: 0.56
family_friendly_standard_error: 0.007019971509913698
submission_type: function
display_name: function_dulat_2025-05-12
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-05-12
win_ratio: 0.5151166611896155
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.1482255458831787s
Received healthy response to inference request in 3.39176344871521s
Received healthy response to inference request in 3.1910996437072754s
Received healthy response to inference request in 2.9513256549835205s
Received healthy response to inference request in 3.739128589630127s
5 requests
0 failed requests
5th percentile: 2.990705633163452
10th percentile: 3.0300856113433836
20th percentile: 3.108845567703247
30th percentile: 3.156800365447998
40th percentile: 3.1739500045776365
50th percentile: 3.1910996437072754
60th percentile: 3.271365165710449
70th percentile: 3.351630687713623
80th percentile: 3.4612364768981934
90th percentile: 3.60018253326416
95th percentile: 3.6696555614471436
99th percentile: 3.7252339839935305
mean time: 3.284308576583862
Pipeline stage StressChecker completed in 17.71s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.65s
Shutdown handler de-registered
function_dulat_2025-05-12 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4693.35s
Shutdown handler de-registered
function_dulat_2025-05-12 status is now inactive due to auto deactivation removed underperforming models
function_dulat_2025-05-12 status is now torndown due to DeploymentManager action