developer_uid: chai_backend_admin
submission_id: function_jesir_2025-04-30
model_name: function_jesir_2025-04-30
model_group:
status: torndown
timestamp: 2025-04-30T00:48:47+00:00
num_battles: 5856
num_wins: 3238
celo_rating: 1326.72
family_friendly_score: 0.5598000000000001
family_friendly_standard_error: 0.007020312813543283
submission_type: function
display_name: function_jesir_2025-04-30
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-04-29
win_ratio: 0.5529371584699454
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.136822938919067s
Received healthy response to inference request in 3.7541444301605225s
Received healthy response to inference request in 3.164318561553955s
Received healthy response to inference request in 1.92441725730896s
Received healthy response to inference request in 4.284467697143555s
5 requests
0 failed requests
5th percentile: 2.172397518157959
10th percentile: 2.420377779006958
20th percentile: 2.916338300704956
30th percentile: 3.2822837352752687
40th percentile: 3.5182140827178956
50th percentile: 3.7541444301605225
60th percentile: 3.9072158336639404
70th percentile: 4.060287237167358
80th percentile: 4.1663518905639645
90th percentile: 4.22540979385376
95th percentile: 4.254938745498658
99th percentile: 4.278561906814575
mean time: 3.4528341770172117
%s, retrying in %s seconds...
Received healthy response to inference request in 3.4019572734832764s
Received healthy response to inference request in 2.745692491531372s
Received healthy response to inference request in 2.347517728805542s
Received healthy response to inference request in 2.6198995113372803s
Received healthy response to inference request in 2.445258140563965s
5 requests
0 failed requests
5th percentile: 2.3670658111572265
10th percentile: 2.386613893508911
20th percentile: 2.4257100582122804
30th percentile: 2.480186414718628
40th percentile: 2.550042963027954
50th percentile: 2.6198995113372803
60th percentile: 2.670216703414917
70th percentile: 2.7205338954925535
80th percentile: 2.876945447921753
90th percentile: 3.1394513607025147
95th percentile: 3.2707043170928953
99th percentile: 3.3757066822052
mean time: 2.712065029144287
Pipeline stage StressChecker completed in 33.49s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.65s
Shutdown handler de-registered
function_jesir_2025-04-30 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3177.25s
Shutdown handler de-registered
function_jesir_2025-04-30 status is now inactive due to auto deactivation removed underperforming models
function_jesir_2025-04-30 status is now torndown due to DeploymentManager action