developer_uid: chai_backend_admin
submission_id: function_setil_2025-11-20
model_name: function_setil_2025-11-20
model_group:
status: torndown
timestamp: 2025-11-23T18:26:04+00:00
num_battles: 7769
num_wins: 4059
celo_rating: 1300.79
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_setil_2025-11-20
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-11-20
win_ratio: 0.522461063199897
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
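The formatter entry above defines five string templates. The record does not show how the serving code combines them, so the following is only a minimal sketch, assuming they are concatenated in the order memory, prompt, conversation turns, response. The function name `build_prompt` and the `(speaker, message)` turn representation are hypothetical and do not come from the log; truncation to `max_input_tokens` is not modeled.

```python
# Templates copied from the formatter entry in this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs where speaker is
    "bot" or "user". This layout is an assumption, not the real pipeline.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues
    # the bot's next line; generation stops at the first "\n" stopping word.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    memory="Ava is a friendly guide.",
    prompt="A chat in a museum.",
    turns=[("user", "Hi!"), ("bot", "Welcome.")],
    bot_name="Ava",
    user_name="Sam",
)
```

Because `stopping_words` is `['\n']` and `max_output_tokens` is 64, each generation under this layout would be a single short line completing the final `{bot_name}:` prefix.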
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.2138259410858154s
Received healthy response to inference request in 1.7697031497955322s
Received healthy response to inference request in 1.3590121269226074s
Received healthy response to inference request in 1.6955187320709229s
Received healthy response to inference request in 1.4877898693084717s
5 requests
0 failed requests
5th percentile: 1.3847676753997802
10th percentile: 1.4105232238769532
20th percentile: 1.462034320831299
30th percentile: 1.5293356418609618
40th percentile: 1.6124271869659423
50th percentile: 1.6955187320709229
60th percentile: 1.7251924991607666
70th percentile: 1.7548662662506103
80th percentile: 1.858527708053589
90th percentile: 2.036176824569702
95th percentile: 2.1250013828277585
99th percentile: 2.1960610294342042
mean time: 1.70516996383667
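The percentile and mean figures above are consistent with linear interpolation between the sorted latency samples. The log does not say which routine StressChecker uses, so this is a sketch under that assumption; the helper name `percentile` is hypothetical.

```python
# The five healthy-response latencies (seconds) reported by StressChecker.
latencies = [
    2.2138259410858154,
    1.7697031497955322,
    1.3590121269226074,
    1.6955187320709229,
    1.4877898693084717,
]


def percentile(samples, q):
    """Return the q-th percentile (0-100) using linear interpolation.

    Assumed method: the fractional rank is (n - 1) * q / 100 over the
    sorted samples, interpolating between the two neighbouring values.
    """
    ordered = sorted(samples)
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


mean_time = sum(latencies) / len(latencies)
p5 = percentile(latencies, 5)
p50 = percentile(latencies, 50)
p90 = percentile(latencies, 90)
p99 = percentile(latencies, 99)
```

With only 5 requests, the upper percentiles are interpolated between the two slowest samples, so they describe this small sample rather than tail latency under load.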
Pipeline stage StressChecker completed in 10.84s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.92s
Shutdown handler de-registered
function_setil_2025-11-20 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
%s, retrying in %s seconds...
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 6034.74s
Shutdown handler de-registered
function_setil_2025-11-20 status is now torndown due to DeploymentManager action