developer_uid: chai_backend_admin
submission_id: function_tuhot_2025-11-20
model_name: function_tuhot_2025-11-20
model_group:
status: torndown
timestamp: 2025-11-23T16:56:13+00:00
num_battles: 6447
num_wins: 3352
celo_rating: 1304.31
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_tuhot_2025-11-20
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-11-20
win_ratio: 0.5199317512021095
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.7125823497772217s
Received healthy response to inference request in 2.219867467880249s
Received healthy response to inference request in 2.230280637741089s
Received healthy response to inference request in 1.089428424835205s
Received healthy response to inference request in 2.2484216690063477s
5 requests
0 failed requests
5th percentile: 1.2140592098236085
10th percentile: 1.3386899948120117
20th percentile: 1.5879515647888183
30th percentile: 1.814039373397827
40th percentile: 2.0169534206390383
50th percentile: 2.219867467880249
60th percentile: 2.224032735824585
70th percentile: 2.228198003768921
80th percentile: 2.233908843994141
90th percentile: 2.2411652565002442
95th percentile: 2.2447934627532957
99th percentile: 2.247696027755737
mean time: 1.9001161098480224
Pipeline stage StressChecker completed in 10.78s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.67s
Shutdown handler de-registered
function_tuhot_2025-11-20 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2727.16s
Shutdown handler de-registered
function_tuhot_2025-11-20 status is now torndown due to DeploymentManager action