developer_uid: NischayDnk
submission_id: function_sehet_2025-08-19
model_name: function_sehet_2025-08-19
model_group:
status: torndown
timestamp: 2025-08-19T21:38:14+00:00
num_battles: 6481
num_wins: 3556
celo_rating: 1286.73
family_friendly_score: 0.5264
family_friendly_standard_error: 0.007061204429840564
submission_type: function
display_name: function_sehet_2025-08-19
is_internal_developer: False
ranking_group: single
us_pacific_date: 2025-08-19
win_ratio: 0.5486807591421077
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.0677707195281982s
Received healthy response to inference request in 5.403413534164429s
Received healthy response to inference request in 4.329643249511719s
Received healthy response to inference request in 3.0065269470214844s
Received healthy response to inference request in 1.7853729724884033s
5 requests
0 failed requests
5th percentile: 1.8418525218963624
10th percentile: 1.8983320713043212
20th percentile: 2.011291170120239
30th percentile: 2.2555219650268556
40th percentile: 2.63102445602417
50th percentile: 3.0065269470214844
60th percentile: 3.535773468017578
70th percentile: 4.065019989013671
80th percentile: 4.544397306442261
90th percentile: 4.973905420303344
95th percentile: 5.1886594772338865
99th percentile: 5.36046272277832
mean time: 3.3185454845428466
%s, retrying in %s seconds...
Received healthy response to inference request in 3.1750717163085938s
Received healthy response to inference request in 3.7547826766967773s
Received healthy response to inference request in 2.7024807929992676s
Received healthy response to inference request in 2.0171191692352295s
Received healthy response to inference request in 5.306879281997681s
5 requests
0 failed requests
5th percentile: 2.154191493988037
10th percentile: 2.2912638187408447
20th percentile: 2.56540846824646
30th percentile: 2.796998977661133
40th percentile: 2.9860353469848633
50th percentile: 3.1750717163085938
60th percentile: 3.406956100463867
70th percentile: 3.6388404846191404
80th percentile: 4.065201997756958
90th percentile: 4.686040639877319
95th percentile: 4.9964599609375
99th percentile: 5.244795417785644
mean time: 3.3912667274475097
Pipeline stage StressChecker completed in 35.72s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.26s
Shutdown handler de-registered
function_sehet_2025-08-19 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3085.81s
Shutdown handler de-registered
function_sehet_2025-08-19 status is now inactive due to auto deactivation removed underperforming models
function_sehet_2025-08-19 status is now torndown due to DeploymentManager action