developer_uid: NischayDnk
submission_id: function_delar_2025-08-19
model_name: function_delar_2025-08-19
model_group:
status: torndown
timestamp: 2025-08-19T05:03:35+00:00
num_battles: 6431
num_wins: 3480
celo_rating: 1273.9
family_friendly_score: 0.5716
family_friendly_standard_error: 0.006998191766449387
submission_type: function
display_name: function_delar_2025-08-19
is_internal_developer: False
ranking_group: single
us_pacific_date: 2025-08-18
win_ratio: 0.5411289068574094
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 1.3545503616333008s
Received healthy response to inference request in 1.642080545425415s
Received healthy response to inference request in 1.3998675346374512s
Received healthy response to inference request in 1.3794114589691162s
5 requests
1 failed requests
5th percentile: 1.3595225811004639
10th percentile: 1.364494800567627
20th percentile: 1.3744392395019531
30th percentile: 1.3835026741027832
40th percentile: 1.3916851043701173
50th percentile: 1.3998675346374512
60th percentile: 1.4967527389526367
70th percentile: 1.5936379432678223
80th percentile: 5.335176753997806
90th percentile: 12.72136917114258
95th percentile: 16.41446537971496
99th percentile: 19.368942346572876
mean time: 5.176694297790528
%s, retrying in %s seconds...
Received healthy response to inference request in 1.7071161270141602s
Received healthy response to inference request in 1.4105207920074463s
Received healthy response to inference request in 1.6663463115692139s
Received healthy response to inference request in 1.4997689723968506s
Received healthy response to inference request in 1.7985923290252686s
5 requests
0 failed requests
5th percentile: 1.428370428085327
10th percentile: 1.446220064163208
20th percentile: 1.4819193363189698
30th percentile: 1.5330844402313233
40th percentile: 1.5997153759002685
50th percentile: 1.6663463115692139
60th percentile: 1.6826542377471925
70th percentile: 1.6989621639251709
80th percentile: 1.7254113674163818
90th percentile: 1.7620018482208253
95th percentile: 1.780297088623047
99th percentile: 1.7949332809448242
mean time: 1.616468906402588
Pipeline stage StressChecker completed in 36.84s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.69s
Shutdown handler de-registered
function_delar_2025-08-19 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2092.81s
Shutdown handler de-registered
function_delar_2025-08-19 status is now inactive due to auto deactivation removed underperforming models
function_delar_2025-08-19 status is now torndown due to DeploymentManager action