developer_uid: chai_backend_admin
submission_id: function_dejom_2025-07-10
model_name: function_dejom_2025-07-10
model_group:
status: torndown
timestamp: 2025-07-10T21:43:28+00:00
num_battles: 9079
num_wins: 5043
celo_rating: 1326.24
family_friendly_score: 0.49360000000000004
family_friendly_standard_error: 0.007070488526261817
submission_type: function
display_name: function_dejom_2025-07-10
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-07-10
win_ratio: 0.5554576495208723
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.8937482833862305s
Received healthy response to inference request in 2.690303087234497s
Received healthy response to inference request in 3.1431784629821777s
Received healthy response to inference request in 3.63517165184021s
5 requests
1 failed requests
5th percentile: 2.7309921264648436
10th percentile: 2.7716811656951905
20th percentile: 2.853059244155884
30th percentile: 2.9436343193054197
40th percentile: 3.0434063911437987
50th percentile: 3.1431784629821777
60th percentile: 3.3399757385253905
70th percentile: 3.5367730140686033
80th percentile: 6.933313894271853
90th percentile: 13.529598379135134
95th percentile: 16.827740621566768
99th percentile: 19.466254415512083
mean time: 6.4976568698883055
%s, retrying in %s seconds...
Received healthy response to inference request in 2.617884397506714s
Received healthy response to inference request in 3.061378240585327s
Received healthy response to inference request in 3.211765766143799s
Received healthy response to inference request in 2.4186999797821045s
Received healthy response to inference request in 3.0630991458892822s
5 requests
0 failed requests
5th percentile: 2.4585368633270264
10th percentile: 2.4983737468719482
20th percentile: 2.578047513961792
30th percentile: 2.7065831661224364
40th percentile: 2.883980703353882
50th percentile: 3.061378240585327
60th percentile: 3.062066602706909
70th percentile: 3.0627549648284913
80th percentile: 3.0928324699401855
90th percentile: 3.1522991180419924
95th percentile: 3.1820324420928956
99th percentile: 3.2058191013336184
mean time: 2.8745655059814452
Pipeline stage StressChecker completed in 50.00s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.68s
Shutdown handler de-registered
function_dejom_2025-07-10 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 4429.85s
Shutdown handler de-registered
function_dejom_2025-07-10 status is now inactive due to auto deactivation removed underperforming models
function_dejom_2025-07-10 status is now torndown due to DeploymentManager action