developer_uid: chai_backend_admin
submission_id: function_ropir_2025-11-25
model_name: function_ropir_2025-11-25
model_group:
status: torndown
timestamp: 2025-11-25T16:57:42+00:00
num_battles: 9664
num_wins: 4895
celo_rating: 1290.68
family_friendly_score: 0.514
family_friendly_standard_error: 0.00706829540978587
submission_type: function
display_name: function_ropir_2025-11-25
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-11-25
win_ratio: 0.5065190397350994
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 4.078242063522339s
Received healthy response to inference request in 2.9641458988189697s
Received healthy response to inference request in 2.8122153282165527s
Received healthy response to inference request in 5.943972110748291s
Received healthy response to inference request in 2.4779608249664307s
Received healthy response to inference request in 3.2479543685913086s
Received healthy response to inference request in 2.4936039447784424s
Received healthy response to inference request in 4.455578327178955s
Received healthy response to inference request in 4.443114519119263s
10 requests
1 failed requests
5th percentile: 2.485000228881836
10th percentile: 2.4920396327972414
20th percentile: 2.7484930515289308
30th percentile: 2.9185667276382445
40th percentile: 3.134430980682373
50th percentile: 3.6630982160568237
60th percentile: 4.224191045761108
70th percentile: 4.446853661537171
80th percentile: 4.753257083892823
90th percentile: 7.3607439756393385
95th percentile: 13.736217367649063
99th percentile: 18.836596081256868
mean time: 5.302847814559937
%s, retrying in %s seconds...
Received healthy response to inference request in 3.6627488136291504s
Received healthy response to inference request in 4.131308078765869s
Received healthy response to inference request in 2.47577166557312s
Received healthy response to inference request in 1.785215139389038s
Received healthy response to inference request in 4.345889568328857s
Received healthy response to inference request in 3.443761110305786s
Received healthy response to inference request in 3.8525898456573486s
Received healthy response to inference request in 2.1382415294647217s
Received healthy response to inference request in 3.015592575073242s
Received healthy response to inference request in 3.246636390686035s
10 requests
0 failed requests
5th percentile: 1.9440770149230957
10th percentile: 2.1029388904571533
20th percentile: 2.4082656383514403
30th percentile: 2.8536463022232055
40th percentile: 3.154218864440918
50th percentile: 3.3451987504959106
60th percentile: 3.531356191635132
70th percentile: 3.71970112323761
80th percentile: 3.908333492279053
90th percentile: 4.152766227722168
95th percentile: 4.249327898025513
99th percentile: 4.326577234268188
mean time: 3.209775471687317
Pipeline stage StressChecker completed in 89.28s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.68s
Shutdown handler de-registered
function_ropir_2025-11-25 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2718.12s
Shutdown handler de-registered
function_ropir_2025-11-25 status is now inactive due to auto deactivation removed underperforming models
function_ropir_2025-11-25 status is now torndown due to DeploymentManager action