developer_uid: chai_backend_admin
submission_id: function_nudob_2025-04-09
model_name: function_nudob_2025-04-09
model_group:
status: torndown
timestamp: 2025-04-09T21:38:26+00:00
num_battles: 5886
num_wins: 2926
celo_rating: 1277.01
family_friendly_score: 0.5538000000000001
family_friendly_standard_error: 0.0070300150782199615
submission_type: function
display_name: function_nudob_2025-04-09
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-04-09
win_ratio: 0.49711179068977235
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
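The derived fields in the metadata above can be cross-checked with a short sketch. The win ratio follows directly from `num_wins` and `num_battles`. The standard error check is an assumption: the record does not state a sample size or a formula, and the hypothetical `assumed_samples = 5000` below is simply the value for which a binomial standard error matches the logged figure.

```python
import math

# Values copied from the metadata above.
num_battles = 5886
num_wins = 2926
family_friendly_score = 0.5538000000000001
logged_standard_error = 0.0070300150782199615

# win_ratio is the plain fraction of battles won.
win_ratio = num_wins / num_battles

# ASSUMPTION: the scorer's sample size is not in the record. 5000 is a
# hypothetical value chosen because it makes the binomial standard error
# sqrt(p * (1 - p) / n) agree with the logged number.
assumed_samples = 5000
binomial_se = math.sqrt(
    family_friendly_score * (1.0 - family_friendly_score) / assumed_samples
)

print(f"win_ratio   = {win_ratio}")
print(f"binomial_se = {binomial_se} (logged: {logged_standard_error})")
```

If the two standard error values agree, that is consistent with the score being a proportion over roughly 5000 evaluated samples, but it does not confirm how the pipeline actually computes it.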
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.225604772567749s
Received healthy response to inference request in 3.0372157096862793s
Received healthy response to inference request in 2.181831121444702s
Received healthy response to inference request in 3.707063674926758s
Received healthy response to inference request in 3.709373950958252s
5 requests
0 failed requests
5th percentile: 2.3529080390930175
10th percentile: 2.523984956741333
20th percentile: 2.866138792037964
30th percentile: 3.171185302734375
40th percentile: 3.4391244888305663
50th percentile: 3.707063674926758
60th percentile: 3.7079877853393555
70th percentile: 3.708911895751953
80th percentile: 3.8126201152801515
90th percentile: 4.01911244392395
95th percentile: 4.122358608245849
99th percentile: 4.204955539703369
mean time: 3.3722178459167482
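The percentile summary above can be recomputed from the five logged latencies. The sketch below assumes linear interpolation between the two nearest ranks (the default method of NumPy's percentile function); the log does not say which method StressChecker uses, and the helper name `percentile` is hypothetical.

```python
import math

# Latencies (seconds) from the five healthy responses logged above.
latencies = [
    4.225604772567749,
    3.0372157096862793,
    2.181831121444702,
    3.707063674926758,
    3.709373950958252,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation.

    ASSUMPTION: interpolation between the two nearest ranks; the log
    does not state the method actually used by the pipeline.
    """
    if not values:
        raise ValueError("values must not be empty")
    ordered = sorted(values)
    position = (len(ordered) - 1) * q / 100.0
    lower = math.floor(position)
    upper = math.ceil(position)
    fraction = position - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


for q in (5, 50, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

Under this assumption the computed 5th, 50th, and 99th percentiles and the mean agree with the logged values to within floating-point rounding.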
%s, retrying in %s seconds...
Received healthy response to inference request in 1.9178776741027832s
Received healthy response to inference request in 2.814173460006714s
Received healthy response to inference request in 3.257019519805908s
Received healthy response to inference request in 2.7897047996520996s
Failed to get response for submission jellywibble-alex-the-gy_43020_v1: HTTPConnectionPool(host='jellywibble-alex-the-gy-43020-v1-predictor.tenant-chaiml-guanaco.k.chaiverse.com', port=80): Read timed out. (read timeout=12.0)
Received healthy response to inference request in 2.308065414428711s
5 requests
0 failed requests
5th percentile: 1.9959152221679688
10th percentile: 2.0739527702331544
20th percentile: 2.2300278663635256
30th percentile: 2.4043932914733888
40th percentile: 2.597049045562744
50th percentile: 2.7897047996520996
60th percentile: 2.7994922637939452
70th percentile: 2.809279727935791
80th percentile: 2.902742671966553
90th percentile: 3.0798810958862304
95th percentile: 3.168450307846069
99th percentile: 3.2393056774139404
mean time: 2.6173681735992433
Pipeline stage StressChecker completed in 32.18s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.70s
Shutdown handler de-registered
function_nudob_2025-04-09 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2967.23s
Shutdown handler de-registered
function_nudob_2025-04-09 status is now inactive due to auto deactivation removed underperforming models
function_nudob_2025-04-09 status is now torndown due to DeploymentManager action