developer_uid: NischayDnk
submission_id: function_fukok_2025-08-19
model_name: function_fukok_2025-08-19
model_group:
status: torndown
timestamp: 2025-08-19T21:47:12+00:00
num_battles: 5476
num_wins: 2866
celo_rating: 1280.75
family_friendly_score: 0.5388
family_friendly_standard_error: 0.0070497455273222445
submission_type: function
display_name: function_fukok_2025-08-19
is_internal_developer: False
ranking_group: single
us_pacific_date: 2025-08-19
win_ratio: 0.5233747260774287
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 3.5316851139068604s
Received healthy response to inference request in 4.089078426361084s
Received healthy response to inference request in 2.2365777492523193s
Received healthy response to inference request in 2.5110273361206055s
5 requests
1 failed requests
5th percentile: 2.2914676666259766
10th percentile: 2.346357583999634
20th percentile: 2.4561374187469482
30th percentile: 2.7151588916778566
40th percentile: 3.1234220027923585
50th percentile: 3.5316851139068604
60th percentile: 3.75464243888855
70th percentile: 3.977599763870239
80th percentile: 7.294029378890994
90th percentile: 13.703931283950807
95th percentile: 16.90888223648071
99th percentile: 19.47284299850464
mean time: 6.496440362930298
%s, retrying in %s seconds...
Received healthy response to inference request in 2.9854395389556885s
Received healthy response to inference request in 3.9101576805114746s
Received healthy response to inference request in 3.335317611694336s
Received healthy response to inference request in 2.184443473815918s
Received healthy response to inference request in 2.896970748901367s
5 requests
0 failed requests
5th percentile: 2.3269489288330076
10th percentile: 2.4694543838500977
20th percentile: 2.7544652938842775
30th percentile: 2.9146645069122314
40th percentile: 2.95005202293396
50th percentile: 2.9854395389556885
60th percentile: 3.1253907680511475
70th percentile: 3.2653419971466064
80th percentile: 3.450285625457764
90th percentile: 3.6802216529846192
95th percentile: 3.7951896667480467
99th percentile: 3.887164077758789
mean time: 3.0624658107757567
Pipeline stage StressChecker completed in 50.42s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.94s
Shutdown handler de-registered
function_fukok_2025-08-19 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3137.87s
Shutdown handler de-registered
function_fukok_2025-08-19 status is now inactive due to auto deactivation removed underperforming models
function_fukok_2025-08-19 status is now protected due to ABTestQueueItem
function_fukok_2025-08-19 status is now protected due to ABTestQueueItem
function_fukok_2025-08-19 status is now torndown due to DeploymentManager action