developer_uid: chai_backend_admin
submission_id: function_kilom_2025-12-01
model_name: function_kilom_2025-12-01
model_group:
status: inactive
timestamp: 2025-12-02T05:26:54+00:00
num_battles: 5084
num_wins: 2616
celo_rating: 1303.39
family_friendly_score: 0.5166
family_friendly_standard_error: 0.007067169730521547
submission_type: function
display_name: function_kilom_2025-12-01
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-01
win_ratio: 0.5145554681353265
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 17.89620351791382s
Received healthy response to inference request in 13.37550163269043s
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.510467052459717s
Received healthy response to inference request in 3.2012221813201904s
Received healthy response to inference request in 2.7247068881988525s
Received healthy response to inference request in 2.133031129837036s
Received healthy response to inference request in 1.5820512771606445s
Received healthy response to inference request in 2.1891074180603027s
Received healthy response to inference request in 2.2210447788238525s
10 requests
1 failed requests
5th percentile: 1.8299922108650208
10th percentile: 2.077933144569397
20th percentile: 2.1778921604156496
30th percentile: 2.2114635705947876
40th percentile: 2.394698143005371
50th percentile: 2.6175869703292847
60th percentile: 2.9153130054473873
70th percentile: 6.253506016731261
80th percentile: 14.279642009735108
90th percentile: 18.11749541759491
95th percentile: 19.113308966159817
99th percentile: 19.90995980501175
mean time: 6.7942458391189575
%s, retrying in %s seconds...
Received healthy response to inference request in 3.1179604530334473s
Received healthy response to inference request in 2.7537682056427s
Received healthy response to inference request in 2.128934383392334s
Received healthy response to inference request in 3.5589935779571533s
Received healthy response to inference request in 3.41084623336792s
Received healthy response to inference request in 1.5206332206726074s
Received healthy response to inference request in 2.3225395679473877s
Received healthy response to inference request in 2.178162097930908s
Received healthy response to inference request in 3.090634822845459s
Received healthy response to inference request in 2.1543397903442383s
10 requests
0 failed requests
5th percentile: 1.7943687438964844
10th percentile: 2.0681042671203613
20th percentile: 2.1492587089538575
30th percentile: 2.171015405654907
40th percentile: 2.264788579940796
50th percentile: 2.538153886795044
60th percentile: 2.8885148525238034
70th percentile: 3.0988325119018554
80th percentile: 3.176537609100342
90th percentile: 3.4256609678268433
95th percentile: 3.4923272728919983
99th percentile: 3.5456603169441223
mean time: 2.6236812353134153
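The log does not say how StressChecker computes its percentile figures. The values for this second run are consistent with linear interpolation between the sorted latencies, which is also the default behaviour of numpy.percentile. The sketch below is a minimal reconstruction under that assumption, not the pipeline's actual code; the helper name `percentile` and the variable `latencies` are hypothetical.

```python
import math


def percentile(values, q):
    """Return the q-th percentile (q in [0, 100]) of a non-empty list,
    using linear interpolation between the two nearest sorted values."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


# Latencies in seconds, copied from the ten healthy responses logged above.
latencies = [
    3.1179604530334473,
    2.7537682056427,
    2.128934383392334,
    3.5589935779571533,
    3.41084623336792,
    1.5206332206726074,
    2.3225395679473877,
    2.178162097930908,
    3.090634822845459,
    2.1543397903442383,
]

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

The same method applied to the first run implies that the timed-out request was counted with a latency of roughly 20.1 seconds, which fits the 20 second read timeout reported in that run.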
Pipeline stage StressChecker completed in 96.73s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.61s
Shutdown handler de-registered
function_kilom_2025-12-01 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2982.06s
Shutdown handler de-registered
function_kilom_2025-12-01 status is now inactive due to auto deactivation removed underperforming models