developer_uid: chai_backend_admin
submission_id: function_situn_2026-02-12
model_name: function_situn_2026-02-12
model_group:
status: inactive
timestamp: 2026-02-12T21:51:12+00:00
num_battles: 2199
num_wins: 854
celo_rating: 1220.76
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: function_situn_2026-02-12
is_internal_developer: True
ranking_group: single
us_pacific_date: 2026-02-12
win_ratio: 0.3883583447021373
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.550222158432007s
Received healthy response to inference request in 2.928598642349243s
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 3.3172786235809326s
Received healthy response to inference request in 3.031404972076416s
Received healthy response to inference request in 7.75725245475769s
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.988492965698242s
Received healthy response to inference request in 5.829178810119629s
Received healthy response to inference request in 4.000615119934082s
10 requests
2 failed requests
5th percentile: 2.9555510878562927
10th percentile: 2.9825035333633423
20th percentile: 3.0228225708007814
30th percentile: 3.2315165281295775
40th percentile: 3.457044744491577
50th percentile: 3.7754186391830444
60th percentile: 4.7320405960083
70th percentile: 6.407600903511047
80th percentile: 10.233080577850345
90th percentile: 20.257123494148253
95th percentile: 20.800410401821136
99th percentile: 21.23503992795944
mean time: 7.4883134126663204
%s, retrying in %s seconds...
Received healthy response to inference request in 3.1844875812530518s
Received healthy response to inference request in 3.4765896797180176s
Received healthy response to inference request in 3.035712242126465s
Received healthy response to inference request in 2.7674050331115723s
Received healthy response to inference request in 3.8816580772399902s
Received healthy response to inference request in 4.459892272949219s
Received healthy response to inference request in 3.9230971336364746s
Received healthy response to inference request in 2.5204169750213623s
Received healthy response to inference request in 3.8204495906829834s
Received healthy response to inference request in 3.189861297607422s
10 requests
0 failed requests
5th percentile: 2.6315616011619567
10th percentile: 2.742706227302551
20th percentile: 2.9820508003234862
30th percentile: 3.139854979515076
40th percentile: 3.187711811065674
50th percentile: 3.3332254886627197
60th percentile: 3.6141336441040037
70th percentile: 3.8388121366500854
80th percentile: 3.889945888519287
90th percentile: 3.976776647567749
95th percentile: 4.218334460258483
99th percentile: 4.411580710411072
mean time: 3.4259569883346557
Pipeline stage StressChecker completed in 186.18s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.91s
Shutdown handler de-registered
function_situn_2026-02-12 status is now deployed due to DeploymentManager action
function_situn_2026-02-12 status is now inactive due to admin request
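
Note on the StressChecker statistics: the percentile and mean lines for the second run (10 requests, 0 failed requests) appear consistent with linear interpolation between the sorted latencies. The sketch below is an assumption about how such figures could be computed, not the StressChecker implementation; the helper name `percentile` and the variable `latencies` are hypothetical. The latency values are copied from the ten healthy responses logged in that run.

```python
import math

def percentile(samples, q):
    """Return the q-th percentile (0 <= q <= 100) of a non-empty list,
    interpolating linearly between the two nearest sorted values."""
    ordered = sorted(samples)
    pos = (len(ordered) - 1) * q / 100.0   # fractional index into the sorted list
    lower = math.floor(pos)
    upper = math.ceil(pos)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (pos - lower)

# Latencies in seconds, copied from the second StressChecker run in the log.
latencies = [
    3.1844875812530518, 3.4765896797180176, 3.035712242126465,
    2.7674050331115723, 3.8816580772399902, 4.459892272949219,
    3.9230971336364746, 2.5204169750213623, 3.8204495906829834,
    3.189861297607422,
]

for q in (5, 50, 90):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

The same check cannot be repeated for the first run, because the log does not record the elapsed time of its two timed-out requests.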