developer_uid: chai_evaluation_service
submission_id: function_fapas_2025-12-16
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-19T09:51:18+00:00
num_battles: 8012
num_wins: 3951
celo_rating: 1288.45
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-19
win_ratio: 0.4931352970544184
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 3.635266065597534s
Received healthy response to inference request in 4.159644842147827s
Received healthy response to inference request in 9.262233018875122s
Received healthy response to inference request in 2.995957851409912s
Received healthy response to inference request in 6.621118783950806s
Received healthy response to inference request in 4.04100227355957s
Received healthy response to inference request in 2.6818037033081055s
Received healthy response to inference request in 5.227586507797241s
Received healthy response to inference request in 3.279301643371582s
10 requests
1 failed requests
5th percentile: 2.8231730699539184
10th percentile: 2.9645424365997313
20th percentile: 3.222632884979248
30th percentile: 3.5284767389297484
40th percentile: 3.8787077903747558
50th percentile: 4.100323557853699
60th percentile: 4.586821508407592
70th percentile: 5.64564619064331
80th percentile: 7.14934163093567
90th percentile: 10.346455073356625
95th percentile: 15.225454318523395
99th percentile: 19.12865371465683
mean time: 6.200836825370788
%s, retrying in %s seconds...
Received healthy response to inference request in 2.6198949813842773s
Received healthy response to inference request in 3.1854617595672607s
Received healthy response to inference request in 4.655912160873413s
Received healthy response to inference request in 2.737015962600708s
Received healthy response to inference request in 3.9887123107910156s
Received healthy response to inference request in 4.144143342971802s
Received healthy response to inference request in 4.1597137451171875s
Received healthy response to inference request in 3.1244471073150635s
Received healthy response to inference request in 2.270402193069458s
Received healthy response to inference request in 4.553628206253052s
10 requests
0 failed requests
5th percentile: 2.427673947811127
10th percentile: 2.5849457025527953
20th percentile: 2.713591766357422
30th percentile: 3.0082177639007566
40th percentile: 3.161055898666382
50th percentile: 3.587087035179138
60th percentile: 4.05088472366333
70th percentile: 4.148814463615418
80th percentile: 4.23849663734436
90th percentile: 4.563856601715088
95th percentile: 4.6098843812942505
99th percentile: 4.646706604957581
mean time: 3.5439331769943236
Pipeline stage StressChecker completed in 102.13s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.63s
Shutdown handler de-registered
function_fapas_2025-12-16 status is now deployed due to DeploymentManager action
function_fapas_2025-12-16 status is now inactive due to auto deactivation removed underperforming models
function_fapas_2025-12-16 status is now torndown due to DeploymentManager action