developer_uid: chai_evaluation_service
submission_id: function_lokas_2025-12-14
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-17T17:21:24+00:00
num_battles: 8710
num_wins: 4355
celo_rating: 1308.46
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.5
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.400546073913574s
Received healthy response to inference request in 1.9899003505706787s
Received healthy response to inference request in 2.550257921218872s
Received healthy response to inference request in 2.4120240211486816s
Received healthy response to inference request in 2.2671661376953125s
Received healthy response to inference request in 2.2969746589660645s
Received healthy response to inference request in 2.561552047729492s
Received healthy response to inference request in 2.4108822345733643s
Received healthy response to inference request in 2.012235164642334s
10 requests
1 failed requests
5th percentile: 1.9999510169029235
10th percentile: 2.0100016832351684
20th percentile: 2.216179943084717
30th percentile: 2.2880321025848387
40th percentile: 2.3591175079345703
50th percentile: 2.4057141542434692
60th percentile: 2.411338949203491
70th percentile: 2.4534941911697388
80th percentile: 2.552516746520996
90th percentile: 4.316243934631341
95th percentile: 12.212357425689678
99th percentile: 18.52924821853638
mean time: 4.101000952720642
%s, retrying in %s seconds...
Received healthy response to inference request in 2.5603466033935547s
Received healthy response to inference request in 3.3966588973999023s
Received healthy response to inference request in 1.9232194423675537s
Received healthy response to inference request in 2.865983486175537s
Received healthy response to inference request in 2.065730094909668s
Received healthy response to inference request in 2.442410945892334s
Received healthy response to inference request in 1.962177038192749s
Received healthy response to inference request in 2.262962818145752s
Received healthy response to inference request in 2.196711778640747s
Received healthy response to inference request in 2.0497236251831055s
10 requests
0 failed requests
5th percentile: 1.9407503604888916
10th percentile: 1.9582812786102295
20th percentile: 2.032214307785034
30th percentile: 2.060928153991699
40th percentile: 2.1443191051483153
50th percentile: 2.2298372983932495
60th percentile: 2.3347420692443848
70th percentile: 2.4777916431427003
80th percentile: 2.6214739799499513
90th percentile: 2.9190510272979733
95th percentile: 3.1578549623489374
99th percentile: 3.3488981103897095
mean time: 2.3725924730300902
Pipeline stage StressChecker completed in 67.24s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.55s
Shutdown handler de-registered
function_lokas_2025-12-14 status is now deployed due to DeploymentManager action
function_lokas_2025-12-14 status is now inactive due to auto deactivation removed underperforming models
function_lokas_2025-12-14 status is now torndown due to DeploymentManager action