developer_uid: chai_evaluation_service
submission_id: function_kebas_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-13T09:47:07+00:00
num_battles: 9984
num_wins: 4986
celo_rating: 1256.33
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.49939903846153844
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
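A minimal sketch of how the formatter templates above might be assembled into one prompt string. The assembly order (memory, prompt, conversation turns, response header), the `build_prompt` helper, and the sample names are assumptions for illustration; the record does not show the service's actual code. Truncation to `max_input_tokens` is not modeled.

```python
# Templates copied from the `formatter` field of this record.
formatter = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (role, message) pairs, where role is "bot" or "user".
    """
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for role, message in turns:
        if role == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues
    # the bot's next line; the '\n' stopping word then ends generation
    # after a single line.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    memory="Bot is a friendly guide.",
    prompt="A short chat.",
    turns=[("user", "hi"), ("bot", "hello")],
    bot_name="Bot",
    user_name="Sam",
)
```

Printing `example` shows the Instruction, Input, and Response sections in sequence, ending with the bot name and a colon as the generation cue.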
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.6906228065490723s
Received healthy response to inference request in 5.377932071685791s
Received healthy response to inference request in 2.659717321395874s
Received healthy response to inference request in 3.0034091472625732s
Received healthy response to inference request in 2.283799409866333s
Received healthy response to inference request in 4.405146837234497s
Received healthy response to inference request in 3.3370790481567383s
Received healthy response to inference request in 2.470776319503784s
Received healthy response to inference request in 1.7169551849365234s
Received healthy response to inference request in 1.9896197319030762s
10 requests
0 failed requests
5th percentile: 1.839654231071472
10th percentile: 1.962353277206421
20th percentile: 2.2249634742736815
30th percentile: 2.4146832466125487
40th percentile: 2.584140920639038
50th percentile: 2.675170063972473
60th percentile: 2.8157373428344723
70th percentile: 3.103510117530823
80th percentile: 3.5506926059722903
90th percentile: 4.502425360679626
95th percentile: 4.940178716182707
99th percentile: 5.290381400585175
mean time: 2.9935057878494264
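The StressChecker summary above can be reproduced from the ten logged latencies. The sketch below assumes linearly interpolated percentiles (the convention NumPy's `percentile` uses by default), which agrees with the logged values; the `percentile` helper is illustrative and is not taken from the pipeline's code.

```python
import math

# Response times (seconds) copied from the ten "healthy response" log lines.
latencies = [
    2.6906228065490723,
    5.377932071685791,
    2.659717321395874,
    3.0034091472625732,
    2.283799409866333,
    4.405146837234497,
    3.3370790481567383,
    2.470776319503784,
    1.7169551849365234,
    1.9896197319030762,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    if not values:
        raise ValueError("values must not be empty")
    ordered = sorted(values)
    # Fractional position of the percentile within the sorted sample.
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only ten samples, the upper percentiles are driven almost entirely by the two slowest requests (4.41 s and 5.38 s), so they are a coarse indication of tail latency at best.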
Pipeline stage StressChecker completed in 31.22s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_kebas_2025-12-13 status is now deployed due to DeploymentManager action
function_kebas_2025-12-13 status is now inactive due to auto deactivation removed underperforming models