developer_uid: chai_evaluation_service
submission_id: function_kogek_2025-12-14
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-14T05:57:29+00:00
num_battles: 7532
num_wins: 3754
celo_rating: 1256.35
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.49840679766330326
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
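The formatter entry above uses Python-style `{placeholder}` fields, so one plausible reading is that the prompt is assembled by filling each template and concatenating the results. The sketch below illustrates that reading only; the function `build_prompt`, the assembly order, and the `(speaker, message)` turn format are assumptions and are not confirmed by the log. Only the template strings are copied from the record.

```python
# Template strings copied from the `formatter` field of the record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, bot_name, user_name, turns):
    """Hypothetical helper: fill the templates and join them in order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user". The ordering (memory, prompt, chat turns,
    response header) is an assumption, not taken from the log.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues
    # the bot's line from that point.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Illustrative inputs only; none of these values come from the record.
example = build_prompt(
    memory="A friendly guide.",
    prompt="Greet the user.",
    bot_name="richard",
    user_name="User",
    turns=[("user", "Hi")],
)
print(example)
```

Because `stopping_words` in `generation_params` is `['\n']`, a prompt ending in `richard:` would yield a single-line reply under this reading.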
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.4864108562469482s
Received healthy response to inference request in 1.6502954959869385s
Received healthy response to inference request in 3.2145724296569824s
Received healthy response to inference request in 2.3597428798675537s
Received healthy response to inference request in 2.386958360671997s
Received healthy response to inference request in 2.4434821605682373s
Received healthy response to inference request in 1.5908479690551758s
Received healthy response to inference request in 1.6377921104431152s
Received healthy response to inference request in 2.4948501586914062s
Received healthy response to inference request in 2.1863088607788086s
10 requests
0 failed requests
5th percentile: 1.6119728326797484
10th percentile: 1.6330976963043213
20th percentile: 1.647794818878174
30th percentile: 2.0255048513412475
40th percentile: 2.2903692722320557
50th percentile: 2.3733506202697754
60th percentile: 2.409567880630493
70th percentile: 2.4563607692718508
80th percentile: 2.48809871673584
90th percentile: 2.5668223857879635
95th percentile: 2.8906974077224725
99th percentile: 3.149797425270081
mean time: 2.2451261281967163
Pipeline stage StressChecker completed in 23.87s
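The StressChecker summary above can be cross-checked from the ten logged response times. The sketch below assumes the percentiles are computed with linear interpolation between order statistics (the default method in common numerical libraries); the log does not state which method the pipeline uses, and the helper `percentile` is hypothetical. The latency values are copied from the log.

```python
# Response times in seconds, copied from the StressChecker log lines.
latencies = [
    2.4864108562469482,
    1.6502954959869385,
    3.2145724296569824,
    2.3597428798675537,
    2.386958360671997,
    2.4434821605682373,
    1.5908479690551758,
    1.6377921104431152,
    2.4948501586914062,
    2.1863088607788086,
]


def percentile(values, q):
    """Hypothetical helper: q-th percentile with linear interpolation.

    The fractional rank (n - 1) * q / 100 is split into an integer index
    and a fraction, and the result is interpolated between the two
    neighbouring sorted values.
    """
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

Under this assumption the computed values agree with the logged percentiles and mean to within floating-point rounding, which suggests, but does not prove, that the pipeline uses the same interpolation.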
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.61s
Shutdown handler de-registered
function_kogek_2025-12-14 status is now deployed due to DeploymentManager action
function_kogek_2025-12-14 status is now inactive due to auto deactivation removed underperforming models