developer_uid: chai_evaluation_service
submission_id: function_gokem_2025-12-14
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-14T09:56:43+00:00
num_battles: 8197
num_wins: 4131
celo_rating: 1256.37
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.5039648651945834
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.878405570983887s
Received healthy response to inference request in 2.7892746925354004s
Received healthy response to inference request in 2.9370994567871094s
Received healthy response to inference request in 4.304009437561035s
Received healthy response to inference request in 2.271383285522461s
Received healthy response to inference request in 3.443807363510132s
Received healthy response to inference request in 2.708498001098633s
Received healthy response to inference request in 2.8581957817077637s
Received healthy response to inference request in 3.6648688316345215s
Received healthy response to inference request in 3.087306499481201s
10 requests
0 failed requests
5th percentile: 2.4680849075317384
10th percentile: 2.664786529541016
20th percentile: 2.773119354248047
30th percentile: 2.8375194549560545
40th percentile: 2.905537986755371
50th percentile: 3.0122029781341553
60th percentile: 3.2299068450927733
70th percentile: 3.5101258039474486
80th percentile: 3.792696952819824
90th percentile: 4.36144905090332
95th percentile: 4.619927310943603
99th percentile: 4.82670991897583
mean time: 3.2942848920822145
Pipeline stage StressChecker completed in 35.23s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.66s
Shutdown handler de-registered
function_gokem_2025-12-14 status is now deployed due to DeploymentManager action
function_gokem_2025-12-14 status is now inactive due to auto deactivation removed underperforming models