developer_uid: chai_evaluation_service
submission_id: function_gotok_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-13T06:17:22+00:00
num_battles: 8427
num_wins: 4064
celo_rating: 1256.3
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-12
win_ratio: 0.4822594042957162
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
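
The formatter templates above could be assembled into a single model input roughly as follows. This is a minimal sketch, not the service's actual code: the function name `build_prompt`, its arguments, and the ordering (memory, then prompt, then chat turns, then the response header) are assumptions made for illustration. Truncation to `max_input_tokens` is not modeled.

```python
# Templates copied from the formatter field of this submission record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates into one prompt string.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user". The section ordering is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues
    # the bot's next line; generation stops at the first "\n" stopping word.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    example = build_prompt(
        memory="You are Richard.",
        prompt="A casual chat.",
        turns=[("user", "hi"), ("bot", "hello")],
        bot_name="richard",
        user_name="User",
    )
    print(example)
```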
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.7266132831573486s
Received healthy response to inference request in 2.4934964179992676s
Received healthy response to inference request in 2.162743091583252s
Received healthy response to inference request in 2.5472471714019775s
Received healthy response to inference request in 2.572603464126587s
Received healthy response to inference request in 1.889169454574585s
Received healthy response to inference request in 1.7296903133392334s
Received healthy response to inference request in 1.907952070236206s
Received healthy response to inference request in 2.356973171234131s
Received healthy response to inference request in 2.3212366104125977s
10 requests
0 failed requests
5th percentile: 1.7279979467391968
10th percentile: 1.729382610321045
20th percentile: 1.8572736263275147
30th percentile: 1.9023172855377197
40th percentile: 2.0608266830444335
50th percentile: 2.241989850997925
60th percentile: 2.335531234741211
70th percentile: 2.397930145263672
80th percentile: 2.5042465686798097
90th percentile: 2.5497828006744383
95th percentile: 2.5611931324005126
99th percentile: 2.570321397781372
mean time: 2.1707725048065187
Pipeline stage StressChecker completed in 23.10s
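
The percentile and mean figures reported by StressChecker can be reproduced from the ten response times logged above. The sketch below assumes linear interpolation between order statistics, which matches the logged values; the helper name `percentile` is hypothetical, and the checker's actual implementation is not shown in this log.

```python
import math

# Response times in seconds, copied from the ten "healthy response" log lines.
LATENCIES = [
    1.7266132831573486,
    2.4934964179992676,
    2.162743091583252,
    2.5472471714019775,
    2.572603464126587,
    1.889169454574585,
    1.7296903133392334,
    1.907952070236206,
    2.356973171234131,
    2.3212366104125977,
]


def percentile(values, p):
    """Return the p-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * p / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


def mean(values):
    """Arithmetic mean of the samples."""
    return sum(values) / len(values)


if __name__ == "__main__":
    for p in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{p}th percentile: {percentile(LATENCIES, p)}")
    print(f"mean time: {mean(LATENCIES)}")
```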
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_gotok_2025-12-13 status is now deployed due to DeploymentManager action
function_gotok_2025-12-13 status is now inactive due to auto deactivation removed underperforming models