developer_uid: chai_evaluation_service
submission_id: function_mikal_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-13T05:16:28+00:00
num_battles: 7493
num_wins: 3807
celo_rating: 1256.3
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-12
win_ratio: 0.5080742025890831
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
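The formatter entry above lists five templates but does not say how they are combined. A minimal sketch, assuming they are concatenated in the order memory, prompt, conversation turns, response header. The function name `build_prompt`, its arguments, the turn representation, and the ordering are all hypothetical and not taken from the evaluation service; truncation to `max_input_tokens` is not modelled.

```python
# Templates copied from the formatter entry in the metadata above.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Assemble one model input string (hypothetical helper).

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The concatenation order is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    print(build_prompt("You are a pirate.", "A tavern.",
                       [("user", "Hi"), ("bot", "Ahoy")], "Richard", "Sam"))
```

Under this reading, the `'\n'` stopping word in generation_params would end generation after a single line of bot text, which is consistent with every turn template ending in a newline.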
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.654411554336548s
Received healthy response to inference request in 1.8046095371246338s
Received healthy response to inference request in 2.905893564224243s
Received healthy response to inference request in 2.0909903049468994s
Received healthy response to inference request in 2.7910373210906982s
Received healthy response to inference request in 2.0137438774108887s
Received healthy response to inference request in 1.807286024093628s
Received healthy response to inference request in 1.6937620639801025s
Received healthy response to inference request in 2.050962448120117s
Received healthy response to inference request in 2.324620246887207s
10 requests
0 failed requests
5th percentile: 1.7436434268951415
10th percentile: 1.7935247898101807
20th percentile: 1.806750726699829
30th percentile: 1.9518065214157103
40th percentile: 2.0360750198364257
50th percentile: 2.0709763765335083
60th percentile: 2.1844422817230225
70th percentile: 2.4235576391220093
80th percentile: 2.681736707687378
90th percentile: 2.8025229454040526
95th percentile: 2.854208254814148
99th percentile: 2.8955565023422243
mean time: 2.2137316942214964
Pipeline stage StressChecker completed in 24.04s
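The percentile and mean figures reported by StressChecker can be reproduced from the ten response times logged above. A minimal sketch, assuming the checker uses linear interpolation between order statistics (the default behaviour of common numerical libraries); the function name `percentile` and the constant `LATENCIES` are illustrative and not part of the pipeline.

```python
# Response times in seconds, in the order they appear in the log.
LATENCIES = [
    2.654411554336548,
    1.8046095371246338,
    2.905893564224243,
    2.0909903049468994,
    2.7910373210906982,
    2.0137438774108887,
    1.807286024093628,
    1.6937620639801025,
    2.050962448120117,
    2.324620246887207,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    # Fractional position of the percentile within the sorted sample.
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(LATENCIES, q)}")
    print(f"mean time: {sum(LATENCIES) / len(LATENCIES)}")
```

With only ten samples, the upper percentiles are interpolated almost entirely between the two slowest requests, so the 95th and 99th values say little about tail latency under real load.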
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.62s
Shutdown handler de-registered
function_mikal_2025-12-13 status is now deployed due to DeploymentManager action
function_mikal_2025-12-13 status is now inactive due to auto deactivation (removed underperforming models)