developer_uid: chai_evaluation_service
submission_id: function_tilok_2025-12-14
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-14T05:27:37+00:00
num_battles: 7515
num_wins: 3809
celo_rating: 1256.34
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.5068529607451763
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
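The formatter entry above is a set of string templates. How the service joins them is not shown in this log, so the following is only a minimal sketch under an assumed order: memory, then prompt, then conversation turns, then the response header. The function name `build_prompt`, the `turns` structure, and all example values are hypothetical. Truncation to `max_input_tokens` is not modeled.

```python
# Templates copied from the formatter entry in the submission metadata.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(formatter, memory, prompt, turns, bot_name, user_name):
    """Assemble one model input string (hypothetical helper, assumed ordering)."""
    parts = [
        formatter["memory_template"].format(memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    # Each turn is a (speaker, message) pair; speaker is "bot" or "user".
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Illustrative values only; none of these come from the log.
example = build_prompt(
    FORMATTER,
    memory="A friendly assistant persona.",
    prompt="A casual chat.",
    turns=[("user", "Hello"), ("bot", "Hi there")],
    bot_name="richard",
    user_name="User",
)
print(example)
```

Because the response template ends with the bot name and a colon, and `stopping_words` contains a newline, a completion under this layout would presumably be a single line of bot dialogue.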
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.7604525089263916s
Received healthy response to inference request in 2.8525655269622803s
Received healthy response to inference request in 2.96940279006958s
Received healthy response to inference request in 3.29190731048584s
Received healthy response to inference request in 1.673384189605713s
Received healthy response to inference request in 2.6070096492767334s
Received healthy response to inference request in 2.2102277278900146s
Received healthy response to inference request in 1.9010965824127197s
Received healthy response to inference request in 3.42012619972229s
Received healthy response to inference request in 2.164942741394043s
10 requests
0 failed requests
5th percentile: 1.7125649333000184
10th percentile: 1.7517456769943238
20th percentile: 1.8729677677154541
30th percentile: 2.085788893699646
40th percentile: 2.192113733291626
50th percentile: 2.408618688583374
60th percentile: 2.705232000350952
70th percentile: 2.8876167058944704
80th percentile: 3.033903694152832
90th percentile: 3.304729199409485
95th percentile: 3.3624276995658873
99th percentile: 3.4085864996910096
mean time: 2.4851115226745604
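The percentile and mean figures above can be reproduced from the ten logged response times. The sketch below assumes linear interpolation between the two nearest ranks, which matches the reported values. The helper name `percentile` is illustrative and is not taken from the StressChecker code.

```python
# Response times in seconds, copied from the StressChecker log lines above.
latencies = [
    1.7604525089263916,
    2.8525655269622803,
    2.96940279006958,
    3.29190731048584,
    1.673384189605713,
    2.6070096492767334,
    2.2102277278900146,
    1.9010965824127197,
    3.42012619972229,
    2.164942741394043,
]


def percentile(values, p):
    """Return the p-th percentile using linear interpolation between ranks."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * p / 100.0  # fractional index into the sorted list
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


mean_time = sum(latencies) / len(latencies)

for p in (5, 50, 90, 99):
    print(f"{p}th percentile: {percentile(latencies, p)}")
print(f"mean time: {mean_time}")
```

With only ten samples, the upper percentiles are interpolated almost entirely between the two slowest requests, so they say little about tail latency under real load.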
Pipeline stage StressChecker completed in 26.44s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.57s
Shutdown handler de-registered
function_tilok_2025-12-14 status is now deployed due to DeploymentManager action
function_tilok_2025-12-14 status is now inactive due to auto deactivation removed underperforming models