developer_uid: chai_evaluation_service
submission_id: function_luhin_2025-12-14
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-14T06:56:29+00:00
num_battles: 7781
num_wins: 3858
celo_rating: 1256.35
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.49582315897699525
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
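The formatter templates above can be read as a recipe for assembling the model input. The following is a minimal sketch of one plausible assembly order (memory, then prompt, then conversation turns, then the response header). The function name `build_prompt`, the `turns` structure, and the ordering are assumptions made for illustration and are not confirmed by this log. Only the template strings are copied from the `formatter` entry.

```python
# Template strings copied from the `formatter` entry above.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: join the templates in an assumed order.

    `turns` is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user". This structure is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues
    # the bot's next line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


if __name__ == "__main__":
    example = build_prompt(
        memory="You are Richard.",
        prompt="A casual chat.",
        turns=[("user", "Hi"), ("bot", "Hello")],
        bot_name="Richard",
        user_name="User",
    )
    print(example)
```

Because `stopping_words` in `generation_params` contains `'\n'`, generation presumably stops at the first newline, which would keep each reply to a single line after the `{bot_name}:` header.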
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.8802032470703125s
Received healthy response to inference request in 3.0856876373291016s
Received healthy response to inference request in 1.767406940460205s
Received healthy response to inference request in 3.8532092571258545s
Received healthy response to inference request in 2.065861940383911s
Received healthy response to inference request in 1.9148287773132324s
Received healthy response to inference request in 2.936687707901001s
Received healthy response to inference request in 2.7174439430236816s
Received healthy response to inference request in 2.025348424911499s
Received healthy response to inference request in 1.8263695240020752s
10 requests
0 failed requests
5th percentile: 1.7939401030540467
10th percentile: 1.8204732656478881
20th percentile: 1.869436502456665
30th percentile: 1.9044411182403564
40th percentile: 1.9811405658721923
50th percentile: 2.045605182647705
60th percentile: 2.326494741439819
70th percentile: 2.7832170724868774
80th percentile: 2.966487693786621
90th percentile: 3.1624397993087765
95th percentile: 3.5078245282173146
99th percentile: 3.784132311344147
mean time: 2.4073047399520875
Pipeline stage StressChecker completed in 25.44s
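The StressChecker summary above can be reproduced from the ten logged response times. The sketch below assumes percentiles are computed by linear interpolation between the nearest ranks of the sorted samples. That assumption is consistent with the logged values, but the log does not state which method or library the pipeline uses. The helper name `percentile` is illustrative.

```python
import math

# Response times in seconds, copied from the StressChecker log lines above.
latencies = [
    1.8802032470703125,
    3.0856876373291016,
    1.767406940460205,
    3.8532092571258545,
    2.065861940383911,
    1.9148287773132324,
    2.936687707901001,
    2.7174439430236816,
    2.025348424911499,
    1.8263695240020752,
]


def percentile(values, q):
    """Return the q-th percentile (0 <= q <= 100) by linear interpolation.

    The fractional position in the sorted data is (n - 1) * q / 100;
    the result interpolates between the two neighbouring samples.
    """
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0
    lower = math.floor(pos)
    upper = min(lower + 1, len(ordered) - 1)
    frac = pos - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(latencies, q)}")
    print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only ten samples, the upper percentiles are dominated by the single slowest request (about 3.85 s), so the 95th and 99th percentile figures are mostly an interpolation toward that one value.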
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.55s
Shutdown handler de-registered
function_luhin_2025-12-14 status is now deployed due to DeploymentManager action
function_luhin_2025-12-14 status is now inactive due to auto deactivation removed underperforming models