developer_uid: chai_evaluation_service
submission_id: function_gonas_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-13T23:47:10+00:00
num_battles: 6823
num_wins: 3430
celo_rating: 1256.34
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.5027114172651327
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.6443934440612793s
Received healthy response to inference request in 3.161734104156494s
Received healthy response to inference request in 2.2357242107391357s
Received healthy response to inference request in 1.7317595481872559s
Received healthy response to inference request in 2.076725959777832s
Received healthy response to inference request in 2.5557925701141357s
Received healthy response to inference request in 2.183180809020996s
Received healthy response to inference request in 3.71516752243042s
Received healthy response to inference request in 2.463834524154663s
Received healthy response to inference request in 2.3040971755981445s
10 requests
0 failed requests
5th percentile: 1.886994433403015
10th percentile: 2.0422293186187743
20th percentile: 2.161889839172363
30th percentile: 2.2199611902236938
40th percentile: 2.276747989654541
50th percentile: 2.383965849876404
60th percentile: 2.500617742538452
70th percentile: 2.5823728322982786
80th percentile: 2.7478615760803224
90th percentile: 3.2170774459838865
95th percentile: 3.466122484207153
99th percentile: 3.665358514785767
mean time: 2.5072409868240357
Pipeline stage StressChecker completed in 26.49s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.71s
Shutdown handler de-registered
function_gonas_2025-12-13 status is now deployed due to DeploymentManager action
function_gonas_2025-12-13 status is now inactive due to auto deactivation removed underperforming models