developer_uid: chai_evaluation_service
submission_id: function_mupib_2025-12-14
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-14T12:27:34+00:00
num_battles: 8036
num_wins: 3939
celo_rating: 1256.38
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-14
win_ratio: 0.49016923842707816
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.102978467941284s
Received healthy response to inference request in 2.7716073989868164s
Received healthy response to inference request in 4.001223087310791s
Received healthy response to inference request in 3.3646674156188965s
Received healthy response to inference request in 6.093203783035278s
Received healthy response to inference request in 3.896284818649292s
Received healthy response to inference request in 2.7560172080993652s
Received healthy response to inference request in 5.743488788604736s
Received healthy response to inference request in 2.2814252376556396s
Received healthy response to inference request in 2.9826419353485107s
10 requests
0 failed requests
5th percentile: 2.183279514312744
10th percentile: 2.263580560684204
20th percentile: 2.66109881401062
30th percentile: 2.766930341720581
40th percentile: 2.898228120803833
50th percentile: 3.1736546754837036
60th percentile: 3.5773143768310542
70th percentile: 3.927766299247742
80th percentile: 4.349676227569581
90th percentile: 5.77846028804779
95th percentile: 5.935832035541534
99th percentile: 6.06172943353653
mean time: 3.599353814125061
Pipeline stage StressChecker completed in 37.28s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_mupib_2025-12-14 status is now deployed due to DeploymentManager action
function_mupib_2025-12-14 status is now inactive due to auto deactivation removed underperforming models