developer_uid: chai_evaluation_service
submission_id: function_mapif_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-13T23:17:22+00:00
num_battles: 7073
num_wins: 3539
celo_rating: 1256.34
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.5003534568075781
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.8894307613372803s
Received healthy response to inference request in 5.168447971343994s
Received healthy response to inference request in 2.657543420791626s
Received healthy response to inference request in 4.3857262134552s
Received healthy response to inference request in 2.536853075027466s
Received healthy response to inference request in 1.7467200756072998s
Received healthy response to inference request in 3.7894279956817627s
Received healthy response to inference request in 4.926270484924316s
Received healthy response to inference request in 2.7043046951293945s
Received healthy response to inference request in 2.4200096130371094s
10 requests
0 failed requests
5th percentile: 2.049700367450714
10th percentile: 2.3526806592941285
20th percentile: 2.5134843826293944
30th percentile: 2.621336317062378
40th percentile: 2.685600185394287
50th percentile: 2.7968677282333374
60th percentile: 3.2494296550750725
70th percentile: 3.9683174610137937
80th percentile: 4.493835067749024
90th percentile: 4.9504882335662845
95th percentile: 5.059468102455139
99th percentile: 5.146651997566223
mean time: 3.322473430633545
Pipeline stage StressChecker completed in 34.49s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.57s
Shutdown handler de-registered
function_mapif_2025-12-13 status is now deployed due to DeploymentManager action
function_mapif_2025-12-13 status is now inactive due to auto deactivation removed underperforming models