developer_uid: chai_evaluation_service
submission_id: function_molit_2025-12-18
model_name: richard
model_group:
status: torndown
timestamp: 2025-12-21T07:21:19+00:00
num_battles: 8046
num_wins: 4008
celo_rating: 1291.93
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-20
win_ratio: 0.4981357196122297
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.7660858631134033s
Received healthy response to inference request in 1.776709794998169s
Received healthy response to inference request in 2.13187837600708s
Received healthy response to inference request in 3.4195127487182617s
Received healthy response to inference request in 2.329387664794922s
Received healthy response to inference request in 1.8619909286499023s
Received healthy response to inference request in 2.369471311569214s
Received healthy response to inference request in 3.082639455795288s
Received healthy response to inference request in 3.1817541122436523s
Received healthy response to inference request in 2.6812350749969482s
10 requests
0 failed requests
5th percentile: 1.815086305141449
10th percentile: 1.853462815284729
20th percentile: 2.0779008865356445
30th percentile: 2.270134878158569
40th percentile: 2.3534378528594972
50th percentile: 2.525353193283081
60th percentile: 2.7151753902435303
70th percentile: 2.8610519409179687
80th percentile: 3.102462387084961
90th percentile: 3.205529975891113
95th percentile: 3.3125213623046874
99th percentile: 3.398114471435547
mean time: 2.5600665330886843
Pipeline stage StressChecker completed in 27.04s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.66s
Shutdown handler de-registered
function_molit_2025-12-18 status is now deployed due to DeploymentManager action
function_molit_2025-12-18 status is now inactive due to auto deactivation removed underperforming models
function_molit_2025-12-18 status is now torndown due to DeploymentManager action