developer_uid: chai_evaluation_service
submission_id: function_titor_2025-12-14
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-14T05:57:29+00:00
num_battles: 7292
num_wins: 3699
celo_rating: 1256.35
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.5072682391662096
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
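The `formatter` templates above can be read as pieces that are filled in and concatenated into one prompt string. The sketch below is only an illustration under that assumption: `build_prompt`, the `turns` structure, and the use of `str.format` are hypothetical, not the service's actual code, and truncation to `max_input_tokens` is not modelled.

```python
# Templates copied from the `formatter` field; everything else is a hypothetical sketch.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Assemble a prompt from (role, message) turns, where role is 'bot' or 'user'."""
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for role, message in turns:
        if role == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template ends with "{bot_name}:" so the model continues the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)
```

Because `stopping_words` is `['\n']`, generation under this layout would stop at the end of the bot's first line, which is consistent with the single-line `bot_template`.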
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.1252145767211914s
Received healthy response to inference request in 3.1600794792175293s
Received healthy response to inference request in 1.8434765338897705s
Received healthy response to inference request in 3.6281981468200684s
Received healthy response to inference request in 4.070624589920044s
Received healthy response to inference request in 3.2771201133728027s
Received healthy response to inference request in 3.002758026123047s
Received healthy response to inference request in 1.9893476963043213s
Received healthy response to inference request in 3.1112077236175537s
Received healthy response to inference request in 2.9701077938079834s
10 requests
0 failed requests
5th percentile: 1.9091185569763183
10th percentile: 1.9747605800628663
20th percentile: 2.773955774307251
30th percentile: 2.992962956428528
40th percentile: 3.067827844619751
50th percentile: 3.1182111501693726
60th percentile: 3.1391605377197265
70th percentile: 3.195191669464111
80th percentile: 3.3473357200622558
90th percentile: 3.672440791130066
95th percentile: 3.8715326905250547
99th percentile: 4.030806210041046
mean time: 3.017813467979431
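The percentile and mean figures above are consistent with linear interpolation between the sorted latencies of the ten requests. The following is a minimal sketch for recomputing them; the `percentile` helper is hypothetical and written here for illustration, not taken from the StressChecker stage.

```python
import math

# Latencies (seconds) copied from the ten "Received healthy response" lines.
LATENCIES = [
    3.1252145767211914, 3.1600794792175293, 1.8434765338897705,
    3.6281981468200684, 4.070624589920044, 3.2771201133728027,
    3.002758026123047, 1.9893476963043213, 3.1112077236175537,
    2.9701077938079834,
]


def percentile(values, q):
    """Return the q-th percentile (0-100) using linear interpolation."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0
    lower = math.floor(rank)
    upper = math.ceil(rank)
    # Interpolate between the two neighbouring order statistics.
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


mean_time = sum(LATENCIES) / len(LATENCIES)
p50 = percentile(LATENCIES, 50)
p99 = percentile(LATENCIES, 99)
```

With this interpolation rule, the median is the average of the fifth and sixth sorted latencies, and the 99th percentile falls between the two slowest requests.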
Pipeline stage StressChecker completed in 31.49s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_titor_2025-12-14 status is now deployed due to DeploymentManager action
function_titor_2025-12-14 status is now inactive due to auto deactivation removed underperforming models