developer_uid: chai_evaluation_service
submission_id: function_tekim_2025-12-13
model_name: richard
model_group:
status: inactive
timestamp: 2025-12-13T08:17:10+00:00
num_battles: 8655
num_wins: 4417
celo_rating: 1256.32
family_friendly_score: 0.0
family_friendly_standard_error: 0.0
submission_type: function
display_name: richard
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-13
win_ratio: 0.5103408434430965
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.158310890197754s
Received healthy response to inference request in 2.6528141498565674s
Received healthy response to inference request in 2.7193238735198975s
Received healthy response to inference request in 3.0070888996124268s
Received healthy response to inference request in 2.6147401332855225s
Received healthy response to inference request in 2.970808506011963s
Received healthy response to inference request in 2.1289150714874268s
Received healthy response to inference request in 3.0783531665802s
Received healthy response to inference request in 2.903221607208252s
Received healthy response to inference request in 2.6580722332000732s
10 requests
0 failed requests
5th percentile: 2.142143189907074
10th percentile: 2.155371308326721
20th percentile: 2.523454284667969
30th percentile: 2.641391944885254
40th percentile: 2.655968999862671
50th percentile: 2.6886980533599854
60th percentile: 2.792882966995239
70th percentile: 2.9234976768493652
80th percentile: 2.9780645847320555
90th percentile: 3.014215326309204
95th percentile: 3.046284246444702
99th percentile: 3.0719393825531007
mean time: 2.6891648530960084
Pipeline stage StressChecker completed in 29.97s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.58s
Shutdown handler de-registered
function_tekim_2025-12-13 status is now deployed due to DeploymentManager action
function_tekim_2025-12-13 status is now inactive due to auto deactivation removed underperforming models