developer_uid: chai_backend_admin
submission_id: function_mihar_2025-12-02
model_name: function_mihar_2025-12-02
model_group:
status: inactive
timestamp: 2025-12-02T20:17:22+00:00
num_battles: 6156
num_wins: 3231
celo_rating: 1308.36
family_friendly_score: 0.5162
family_friendly_standard_error: 0.0070673553752446895
submission_type: function
display_name: function_mihar_2025-12-02
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-12-02
win_ratio: 0.5248538011695907
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.317967176437378s
Received healthy response to inference request in 3.33423113822937s
Received healthy response to inference request in 3.134035348892212s
Received healthy response to inference request in 3.6296169757843018s
Received healthy response to inference request in 3.456469774246216s
Received healthy response to inference request in 2.5100042819976807s
Received healthy response to inference request in 3.667182683944702s
Received healthy response to inference request in 3.351997137069702s
Received healthy response to inference request in 3.7173807621002197s
Received healthy response to inference request in 3.0128180980682373s
10 requests
0 failed requests
5th percentile: 2.4043838739395142
10th percentile: 2.4908005714416506
20th percentile: 2.912255334854126
30th percentile: 3.0976701736450196
40th percentile: 3.2541528224945067
50th percentile: 3.343114137649536
60th percentile: 3.3937861919403076
70th percentile: 3.5084139347076415
80th percentile: 3.637130117416382
90th percentile: 3.672202491760254
95th percentile: 3.6947916269302365
99th percentile: 3.712862935066223
mean time: 3.2131703376770018
Pipeline stage StressChecker completed in 33.47s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_mihar_2025-12-02 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2755.30s
Shutdown handler de-registered
function_mihar_2025-12-02 status is now inactive due to auto deactivation removed underperforming models