developer_uid: chai_backend_admin
submission_id: function_refok_2025-05-09
model_name: function_refok_2025-05-09
model_group:
status: torndown
timestamp: 2025-05-09T02:10:16+00:00
num_battles: 6223
num_wins: 3195
celo_rating: 1297.79
family_friendly_score: 0.5327999999999999
family_friendly_standard_error: 0.007055836732804976
submission_type: function
display_name: function_refok_2025-05-09
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-05-08
win_ratio: 0.5134179656114414
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
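The `formatter` templates above are ordinary Python format strings. The following is a minimal sketch of how they might be combined into a single model input. The assembly order (memory, prompt, chat turns, response header), the helper name `build_prompt`, and the sample names and messages are all assumptions for illustration; this record does not confirm how the backend actually concatenates them.

```python
# Templates copied from the `formatter` field of this record.
FORMATTER = {
    "memory_template": "### Instruction:\n{memory}\n",
    "prompt_template": "### Input:\n{prompt}\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "### Response:\n{bot_name}:",
}


def build_prompt(memory, prompt, turns, bot_name, user_name):
    """Hypothetical helper: fill each template and concatenate the pieces.

    `turns` is a list of (speaker, message) pairs, speaker being "bot" or "user".
    The concatenation order is an assumption, not taken from the record.
    """
    parts = [
        FORMATTER["memory_template"].format(memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response header ends with "{bot_name}:" so the model continues the bot's turn.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Made-up sample conversation, purely illustrative.
example = build_prompt(
    memory="Ada is a patient tutor.",
    prompt="A study session.",
    turns=[("user", "Hi"), ("bot", "Hello!")],
    bot_name="Ada",
    user_name="Sam",
)
print(example)
```

With `stopping_words` set to `['\n']` and `max_output_tokens` set to 64 in `generation_params`, a completion of such a prompt would presumably end at the first newline, that is, after a single bot message.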
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 2.1621358394622803s
Received healthy response to inference request in 2.3691744804382324s
Received healthy response to inference request in 2.6203901767730713s
Received healthy response to inference request in 2.287493944168091s
Received healthy response to inference request in 2.264723539352417s
5 requests
0 failed requests
5th percentile: 2.182653379440308
10th percentile: 2.203170919418335
20th percentile: 2.2442059993743895
30th percentile: 2.2692776203155516
40th percentile: 2.278385782241821
50th percentile: 2.287493944168091
60th percentile: 2.3201661586761473
70th percentile: 2.352838373184204
80th percentile: 2.4194176197052
90th percentile: 2.5199038982391357
95th percentile: 2.5701470375061035
99th percentile: 2.6103415489196777
mean time: 2.3407835960388184
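The percentile and mean values above are consistent with linear interpolation between the sorted latencies, which is also the default method of `numpy.percentile`. Whether the StressChecker stage actually uses NumPy is an assumption. A minimal standard-library sketch that recomputes the figures from the five logged response times:

```python
import math

# Latencies (seconds) copied from the five "healthy response" lines above.
latencies = [
    2.1621358394622803,
    2.3691744804382324,
    2.6203901767730713,
    2.287493944168091,
    2.264723539352417,
]


def percentile(values, q):
    """Percentile q (0-100) using linear interpolation between order statistics."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


for q in (5, 50, 90, 99):
    print(f"{q}th percentile: {percentile(latencies, q):.6f}")
print(f"mean time: {sum(latencies) / len(latencies):.6f}")
```

With only five samples, the upper percentiles are interpolated almost entirely between the two slowest requests, so they say little about tail latency under real load.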
Pipeline stage StressChecker completed in 12.78s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.70s
Shutdown handler de-registered
function_refok_2025-05-09 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3192.92s
Shutdown handler de-registered
function_refok_2025-05-09 status is now inactive due to auto deactivation removed underperforming models
function_refok_2025-05-09 status is now torndown due to DeploymentManager action
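The summary fields at the top of this record can be cross-checked against each other. The sketch below recomputes `win_ratio` from `num_wins` and `num_battles`, and estimates the sample size implied by `family_friendly_standard_error`. The binomial standard-error formula is an assumption about how that field was derived, and the resulting sample size is inferred, not stated anywhere in the record.

```python
import math

# Values copied from the metadata fields of this record.
num_battles = 6223
num_wins = 3195
reported_win_ratio = 0.5134179656114414
ff_score = 0.5327999999999999
reported_ff_stderr = 0.007055836732804976

# win_ratio should equal num_wins / num_battles.
win_ratio = num_wins / num_battles

# Assumption: the standard error is sqrt(p * (1 - p) / n) for a proportion p.
# Solving for n gives the sample size that would produce the reported value.
implied_n = ff_score * (1 - ff_score) / reported_ff_stderr ** 2
stderr_at_n = math.sqrt(ff_score * (1 - ff_score) / round(implied_n))

print(f"win_ratio: {win_ratio:.6f}")
print(f"implied sample size: {implied_n:.1f}")
print(f"standard error at n={round(implied_n)}: {stderr_at_n:.6f}")
```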