developer_uid: chai_backend_admin
submission_id: function_ramaf_2025-05-22
model_name: function_ramaf_2025-05-22
model_group:
status: torndown
timestamp: 2025-05-22T00:00:18+00:00
num_battles: 6699
num_wins: 3562
celo_rating: 1302.21
family_friendly_score: 0.5224
family_friendly_standard_error: 0.007063968289849552
submission_type: function
display_name: function_ramaf_2025-05-22
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-05-21
win_ratio: 0.5317211524108075
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.329707622528076s
Received healthy response to inference request in 1.5064568519592285s
Received healthy response to inference request in 3.870911121368408s
Received healthy response to inference request in 2.580807685852051s
Received healthy response to inference request in 3.0691962242126465s
5 requests
0 failed requests
5th percentile: 1.721327018737793
10th percentile: 1.9361971855163573
20th percentile: 2.365937519073486
30th percentile: 2.67848539352417
40th percentile: 2.873840808868408
50th percentile: 3.0691962242126465
60th percentile: 3.3898821830749513
70th percentile: 3.7105681419372556
80th percentile: 3.962670421600342
90th percentile: 4.146189022064209
95th percentile: 4.237948322296143
99th percentile: 4.311355762481689
mean time: 3.071415901184082
%s, retrying in %s seconds...
Received healthy response to inference request in 4.438126802444458s
Received healthy response to inference request in 2.6338937282562256s
Received healthy response to inference request in 2.5220935344696045s
Received healthy response to inference request in 2.7696425914764404s
Received healthy response to inference request in 3.3841044902801514s
5 requests
0 failed requests
5th percentile: 2.544453573226929
10th percentile: 2.566813611984253
20th percentile: 2.611533689498901
30th percentile: 2.6610435009002686
40th percentile: 2.7153430461883543
50th percentile: 2.7696425914764404
60th percentile: 3.015427350997925
70th percentile: 3.261212110519409
80th percentile: 3.5949089527130127
90th percentile: 4.016517877578735
95th percentile: 4.227322340011597
99th percentile: 4.395965909957885
mean time: 3.149572229385376
Pipeline stage StressChecker completed in 33.37s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.59s
Shutdown handler de-registered
function_ramaf_2025-05-22 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2734.30s
Shutdown handler de-registered
function_ramaf_2025-05-22 status is now inactive due to auto deactivation removed underperforming models
function_ramaf_2025-05-22 status is now torndown due to DeploymentManager action