developer_uid: chai_backend_admin
submission_id: function_kukur_2024-12-15
model_name: avian-mistral
model_group:
status: inactive
timestamp: 2024-12-15T01:33:43+00:00
num_battles: 9840
num_wins: 5684
celo_rating: 1318.02
family_friendly_score: 0.585
family_friendly_standard_error: 0.00696814178960216
submission_type: function
display_name: avian-mistral
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-12-14
win_ratio: 0.5776422764227642
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>', 'Bot:', 'User:', 'You:', 'Me:', '####'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
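The derived fields in the metadata above can be cross-checked from the raw counts. The following is a minimal Python sketch, not the platform's own code: `win_ratio` equals `num_wins / num_battles`, and the reported `family_friendly_standard_error` is numerically consistent with a binomial standard error sqrt(p * (1 - p) / n) at n = 5000. The sample size of 5000 is inferred from the numbers and is not stated anywhere in the log; the function names are illustrative.

```python
import math


def win_ratio(num_wins: int, num_battles: int) -> float:
    """Fraction of battles won."""
    if num_battles <= 0:
        raise ValueError("num_battles must be positive")
    return num_wins / num_battles


def binomial_standard_error(p: float, n: int) -> float:
    """Standard error of a proportion p estimated from n samples."""
    if n <= 0:
        raise ValueError("n must be positive")
    return math.sqrt(p * (1.0 - p) / n)


if __name__ == "__main__":
    # Values copied from the metadata block.
    print(win_ratio(5684, 9840))  # compare with win_ratio
    # n = 5000 is an ASSUMPTION inferred from the reported standard error.
    print(binomial_standard_error(0.585, 5000))  # compare with family_friendly_standard_error
```

If the scorer used a different sample size or a different estimator, the second check would not match, so the agreement is suggestive rather than conclusive.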
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 1.5122971534729004s
Received healthy response to inference request in 1.1275129318237305s
Received healthy response to inference request in 1.1312854290008545s
Received healthy response to inference request in 0.9058616161346436s
Received healthy response to inference request in 3.730339527130127s
5 requests
0 failed requests
5th percentile: 0.950191879272461
10th percentile: 0.9945221424102784
20th percentile: 1.0831826686859132
30th percentile: 1.1282674312591552
40th percentile: 1.1297764301300048
50th percentile: 1.1312854290008545
60th percentile: 1.2836901187896728
70th percentile: 1.436094808578491
80th percentile: 1.955905628204346
90th percentile: 2.8431225776672364
95th percentile: 3.2867310523986815
99th percentile: 3.641617832183838
mean time: 1.681459331512451
Pipeline stage StressChecker completed in 9.53s
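The StressChecker percentiles above can be reproduced from the five logged response times. The sketch below assumes linear interpolation between order statistics (the default behaviour of `numpy.percentile`); the log does not say which library produced the figures, and the helper name `percentile` is illustrative.

```python
import math

# Response times in seconds, copied from the five "healthy response" lines.
LATENCIES = [
    1.5122971534729004,
    1.1275129318237305,
    1.1312854290008545,
    0.9058616161346436,
    3.730339527130127,
]


def percentile(samples, q):
    """q-th percentile (0..100) using linear interpolation between ranks."""
    if not samples:
        raise ValueError("samples must not be empty")
    ordered = sorted(samples)
    # Fractional rank into the sorted list, from 0 to len - 1.
    rank = (q / 100.0) * (len(ordered) - 1)
    lower = int(math.floor(rank))
    upper = min(lower + 1, len(ordered) - 1)
    fraction = rank - lower
    return ordered[lower] + fraction * (ordered[upper] - ordered[lower])


def mean(samples):
    """Arithmetic mean of the samples."""
    return sum(samples) / len(samples)


if __name__ == "__main__":
    for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
        print(f"{q}th percentile: {percentile(LATENCIES, q)}")
    print(f"mean time: {mean(LATENCIES)}")
```

With only five samples, the upper percentiles are dominated by the single 3.73 s response, so the 95th and 99th values are interpolations toward that one outlier rather than stable tail estimates.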
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.92s
Shutdown handler de-registered
function_kukur_2024-12-15 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 6259.16s
Shutdown handler de-registered
function_kukur_2024-12-15 status is now inactive due to auto deactivation: removed underperforming models