developer_uid: chai_backend_admin
submission_id: function_negin_2024-12-15
model_name: avian-mistral
model_group:
status: inactive
timestamp: 2024-12-15T01:29:41+00:00
num_battles: 9682
num_wins: 5591
celo_rating: 1316.87
family_friendly_score: 0.5840000000000001
family_friendly_standard_error: 0.006970566691453429
submission_type: function
display_name: avian-mistral
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-12-14
win_ratio: 0.5774633340218963
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n', '</s>', 'Bot:', 'User:', 'You:', 'Me:', '####'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
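
The formatter templates above can be read as a recipe for assembling the text sent to the model. The following is a minimal sketch of that assembly, assuming the pieces are concatenated in the order memory, prompt, conversation turns, response prefix. The helper name `build_prompt`, the `turns` structure, and the concatenation order are illustrative assumptions, not taken from the pipeline's code. Truncation to `max_input_tokens` is deliberately left out.

```python
# Templates copied from the formatter entry above.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the filled-in templates into one string.

    `turns` is a list of (speaker, message) pairs, where speaker is
    "bot" or "user". The ordering of the parts is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response template leaves the bot's line open for the model to complete.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt(
    bot_name="Ava",
    user_name="Sam",
    memory="friendly",
    prompt="A chat.",
    turns=[("user", "Hi"), ("bot", "Hello")],
)
print(example)
```

Under this reading, the `'\n'`, `'User:'`, and `'####'` entries in `stopping_words` would end generation as soon as the model finishes one line of the bot's reply.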
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 1.0919132232666016s
Received healthy response to inference request in 1.3854546546936035s
Received healthy response to inference request in 2.1252830028533936s
Received healthy response to inference request in 2.6782004833221436s
5 requests
1 failed requests
5th percentile: 1.150621509552002
10th percentile: 1.2093297958374023
20th percentile: 1.326746368408203
30th percentile: 1.5334203243255615
40th percentile: 1.8293516635894775
50th percentile: 2.1252830028533936
60th percentile: 2.3464499950408935
70th percentile: 2.5676169872283934
80th percentile: 6.179054975509647
90th percentile: 13.180763959884645
95th percentile: 16.68161845207214
99th percentile: 19.48230204582214
mean time: 5.492664861679077
%s, retrying in %s seconds...
Received healthy response to inference request in 1.1416680812835693s
Received healthy response to inference request in 10.302202701568604s
Received healthy response to inference request in 1.1578290462493896s
Received healthy response to inference request in 3.089550256729126s
Received healthy response to inference request in 1.5987591743469238s
5 requests
0 failed requests
5th percentile: 1.1449002742767334
10th percentile: 1.1481324672698974
20th percentile: 1.1545968532562256
30th percentile: 1.2460150718688965
40th percentile: 1.4223871231079102
50th percentile: 1.5987591743469238
60th percentile: 2.1950756072998043
70th percentile: 2.7913920402526853
80th percentile: 4.532080745697023
90th percentile: 7.417141723632813
95th percentile: 8.859672212600707
99th percentile: 10.013696603775024
mean time: 3.4580018520355225
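
The percentile lines above are consistent with linear interpolation between sorted samples. The sketch below applies that method to the five latencies logged for the second batch. The function name `percentile` is illustrative, and whether StressChecker computes its figures this way is an assumption; the log only shows that the results agree.

```python
def percentile(samples, q):
    """Linear-interpolation percentile of `samples` for q in [0, 100]."""
    ordered = sorted(samples)
    pos = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    frac = pos - lower
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])


# Latencies (seconds) copied from the second batch of healthy responses.
latencies = [
    1.1416680812835693,
    10.302202701568604,
    1.1578290462493896,
    3.089550256729126,
    1.5987591743469238,
]

for q in (5, 50, 95):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

With only five samples, every percentile above the 75th is interpolated towards the single slowest request, so the upper tail in both batches mostly reflects one outlier.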
Pipeline stage StressChecker completed in 47.11s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.49s
Shutdown handler de-registered
function_negin_2024-12-15 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 6056.56s
Shutdown handler de-registered
function_negin_2024-12-15 status is now inactive due to auto deactivation removed underperforming models