developer_uid: chai_backend_admin
submission_id: function_mugim_2024-12-08
model_name: retune_with_base
model_group:
status: inactive
timestamp: 2024-12-08T05:27:09+00:00
num_battles: 10661
num_wins: 5629
celo_rating: 1278.38
family_friendly_score: 0.5838
family_friendly_standard_error: 0.006971048127792549
submission_type: function
display_name: retune_with_base
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-12-07
win_ratio: 0.5279992496013507
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
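The formatter entry above defines five string templates. A minimal sketch of how they could be combined into one model input follows, assuming the pieces are concatenated in the order memory, prompt, chat turns, response stub. That ordering, the `build_prompt` helper, and the sample names and messages are illustrative assumptions, not taken from the pipeline. Truncation to `max_input_tokens` is left out.

```python
# Template strings copied from the formatter entry above.
formatter = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical helper: join the templates into a single input string.

    turns is a list of (speaker, message) pairs, speaker being "bot" or "user".
    The concatenation order is an assumption.
    """
    parts = [
        formatter["memory_template"].format(bot_name=bot_name, memory=memory),
        formatter["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                formatter["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                formatter["user_template"].format(user_name=user_name, message=message)
            )
    # The response stub ends without a newline so the model continues the bot's line.
    parts.append(formatter["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Made-up conversation, for illustration only.
example = build_prompt(
    bot_name="Aria",
    user_name="Sam",
    memory="A cheerful librarian.",
    prompt="Aria greets a visitor at the front desk.",
    turns=[("bot", "Welcome in!"), ("user", "Hi, I need a book.")],
)
print(example)
```

Because '\n' is listed in stopping_words, a generation under this layout would end at the first line break, which fits the single-line turn format of the bot and user templates.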
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
HTTPConnectionPool(host='guanaco-submitter.guanaco-backend.k2.chaiverse.com', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 2.9735097885131836s
Received healthy response to inference request in 2.686051845550537s
Received healthy response to inference request in 3.0546858310699463s
Received healthy response to inference request in 2.7062954902648926s
5 requests
1 failed requests
5th percentile: 2.690100574493408
10th percentile: 2.694149303436279
20th percentile: 2.7022467613220216
30th percentile: 2.7597383499145507
40th percentile: 2.8666240692138674
50th percentile: 2.9735097885131836
60th percentile: 3.005980205535889
70th percentile: 3.0384506225585937
80th percentile: 6.471195363998416
90th percentile: 13.30421442985535
95th percentile: 16.72072396278381
99th percentile: 19.453931589126586
mean time: 6.311555290222168
%s, retrying in %s seconds...
Received healthy response to inference request in 3.081913709640503s
Received healthy response to inference request in 3.2119638919830322s
Received healthy response to inference request in 2.711616277694702s
Received healthy response to inference request in 3.0416245460510254s
Received healthy response to inference request in 3.3784894943237305s
5 requests
0 failed requests
5th percentile: 2.777617931365967
10th percentile: 2.8436195850372314
20th percentile: 2.9756228923797607
30th percentile: 3.049682378768921
40th percentile: 3.065798044204712
50th percentile: 3.081913709640503
60th percentile: 3.1339337825775146
70th percentile: 3.1859538555145264
80th percentile: 3.245269012451172
90th percentile: 3.3118792533874513
95th percentile: 3.3451843738555906
99th percentile: 3.3718284702301027
mean time: 3.085121583938599
Pipeline stage StressChecker completed in 49.52s
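The percentile and mean figures reported by StressChecker are consistent with linear interpolation between the sorted latencies, which is the default method of NumPy's percentile function. Whether the pipeline actually uses NumPy is an assumption. The standard-library sketch below recomputes the statistics for the second run from the five latencies logged above; the `percentile` helper is illustrative.

```python
import math


def percentile(values, q):
    """Linear-interpolation percentile of a non-empty list, q in [0, 100]."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    frac = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * frac


# Latencies in seconds from the second StressChecker run (0 failed requests).
latencies = [
    3.081913709640503,
    3.2119638919830322,
    2.711616277694702,
    3.0416245460510254,
    3.3784894943237305,
]

for q in (5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

The same arithmetic applied to the first run implies that the failed request took roughly 20.1 seconds, in line with the 20-second read timeout in the log: the reported mean of 6.31 s over 5 requests, minus the four healthy latencies, leaves about 20.14 s.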
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.34s
Shutdown handler de-registered
function_mugim_2024-12-08 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3221.98s
Shutdown handler de-registered
function_mugim_2024-12-08 status is now inactive due to auto deactivation removed underperforming models