developer_uid: rirv938
submission_id: function_saral_2024-12-17
model_name: retune_with_base
status: inactive
timestamp: 2024-12-17T17:16:16+00:00
num_battles: 11435
num_wins: 5155
celo_rating: 1217.84
family_friendly_score: 0.593
family_friendly_standard_error: 0.00694767587039004
submission_type: function
display_name: retune_with_base
is_internal_developer: True
ranking_group: single
us_pacific_date: 2024-12-17
win_ratio: 0.4508089199825098
generation_params: {'temperature': 0.9, 'top_p': 0.9, 'min_p': 0.05, 'top_k': 80, 'presence_penalty': 0.5, 'frequency_penalty': 0.5, 'stopping_words': ['\n', '</s>'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
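The formatter templates above can be read as pieces that are concatenated into a single prompt. The following is a minimal sketch of that assembly, assuming the order memory, prompt, conversation turns, response prefix. The function `build_prompt` and the `turns` structure are hypothetical, and the sketch ignores `max_input_tokens` and `truncate_by_message`, so it may differ from what the platform actually does.

```python
# Templates copied from the formatter field above.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Assemble a prompt string from the templates (hypothetical helper).

    turns is a list of (speaker, message) pairs, where speaker is
    either "bot" or "user". The assembly order is an assumption.
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response prefix has no trailing newline, so the model continues the line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


example = build_prompt("Ava", "Sam", "friendly", "Intro", [("user", "hi")])
print(example)
```

Because `stopping_words` includes `'\n'`, a completion generated from such a prompt would end at the first newline, which fits the one-message-per-line layout of the templates.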
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Connection pool is full, discarding connection: %s. Connection pool size: %s
Failed to get response for submission blend_mokul_2024-11-14: ('', 'read tcp> read: connection reset by peer\n')
HTTPConnectionPool(host='', port=80): Read timed out. (read timeout=20)
Received unhealthy response to inference request!
Received healthy response to inference request in 3.0771548748016357s
Received healthy response to inference request in 2.146874189376831s
Received healthy response to inference request in 2.3959109783172607s
Received healthy response to inference request in 3.1277153491973877s
5 requests
1 failed requests
5th percentile: 2.196681547164917
10th percentile: 2.246488904953003
20th percentile: 2.346103620529175
30th percentile: 2.532159757614136
40th percentile: 2.804657316207886
50th percentile: 3.0771548748016357
60th percentile: 3.0973790645599366
70th percentile: 3.1176032543182375
80th percentile: 6.526982402801517
90th percentile: 13.325516510009766
95th percentile: 16.72478356361389
99th percentile: 19.444197206497194
mean time: 6.174341201782227
%s, retrying in %s seconds...
Received healthy response to inference request in 2.2642428874969482s
Received healthy response to inference request in 1.6033422946929932s
Received healthy response to inference request in 2.700634241104126s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Received healthy response to inference request in 2.158437967300415s
Received healthy response to inference request in 2.7529456615448s
5 requests
0 failed requests
5th percentile: 1.7143614292144775
10th percentile: 1.825380563735962
20th percentile: 2.0474188327789307
30th percentile: 2.1795989513397216
40th percentile: 2.221920919418335
50th percentile: 2.2642428874969482
60th percentile: 2.438799428939819
70th percentile: 2.6133559703826905
80th percentile: 2.7110965251922607
90th percentile: 2.7320210933685303
95th percentile: 2.742483377456665
99th percentile: 2.7508532047271728
mean time: 2.2959206104278564
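The percentile figures in the StressChecker summaries are consistent with linear interpolation between sorted samples. The sketch below applies that method to the five latencies of the second batch. The function name `percentile` is hypothetical, and the interpolation method is inferred from the numbers rather than stated in the log.

```python
def percentile(samples, q):
    """Return the q-th percentile (0 to 100) using linear interpolation."""
    ordered = sorted(samples)
    # Fractional position of the percentile within the sorted samples.
    rank = (len(ordered) - 1) * q / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    frac = rank - lower
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])


# Latencies in seconds, copied from the second batch of healthy responses.
latencies = [
    2.2642428874969482,
    1.6033422946929932,
    2.700634241104126,
    2.158437967300415,
    2.7529456615448,
]

for q in (5, 50, 90):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {sum(latencies) / len(latencies)}")
```

In the first batch, the same method appears to count the failed request at roughly the 20-second read timeout, which would explain why the 80th percentile and above jump well past the healthy response times.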
Pipeline stage StressChecker completed in 44.70s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 2.17s
Shutdown handler de-registered
function_saral_2024-12-17 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2840.60s
Shutdown handler de-registered
function_saral_2024-12-17 status is now inactive due to auto deactivation removed underperforming models
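Two of the header fields can be checked with simple arithmetic. The sketch below recomputes `win_ratio` from `num_wins` and `num_battles`, and shows that `family_friendly_standard_error` is consistent with a binomial standard error over 5,000 samples. The sample count of 5,000 is inferred from the numbers and is not stated anywhere in the log, and `binomial_standard_error` is a hypothetical helper.

```python
import math

# Values copied from the header fields.
num_battles = 11435
num_wins = 5155
family_friendly_score = 0.593

win_ratio = num_wins / num_battles


def binomial_standard_error(p, n):
    """Standard error of a proportion p estimated from n samples."""
    return math.sqrt(p * (1.0 - p) / n)


# ASSUMPTION: 5000 samples; inferred, not stated in the log.
assumed_samples = 5000
standard_error = binomial_standard_error(family_friendly_score, assumed_samples)

print(f"win_ratio: {win_ratio}")
print(f"standard error (assuming {assumed_samples} samples): {standard_error}")
```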