function_fotub_2025-04-11

developer_uid: chai_backend_admin

submission_id: function_fotub_2025-04-11

model_name: function_fotub_2025-04-11

model_group:

status: torndown

timestamp: 2025-04-11T15:42:26+00:00

num_battles: 6423

num_wins: 3187

celo_rating: 1279.45

family_friendly_score: 0.5584

family_friendly_standard_error: 0.007022669577874214

submission_type: function

display_name: function_fotub_2025-04-11

is_internal_developer: True

ranking_group: single

us_pacific_date: 2025-04-11

win_ratio: 0.49618558306087496

generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}

formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}

Resubmit model

Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.586843729019165s
Received healthy response to inference request in 3.1063196659088135s
Received healthy response to inference request in 5.241745710372925s
Received healthy response to inference request in 2.587505340576172s
Received healthy response to inference request in 3.6540322303771973s
5 requests
0 failed requests
5th percentile: 2.6912682056427
10th percentile: 2.7950310707092285
20th percentile: 3.002556800842285
30th percentile: 3.21586217880249
40th percentile: 3.434947204589844
50th percentile: 3.6540322303771973
60th percentile: 4.027156829833984
70th percentile: 4.4002814292907715
80th percentile: 4.717824125289917
90th percentile: 4.979784917831421
95th percentile: 5.1107653141021725
99th percentile: 5.215549631118774
mean time: 3.8352893352508546
%s, retrying in %s seconds...
Received healthy response to inference request in 1.9036824703216553s
Received healthy response to inference request in 2.7206263542175293s
Received healthy response to inference request in 4.620314121246338s
Received healthy response to inference request in 3.6177189350128174s
Received healthy response to inference request in 1.6979975700378418s
5 requests
0 failed requests
5th percentile: 1.7391345500946045
10th percentile: 1.7802715301513672
20th percentile: 1.8625454902648926
30th percentile: 2.06707124710083
40th percentile: 2.3938488006591796
50th percentile: 2.7206263542175293
60th percentile: 3.0794633865356444
70th percentile: 3.4383004188537596
80th percentile: 3.8182379722595217
90th percentile: 4.219276046752929
95th percentile: 4.419795083999634
99th percentile: 4.580210313796997
mean time: 2.9120678901672363
Pipeline stage StressChecker completed in 35.91s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.68s
Shutdown handler de-registered
function_fotub_2025-04-11 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3030.42s
Shutdown handler de-registered
function_fotub_2025-04-11 status is now inactive due to auto deactivation removed underperforming models
function_fotub_2025-04-11 status is now torndown due to DeploymentManager action