function_lojub_2025-04-30

developer_uid: chai_backend_admin

submission_id: function_lojub_2025-04-30

model_name: function_lojub_2025-04-30

model_group:

status: torndown

timestamp: 2025-04-30T01:14:05+00:00

num_battles: 7839

num_wins: 4374

celo_rating: 1327.73

family_friendly_score: 0.5618000000000001

family_friendly_standard_error: 0.007016847725296595

submission_type: function

display_name: function_lojub_2025-04-30

is_internal_developer: True

ranking_group: single

us_pacific_date: 2025-04-29

win_ratio: 0.5579793340987371

generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 40, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['\n'], 'max_input_tokens': 1024, 'best_of': 8, 'max_output_tokens': 64}

formatter: {'memory_template': '### Instruction:\n{memory}\n', 'prompt_template': '### Input:\n{prompt}\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '### Response:\n{bot_name}:', 'truncate_by_message': False}

Resubmit model

Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 4.113027095794678s
Received healthy response to inference request in 4.30736517906189s
Received healthy response to inference request in 3.6758437156677246s
Received healthy response to inference request in 2.704634428024292s
Received healthy response to inference request in 3.9616894721984863s
5 requests
0 failed requests
5th percentile: 2.8988762855529786
10th percentile: 3.093118143081665
20th percentile: 3.481601858139038
30th percentile: 3.733012866973877
40th percentile: 3.8473511695861817
50th percentile: 3.9616894721984863
60th percentile: 4.0222245216369625
70th percentile: 4.08275957107544
80th percentile: 4.1518947124481205
90th percentile: 4.229629945755005
95th percentile: 4.268497562408447
99th percentile: 4.299591655731201
mean time: 3.752511978149414
%s, retrying in %s seconds...
Received healthy response to inference request in 3.6647000312805176s
Received healthy response to inference request in 4.666570425033569s
Received healthy response to inference request in 2.511467456817627s
Received healthy response to inference request in 2.2368459701538086s
Received healthy response to inference request in 3.284442901611328s
5 requests
0 failed requests
5th percentile: 2.291770267486572
10th percentile: 2.346694564819336
20th percentile: 2.4565431594848635
30th percentile: 2.6660625457763674
40th percentile: 2.9752527236938477
50th percentile: 3.284442901611328
60th percentile: 3.436545753479004
70th percentile: 3.5886486053466795
80th percentile: 3.865074110031128
90th percentile: 4.265822267532348
95th percentile: 4.466196346282959
99th percentile: 4.626495609283447
mean time: 3.27280535697937
Pipeline stage StressChecker completed in 37.21s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 0.66s
Shutdown handler de-registered
function_lojub_2025-04-30 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2973.17s
Shutdown handler de-registered
function_lojub_2025-04-30 status is now inactive due to auto deactivation removed underperforming models
function_lojub_2025-04-30 status is now torndown due to DeploymentManager action