developer_uid: azuruce
submission_id: function_kehob_2025-01-14
model_name: microsoft_phi4
model_group:
status: torndown
timestamp: 2025-01-14T22:07:02+00:00
num_battles: 3875
num_wins: 1288
celo_rating: 1138.41
family_friendly_score: 0.6042000000000001
family_friendly_standard_error: 0.006915813184290045
submission_type: function
display_name: microsoft_phi4
is_internal_developer: True
ranking_group: single
us_pacific_date: 2025-01-14
win_ratio: 0.33238709677419354
generation_params: {'temperature': 1.0, 'top_p': 1.0, 'min_p': 0.0, 'top_k': 100, 'presence_penalty': 0.0, 'frequency_penalty': 0.0, 'stopping_words': ['You:', 'Me:', 'User:'], 'max_input_tokens': 1024, 'best_of': 2, 'max_output_tokens': 68}
formatter: {'memory_template': "{bot_name}'s Persona: {memory}\n####\n", 'prompt_template': '{prompt}\n<START>\n', 'bot_template': '{bot_name}: {message}\n', 'user_template': '{user_name}: {message}\n', 'response_template': '{bot_name}:', 'truncate_by_message': False}
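The formatter entry above defines five string templates. The following is a minimal sketch of how they could be joined into one prompt, assuming the order persona block, scenario prompt, chat turns, response cue. The function `build_prompt`, the turn representation, and the sample names are hypothetical and are not taken from the serving code. Truncation to `max_input_tokens` is not modelled here.

```python
# Templates copied from the formatter entry above.
FORMATTER = {
    "memory_template": "{bot_name}'s Persona: {memory}\n####\n",
    "prompt_template": "{prompt}\n<START>\n",
    "bot_template": "{bot_name}: {message}\n",
    "user_template": "{user_name}: {message}\n",
    "response_template": "{bot_name}:",
}


def build_prompt(bot_name, user_name, memory, prompt, turns):
    """Hypothetical assembly order: persona, scenario, chat turns, response cue.

    `turns` is a list of (speaker, message) pairs, speaker being "bot" or "user".
    """
    parts = [
        FORMATTER["memory_template"].format(bot_name=bot_name, memory=memory),
        FORMATTER["prompt_template"].format(prompt=prompt),
    ]
    for speaker, message in turns:
        if speaker == "bot":
            parts.append(
                FORMATTER["bot_template"].format(bot_name=bot_name, message=message)
            )
        else:
            parts.append(
                FORMATTER["user_template"].format(user_name=user_name, message=message)
            )
    # The response cue ends without a newline so the model continues the bot's line.
    parts.append(FORMATTER["response_template"].format(bot_name=bot_name))
    return "".join(parts)


# Made-up sample values, for illustration only.
example = build_prompt(
    bot_name="Ava",
    user_name="Sam",
    memory="A cheerful guide.",
    prompt="Ava greets a visitor.",
    turns=[("user", "Hi!"), ("bot", "Hello there."), ("user", "How are you?")],
)
print(example)
```

Under this reading, the `stopping_words` in `generation_params` ('You:', 'Me:', 'User:') would cut generation off when the model starts writing the next speaker's turn.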
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage StressChecker
Received healthy response to inference request in 3.87394118309021s
Received healthy response to inference request in 2.1248297691345215s
Received healthy response to inference request in 3.9906885623931885s
Received healthy response to inference request in 2.0873281955718994s
Received healthy response to inference request in 1.975982904434204s
5 requests
0 failed requests
5th percentile: 1.9982519626617432
10th percentile: 2.0205210208892823
20th percentile: 2.0650591373443605
30th percentile: 2.0948285102844237
40th percentile: 2.109829139709473
50th percentile: 2.1248297691345215
60th percentile: 2.824474334716797
70th percentile: 3.524118900299072
80th percentile: 3.897290658950806
90th percentile: 3.943989610671997
95th percentile: 3.9673390865325926
99th percentile: 3.986018667221069
mean time: 2.8105541229248048
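The percentile and mean figures above are consistent with linear interpolation between the sorted response times of the five requests. The following is a minimal sketch that reproduces them from the logged latencies. The helper `percentile` is hypothetical, and whether StressChecker computes its statistics this way is an assumption.

```python
import math

# Response times (seconds) copied from the five "healthy response" log lines.
latencies = [
    3.87394118309021,
    2.1248297691345215,
    3.9906885623931885,
    2.0873281955718994,
    1.975982904434204,
]


def percentile(values, q):
    """Return the q-th percentile using linear interpolation between order statistics."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * q / 100.0  # fractional index into the sorted list
    lower = math.floor(rank)
    upper = math.ceil(rank)
    fraction = rank - lower
    return ordered[lower] + (ordered[upper] - ordered[lower]) * fraction


mean_time = sum(latencies) / len(latencies)

for q in (5, 50, 60, 95, 99):
    print(f"{q}th percentile: {percentile(latencies, q)}")
print(f"mean time: {mean_time}")
```

With only five samples, every percentile above the 75th is an interpolation between the two slowest requests, so the tail figures say little about behaviour under sustained load.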
Pipeline stage StressChecker completed in 15.74s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 1.08s
Shutdown handler de-registered
function_kehob_2025-01-14 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 3539.42s
Shutdown handler de-registered
function_kehob_2025-01-14 status is now inactive due to auto deactivation: removed underperforming models
function_kehob_2025-01-14 status is now torndown due to DeploymentManager action