submission_id: blend_furos_2024-11-15
developer_uid: chai_backend_admin
celo_rating: 1263.61
display_name: blend_furos_2024-11-15
family_friendly_score: 0.5738
family_friendly_standard_error: 0.006993619377689924
is_internal_developer: True
language_model: chaiml-elo-alignment-run-3_v48,chaiml-lexical-nemo-v4-1k1e5_v10,chaiml-lexical-nemov8-1k1e5_v14,chaiml-virgo-edit-v1-1e5_v9,sao10k-mn-12b-lyra-v4a1_v12,chaiml-nemo-20241017-tie_8098_v3
model_group:
model_name: blend_furos_2024-11-15
model_size: n/a
num_battles: 9265
num_wins: 4834
ranking_group: blended
reward_model: random
status: inactive
submission_type: blend
submissions: ['chaiml-elo-alignment-run-3_v48', 'chaiml-lexical-nemo-v4-1k1e5_v10', 'chaiml-lexical-nemov8-1k1e5_v14', 'chaiml-virgo-edit-v1-1e5_v9', 'sao10k-mn-12b-lyra-v4a1_v12', 'chaiml-nemo-20241017-tie_8098_v3']
timestamp: 2024-11-15T17:12:16+00:00
us_pacific_date: 2024-11-15
win_ratio: 0.5217485159201295
Resubmit model
Shutdown handler not registered because Python interpreter is not running in the main thread
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyTriggerPipeline
run_pipeline:run_in_cloud %s
starting trigger_guanaco_pipeline args=%s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
Connection pool is full, discarding connection: %s. Connection pool size: %s
triggered trigger_guanaco_pipeline args=%s
Pipeline stage OfflineFamilyFriendlyTriggerPipeline completed in 5.44s
Shutdown handler de-registered
blend_furos_2024-11-15 status is now deployed due to DeploymentManager action
Shutdown handler registered
run pipeline %s
run pipeline stage %s
Running pipeline stage OfflineFamilyFriendlyScorer
Evaluating %s Family Friendly Score with %s threads
Pipeline stage OfflineFamilyFriendlyScorer completed in 2284.09s
Shutdown handler de-registered
blend_furos_2024-11-15 status is now inactive due to auto deactivation removed underperforming models