Weighted aggregation: scores segments with embedding similarity, then rolls them up by duration. Full-video
score = longer scenes contribute more than shorter scenes.
Evidence-aware: scores each available evidence track, combines tracks by the weights below, then rolls
up by duration. If only Transcript and Visual description exist, the score is normalized
over those two weights. Use after evidence-aware scoring has run, otherwise pages may have
no score for that source.