Executive Summary – 1a. Early and Late Fusion

Key Concepts

StrategyWhere They MergeProsCons
Early FusionBefore feature extractionCaptures cross-modal cues → top accuracyRequires tight spatial alignment & heavier model
Late FusionNear classifier headModular, sensors can fail independentlyRisks missing joint correlations

Workflow

  1. Load & augment RGB + LiDAR data.
  2. Train single-modal baselines (RGB-ResNet, LiDAR-PointNet).
  3. Implement fusion networks (early & late).
  4. Compare accuracy, convergence speed & saliency maps.

Results

ModelTest Accuracy
RGB baseline81 %
LiDAR baseline74 %
Late Fusion86 %
Early Fusion88 %
Early fusion wins but costs ≈ 15 % more FLOPs.

Practical Insights


The notebook includes an animated GIF that spins a LiDAR point-cloud, colour-coded by model confidence, to visualise attention.