
Abstract
Standard object detectors typically treat architectural elements independently, often resulting in facade parsings that lack the structural coherence required for downstream procedural reconstruction. We address this limitation by augmenting the YOLOv8 training objective with a custom lightweight alignment loss. This regularization encourages grid-consistent arrangements of bounding boxes during training, effectively injecting geometric priors without altering the standard inference pipeline. Experiments on the CMP dataset demonstrate that our method successfully improves structural regularity, correcting alignment errors caused by perspective and occlusion while maintaining a controllable trade-off with standard detection accuracy.
Video
Models
Below are models for all the tested values of threshold T and weight W (see the paper for details).
| Threshold T | |||||||
|---|---|---|---|---|---|---|---|
| Weight W | 6 | 7 | 8 | 9 | 10 | 11 | 12 |
| 0.1 | .pt | .pt | .pt | .pt | .pt | .pt | .pt |
| 0.25 | .pt | .pt | .pt | .pt | .pt | .pt | .pt |
| 0.5 | .pt | .pt | .pt | .pt | .pt | .pt | .pt |
| 0.75 | .pt | .pt | .pt | .pt | .pt | .pt | .pt |
| 1.0 | .pt | .pt | .pt | .pt | .pt | .pt | .pt |
BibTeX
TBD