Structured Instance Understanding with Boundary Box Relationships
We introduce a post-processing validator that reasons over geometric relationships of predicted bounding boxes to detect inconsistent object structures, improving reliability across industrial object detection pipelines.
Structured Consistency for Object Detection
Bounding boxes alone provide limited structural guarantees. SIU elevates detection pipelines by verifying that inferred components respect the spatial grammar of the target system.
Key Contributions
Relational Validator
Encodes canonical spatial relationships between parts and flags detections that violate structural constraints.
Lightweight Deployment
Adds <1 second runtime overhead, enabling integration in latency-sensitive production systems.
Robust Gains
Delivers up to 25% improvement in structural reliability by reducing misassembled detections 60–80%.
Model Agnostic
Easily plugs into any object detector, allowing teams to elevate quality control without retraining.
Validation Workflow
SIU integrates after the detector, composing a three-phase pipeline that contextualizes bounding boxes before final approval.
Graph Construction
Convert bounding boxes into a relational graph where nodes capture object categories and edges encode adjacency, alignment, and distance statistics.
- Derive canonical anchor points for each object class.
- Compute directional relationships and scale-normalized gaps.
Constraint Evaluation
Apply domain-specific rules to the graph to score structural plausibility using statistical thresholds.
- Check mandatory part presence and configuration.
- Measure pairwise deviations against learned tolerances.
Decision & Feedback
Flag suspect detections and produce diagnostics to trigger downstream human review or automatic retries.
- Emit structured failure codes for interpretability.
- Provide actionable feedback for detector refinement.
Experimental Results
Evaluation on the Car Parts dataset with 3,291 images across 20 categories demonstrates substantial accuracy and reliability gains.
Metric | Value | Description |
---|---|---|
Accuracy | 85–95% | Overall classification accuracy on structural validity labels. |
Precision | 85–92% | Rate of correct structure predictions among accepted detections. |
Recall | 88–95% | True positive rate of structural anomaly detection. |
F1 Score | 85–92% | Harmonic mean capturing overall balance. |
ROC AUC | 0.90–0.95 | Area under the ROC curve across operating points. |
Reproduce & Extend
Get started with the open-source implementation, documentation, and dataset to adapt SIU for your domain.
Citation
If you use this work in your research, please cite the following.
@inproceedings{siu2024, title = {Structured Instance Understanding with Boundary Box Relationships in Object Detection System}, author = {Vorathammathorn, Supasate and Angsarawanee, Thanatwit and Tasanangam, Sakol and Sakdejayont, Theerat}, booktitle = {Proceedings of the ACM Conference}, year = {2024}, doi = {10.1145/3643487.3662729} }