عنوان انگلیسی مقاله:
Weighted boxes fusion: Ensembling boxes from different object detection models
ترجمه فارسی عنوان مقاله:
همجوشی جعبه های توزین شده: جمع آوری جعبه هایی از مدل های مختلف تشخیص شیء
Sciencedirect - Elsevier - Image and Vision Computing, 107 (2021) 104117: doi:10:1016/j:imavis:2021:104117
Object detection is a crucial task in computer vision systems with a wide range of applications in autonomous driving, medical imaging, retail, security, face recognition, robotics, and others. Nowadays, neural networks- based models are used to localize and classify instances of objects of particular classes. When real-time inference is not required, ensembles of models help to achieve better results. In this work, we present a novel method for fusing predictions from different object detection models: weighted boxes fusion. Our algorithm utilizes conﬁdence scores of all proposed bounding boxes to construct averaged boxes. We tested the method on several datasets and evaluated it in the context of Open Images and COCO Object Detection challenges, achieving top results in these challenges. The 3D version of boxes fusion was successfully applied by the winning teams of Waymo Open Dataset and Lyft 3D Object Detection for Autonomous Vehicles challenges. The source code is publicly available at GitHub (Solovyev, 2019 ).We present a novel method for combining predictions in ensembles of different object detection models: weighted boxes fusion. This method signiﬁcantly improves the quality of the fused predicted rectangles for an ensemble. We tested the method on several datasets and evaluated it in the context of the Open Images and COCO Object Detection challenges. It helped to achieve top results in these challenges. The source code is publicly available at GitHub.© 2021 Published by Elsevier B.V.
Keywords: Object detection | Computer vision | Deep learning