Citation:

Automated optical inspection of FAST’s reflector surface using drones and computer vision


  • Light: Advanced Manufacturing  4, Article number: (2023)

  • Received: 29 August 2021
    Revised: 07 December 2022
    Accepted: 08 December 2022
    Accepted article preview online: 09 December 2022
    Published online: 05 January 2023

doi: https://doi.org/10.37188/lam.2023.001


  • [1] Kuschmierz, R. et al. Ultra-thin 3D lensless fiber endoscopy using diffractive optical elements and deep neural networks. Light: Advanced Manufacturing 2, 30 (2021).
    [2] Situ, G. H. Deep holography. Light: Advanced Manufacturing 3, 8 (2022).
    [3] Cao, W. M., Liu, Q. F. & He, Z. Q. Review of pavement defect detection methods. IEEE Access 8, 14531-14544 (2020). doi: 10.1109/ACCESS.2020.2966881
    [4] Cao, M. T. et al. Survey on performance of deep learning models for detecting road damages using multiple dashcam image resources. Advanced Engineering Informatics 46, 101182 (2020). doi: 10.1016/j.aei.2020.101182
    [5] Zhu, J. Q. et al. Pavement distress detection using convolutional neural networks with images captured via UAV. Automation in Construction 133, 103991 (2022). doi: 10.1016/j.autcon.2021.103991
    [6] Zhang, C. B., Chang, C. C. & Jamshidi, M. Concrete bridge surface damage detection using a single-stage detector. Computer-Aided Civil and Infrastructure Engineering 35, 389-409 (2020). doi: 10.1111/mice.12500
    [7] Du, F. J., Jiao, S. J. & Chu, K. L. Application research of bridge damage detection based on the improved lightweight convolutional neural network model. Applied Sciences 12, 6225 (2022). doi: 10.3390/app12126225
    [8] Liu, C. Y. et al. Insulator faults detection in aerial images from high-voltage transmission lines based on deep learning model. Applied Sciences 11, 4647 (2021). doi: 10.3390/app11104647
    [9] Liu, J. J. et al. An improved method based on deep learning for insulator fault detection in diverse aerial images. Energies 14, 4365 (2021). doi: 10.3390/en14144365
    [10] Redmon, J. & Farhadi, A. YOLOv3: an incremental improvement. Preprint at https://doi.org/10.48550/arXiv.1804.02767 (2018).
    [11] Vlaminck, M. et al. Region-based CNN for anomaly detection in PV power plants using aerial imagery. Sensors 22, 1244 (2022). doi: 10.3390/s22031244
    [12] Di Tommaso, A. et al. A multi-stage model based on YOLOv3 for defect detection in PV panels based on IR and visible imaging by unmanned aerial vehicle. Renewable Energy 193, 941-962 (2022). doi: 10.1016/j.renene.2022.04.046
    [13] Sandler, M. et al. MobileNetV2: inverted residuals and linear bottlenecks. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City, UT, USA: IEEE, 2018, 4510-4520.
    [14] He, K. M. et al. Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, NV, USA: IEEE, 2016, 770-778.
    [15] Lin, T. Y. et al. Feature pyramid networks for object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Honolulu, HI, USA: IEEE, 2017, 2117-2125.
    [16] Bell, S. et al. Inside-outside net: detecting objects in context with skip pooling and recurrent neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Las Vegas, NV, USA: IEEE, 2016, 2874-2883.
    [17] Chen, L. C. et al. Encoder-decoder with atrous separable convolution for semantic image segmentation. Proceedings of the 15th European Conference on Computer Vision. Munich, Germany: Springer, 2018, 833-851.
    [18] Vaswani, A. et al. Attention is all you need. Proceedings of the 31st International Conference on Neural Information Processing Systems. Long Beach, CA, USA: Curran Associates Inc., 2017, 6000-6010.
    [19] Wang, X. L. et al. Non-local neural networks. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City, UT, USA: IEEE, 2018, 7794-7803.
    [20] Zhu, P. F. et al. Vision meets drones: a challenge. Preprint at https://doi.org/10.48550/arXiv.1804.07437 (2018).
    [21] Lin, T. Y. et al. Microsoft COCO: common objects in context. Proceedings of the 13th European Conference on Computer Vision. Zurich, Switzerland: Springer, 2014, 740-755.
    [22] Pang, J. M. et al. Libra R-CNN: towards balanced learning for object detection. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach, CA, USA: IEEE, 2019, 821-830.
    [23] Ge, Z. et al. YOLOX: exceeding YOLO series in 2021. Preprint at https://doi.org/10.48550/arXiv.2107.08430 (2021).
    [24] Jocher, G. ultralytics/yolov5: v3.1 – Bug Fixes and Performance Improvements. at https://github.com/ultralytics/yolov5 (2020).
    [25] Wang, C. Y., Bochkovskiy, A. & Liao, H. Y. M. YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. Preprint at https://doi.org/10.48550/arXiv.2207.02696 (2022).
    [26] Feng, C. J. et al. TOOD: task-aligned one-stage object detection. Proceedings of the IEEE/CVF International Conference on Computer Vision. Montreal, QC, Canada: IEEE, 2021, 3490-3499.
    [27] Ren, S. Q. et al. Faster R-CNN: towards real-time object detection with region proposal networks. Proceedings of the 28th International Conference on Neural Information Processing Systems. Montreal, Quebec, Canada: MIT Press, 2015, 91-99.
    [28] Wang, J. Q. et al. CARAFE: content-aware reassembly of features. Proceedings of the IEEE/CVF International Conference on Computer Vision. Seoul, Korea (South): IEEE, 2019, 3007-3016.
    [29] Lu, X. et al. Grid R-CNN. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach, CA, USA: IEEE, 2019, 7355-7364.
    [30] Huang, Y. C., Chen, J. X. & Huang, D. UFPMP-Det: toward accurate and efficient object detection on drone imagery. Preprint at https://doi.org/10.48550/arXiv.2112.10415 (2021).
    [31] Yang, F. et al. Clustered object detection in aerial images. Proceedings of the IEEE/CVF International Conference on Computer Vision. Seoul, Korea (South): IEEE, 2019, 8311-8320.
    [32] Li, C. L. et al. Density map guided object detection in aerial images. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. Seattle, WA, USA: IEEE, 2020, 190-191.
    [33] Deng, S. T. et al. A global-local self-adaptive network for drone-view object detection. IEEE Transactions on Image Processing 30, 1556-1569 (2020).
    [34] Wei, Z. W. et al. AMRNet: chips augmentation in aerial images object detection. Preprint at https://doi.org/10.48550/arXiv.2009.07168 (2020).
    [35] Rossi, L., Karimi, A. & Prati, A. A novel region of interest extraction layer for instance segmentation. Proceedings of the 25th International Conference on Pattern Recognition. Milan, Italy: IEEE, 2021, 2203-2209.
    [36] Chen, Q. et al. You only look one-level feature. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Nashville, TN, USA: IEEE, 2021, 13039-13048.
    [37] Chen, K. et al. MMDetection: open MMLab detection toolbox and benchmark. Preprint at https://doi.org/10.48550/arXiv.1906.07155 (2019).
    [38] Wang, J. Q. et al. Side-aware boundary localization for more precise object detection. Proceedings of the 16th European Conference on Computer Vision. Glasgow, UK: Springer, 2020, 403-419.
    [39] Lin, T. Y. et al. Focal loss for dense object detection. Proceedings of the IEEE International Conference on Computer Vision. Venice, Italy: IEEE, 2017, 2980-2988.
    [40] Xu, H. Y. et al. Deep regionlets for object detection. Proceedings of the 15th European Conference on Computer Vision. Munich, Germany: Springer, 2018, 827-844.
    [41] Shrivastava, A. et al. Beyond skip connections: top-down modulation for object detection. Preprint at https://doi.org/10.48550/arXiv.1612.06851 (2016).
    [42] Zhang, S. F. et al. Single-shot refinement neural network for object detection. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City, UT, USA: IEEE, 2018, 4203-4212.
    [43] Zhao, Q. J. et al. M2Det: a single-shot object detector based on multi-level feature pyramid network. Proceedings of the 33rd AAAI Conference on Artificial Intelligence. Honolulu, Hawaii, USA: AAAI, 2019, 9259-9266.


Automated optical inspection of FAST’s reflector surface using drones and computer vision

  • 1. Beijing Institute of Technology, Beijing 100081, China
  • 2. National Astronomical Observatories of China, Beijing 100107, China
  • 3. Beijing Institute of Technology Chongqing Innovation Center, Chongqing 401135, China
  • Corresponding author:

    Jianan Li, lijianan@bit.edu.cn

    Tingfa Xu, ciom_xtf1@bit.edu.cn

  • These authors contributed equally: Jianan Li, Shenwang Jiang

doi: https://doi.org/10.37188/lam.2023.001

Abstract: 

The Five-hundred-meter Aperture Spherical radio Telescope (FAST) is the world's largest single-dish radio telescope. Its large reflecting surface achieves unprecedented sensitivity but is prone to damage, such as dents and holes, caused by naturally-occurring falling objects. Hence, the timely and accurate detection of surface defects is crucial for FAST's stable operation. Conventional manual inspection involves human inspectors climbing up and examining the large surface visually, a time-consuming and potentially unreliable process. To accelerate the inspection process and increase its accuracy, this work takes the first step towards automating the inspection of FAST by integrating deep-learning techniques with drone technology. First, a drone flies over the surface along a predetermined route. Since surface defects vary significantly in scale and show high inter-class similarity, directly applying existing deep detectors to drone imagery is highly prone to missing and misidentifying defects. As a remedy, we introduce cross-fusion, a dedicated plug-in operation for deep detectors that enables the adaptive fusion of multi-level features in a point-wise selective fashion, depending on local defect patterns. Consequently, strong semantics and fine-grained details are dynamically fused at different positions to support the accurate detection of defects of various scales and types. Our AI-powered, drone-based automated inspection is time-efficient, reliable, and offers good accessibility, helping to ensure the long-term and stable operation of FAST.

Research Summary

Automated Optical Inspection of FAST’s Reflector Surface using Drones and Computer Vision

The Five-hundred-meter Aperture Spherical radio Telescope (FAST) is the world's largest single-dish radio telescope. Its large reflecting surface achieves unprecedented sensitivity but is prone to damage, such as dents and holes, caused by naturally-occurring falling objects. Hence, the timely and accurate detection of surface defects is crucial for FAST's stable operation. To accelerate the inspection process and increase its accuracy, Jianan Li and Tingfa Xu from China's Beijing Institute of Technology take the first step towards automating the inspection of FAST by integrating deep-learning techniques with drone technology. Using cross-fusion, a dedicated plug-in operation for deep detectors, multi-level features are fused point-wise to support the accurate detection of defects of various scales and types. The AI-powered, drone-based automated inspection is time-efficient, reliable, and offers good accessibility, helping to ensure the long-term operation of FAST.


    • Observing electromagnetic waves is a core challenge in astronomy. Stars, galaxies, and other astronomical objects emit light from across the spectrum, including visible light, radio waves, and infrared radiation. Earth-based radio telescopes study radio waves emitted by extra-terrestrial sources. The most familiar type of radio telescope is the steerable paraboloid, a device with a parabolically shaped reflector (the dish) that focuses incoming radio waves onto a small pickup antenna. The radio signals are then amplified to a measurable level. Naturally occurring radio waves are extremely weak by the time they reach Earth; even a cell-phone signal is billions of times more powerful than the cosmic waves received by telescopes. To improve radio-wave detection, and thus probe further into space, larger reflector dishes must be built to capture more radio waves.

      The Five-hundred-meter Aperture Spherical radio Telescope (FAST), also known as the “China Sky Eye”, is the world’s largest single-dish radio telescope. Its optical geometry is outlined in Fig. 1a. The reflector is a partial sphere of radius R = 300 m, and the planar opening of the spherical cap has a diameter of 519.6 m, 1.7 times that of the previously largest radio telescope. The large reflecting surface makes FAST the world’s most sensitive radio telescope. Astronomers have used it to observe, for the first time, fast radio bursts in the Milky Way and to identify more than 500 new pulsars, four times the total number of pulsars identified by other telescopes worldwide. More interesting and exotic objects may yet be discovered using FAST.

      Fig. 1  a FAST’s optical geometry. b General view of the reflector surface. c Large-scale variation of surface defects (dent). d High inter-class similarity between different types of defects (dent and hole).

      However, every coin has two sides: a larger reflecting surface is more prone to external damage from environmental factors. The FAST reflector comprises a total of 4,450 spliced triangular panels (Fig. 1b), made of aluminium and uniformly perforated to reduce weight and wind loading. Falling objects (e.g., during extreme events such as rockfalls, severe windstorms, and hailstorms) may cause severe dents and holes in the panels. Such defects adversely impact the study of short-wavelength radio waves, which demands a perfect dish surface. Any irregularity in the parabola scatters these short waves away from the focus, causing information loss.

      The rapid detection of surface defects for timely repair is hence critical to maintaining the normal operation of FAST. Traditionally, this is done by direct visual inspection: skilled inspectors climb up the reflector and visually examine the entire surface, searching for and replacing any panels showing dents or holes. However, this procedure has several limitations. Firstly, there is danger involved in accessing hard-to-reach places high above the ground. Secondly, scrutinising thousands of panels is labour- and time-consuming. Thirdly, the procedure relies heavily on the inspectors’ expertise and is prone to human error and inconsistency.

      The remedy to the shortcomings of manual inspection at FAST is automated inspection. As a first step, we integrated deep-learning techniques with drones to automatically detect defects on the reflector surface. Specifically, we began by manually controlling a drone equipped with a high-resolution RGB camera to fly over the surface along a predetermined route. During the flight, the camera captured and recorded videos of the surface condition. Thanks to the drone's advanced flight stability, the recorded videos capture rich surface detail. Moreover, thanks to the GPS receiver and RTK module onboard the drone platform, every video frame can be tagged with the corresponding drone location with centimetre-level accuracy. The physical locations of the panels that appear in each frame can thus be determined.
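
      As a rough illustration of this geotagging step (a hypothetical sketch only: the log format, sampling rates, and the synthetic track below are our assumptions, not the actual flight logs), the timestamped RTK/GPS track can be linearly interpolated to per-frame timestamps:

```python
# Hypothetical sketch of frame geotagging: interpolate a timestamped RTK/GPS log
# to the timestamp of every extracted video frame. Field names, sampling rates,
# and the synthetic track are illustrative assumptions.
import numpy as np

def geotag_frames(log_t, log_lat, log_lon, log_alt, frame_times):
    """Linearly interpolate the drone track to per-frame timestamps (seconds)."""
    lat = np.interp(frame_times, log_t, log_lat)
    lon = np.interp(frame_times, log_t, log_lon)
    alt = np.interp(frame_times, log_t, log_alt)
    return np.stack([lat, lon, alt], axis=1)      # one (lat, lon, alt) per frame

# Example: a 30 fps video and a 10 Hz RTK log covering the same 60 s segment.
fps, duration = 30.0, 60.0
frame_times = np.arange(0.0, duration, 1.0 / fps)
log_t = np.arange(0.0, duration, 0.1)
log_lat = 25.6529 + 1e-5 * np.sin(log_t / 10.0)   # synthetic track, roughly at the FAST site
log_lon = 106.8566 + 1e-5 * np.cos(log_t / 10.0)
log_alt = np.full_like(log_t, 1.0)                # 1 m above the reflector surface
positions = geotag_frames(log_t, log_lat, log_lon, log_alt, frame_times)
print(positions.shape)                            # (1800, 3)
```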

      Next, we harnessed deep-learning techniques to detect defects in the captured videos1,2. Computer-vision technology based on deep learning has been widely used for defect detection in concrete and steel structures, such as pavement3-5 and bridges6,7. Other studies8,9 have detected insulator faults in aerial images of high-voltage transmission lines based on improved YOLOv3 models10. The automatic optical inspection of large-scale equipment surfaces, such as photovoltaic plants, using aerial imagery has proven highly effective. Vlaminck et al.11 proposed a two-stage approach to the automatic detection of anomalies in large photovoltaic sites using drone-based imaging. Similarly, Di Tommaso et al.12 presented a UAV-based inspection system for improved photovoltaic diagnostics based on a multistage architecture built on top of YOLOv310. Previous works on defect detection with aerial imagery were primarily designed to detect large defects and were not reliable for very small ones. In contrast, the present work aims to inspect the large surface of FAST from above, where the defect size is particularly small relative to the surface size. Our aim is therefore to accurately identify and locate surface defects, especially small ones, which introduces new challenges.

      An in-depth analysis of the extracted frame imagery revealed two inherent characteristics of FAST’s surface defects: (i) large-scale variations, where defects within the same category differ significantly in size (for example, dent sizes can range from 0.4 to 12 inches (Fig. 1c)); (ii) high inter-class similarity in visual appearance between defects belonging to different categories (i.e., dents and holes), which makes them difficult to distinguish (Fig. 1d). These unique characteristics strongly increase the challenge of detecting FAST’s surface defects using drone imagery, compared to well-explored general object detection in natural scenes. This therefore requires the introduction of dedicated designs to adapt existing deep detectors to the task and data of interest.

      Existing deep-learning methods usually begin by abstracting features from an input image using deep neural networks (DNNs)13,14. This yields hierarchical features with increasing semantics and receptive fields but diminishing detail. Most previous general object detectors fuse multi-level features to exploit their complementary strengths layer-wise, through addition or concatenation15-17, largely ignoring the spatial distinctiveness within an image. However, in our case, even adjacent positions in the captured drone imagery may contain defects of significantly different scales and types. Hence, they require features from different levels to support accurate localisation and recognition. For example, low-level details are preferable for small defects to facilitate localisation, whereas high-level features are better for large or hard-to-distinguish defects because of their larger receptive fields and stronger semantics. Multi-level features must therefore be fused in a point-wise selective manner based on defect patterns that vary significantly across the image. This serves to avoid missed detections and false positives.

      Based on the above considerations, we propose a novel plug-in cross-fusion operation for deep detectors that enables the point-wise adaptive fusion of multi-level features. At each position, the features from multiple levels are fused in adaptive proportions, depending on their corresponding content, through a novel point-wise cross-layer attention mechanism. Specifically, cross-fusion defines the feature at each point on a target layer as a query and the features at the same point of other layers as keys. The query feature is refined as a weighted sum of key features by modelling their long-range interdependencies. To alleviate the receptive field gap among features from different layers (levels), we further introduced efficient dilated max pooling to enrich low-level features with valuable contextual information. Consequently, the content at each position independently determines which levels of features to aggregate, such that the resulting fused feature map supports the detection of rapidly changing defects across the image. We implemented the cross-fusion operation as a compact residual learning block that can be flexibly plugged into arbitrary backbones. This improves feature abstraction and, in turn, defect detection with negligible additional parameters and computational cost.

      Empowered by the advanced drone platform and dedicated algorithm design, our proposed automated inspection offers clear advantages over conventional manual inspection: (i) It is safer, as inspectors need only control the drone remotely from a nearby safe place; (ii) Accessibility is improved, as drones can easily reach the upper part of the reflector that is difficult or impossible for humans to reach; (iii) It is time-efficient, as the drone’s high mobility allows it to rapidly cover the entire reflector surface, thereby significantly accelerating the inspection process; (iv) It is highly reliable, as computer vision is capable of checking hundreds of panels per minute repeatedly, with substantially greater reliability and less error than human inspection.

    Methods
    • Given that the defects to be detected vary greatly in scale and show high inter-class similarity, we present a novel cross-layer attention mechanism that combines the complementary strengths of multi-level features point by point, depending on the content at each position. On this basis, an efficient plug-in cross-fusion block is designed to lift the feature representation from the detector backbone and support accurate and robust defect detection.

    • Formally, let $ {\boldsymbol{x}}=\left\{{\boldsymbol{x}}_1, \cdots, {\boldsymbol{x}}_{{\rm{N}}} \right\} $ denote a set of feature maps from multiple levels of the feature hierarchy, and let $ p $ be the index of the target feature map to be refined. Cross-fusion refines the response at point $ k $ in $ {\boldsymbol{x}}_p $, that is, the query feature $ {\boldsymbol{x}}_p^k $, as a weighted sum of the responses at point $ k $ in the multi-level feature maps, namely the key features $ \left \{ {\boldsymbol{\hat{x}}}_j^k \right \}_{j=1}^{{\rm{N}}} $:

      $$ \begin{array}{*{20}{l}} {\boldsymbol{y}}_p^k = \displaystyle\sum\limits_{j\in[1...{\rm{N}}]}{\boldsymbol{f}}({\boldsymbol{x}}_p^k, {\boldsymbol{\hat{x}}}_j^k){\boldsymbol{\phi}}_j( {\boldsymbol{\hat{x}}}_j^k) \end{array} $$ (1)

      where $ {\boldsymbol{\hat{x}}} $ represents a set of reference feature maps obtained by resizing each feature map in $ {\boldsymbol{x}} $ to the same resolution as $ {\boldsymbol{x}}_p $, followed by dilated max pooling with a stride of $ 1 $. Dilated max pooling is parameter-free and carries negligible additional computational overhead; it is introduced to enlarge the receptive fields of the reference feature maps while retaining their spatial resolution. The settings of the pooling kernel size and dilation rate are discussed in the “Extensions to General Object Detection” section.

      The point-wise linear embedding $ {\boldsymbol{\phi}}_{j}(\cdot) $ is implemented by 1 × 1 convolution with the learnable weight matrix $ {\boldsymbol{W}}_{\phi_{j}} $ for $ {\boldsymbol{\hat{x}}}_{j}^k $: $ {\boldsymbol{\phi}}_{j}({\boldsymbol{\hat{x}}}_{j}^k)={\boldsymbol{W}}_{\phi_{j}}{\boldsymbol{\hat{x}}}_{j}^k $. The pairwise correlation function $ {\boldsymbol{f}} $ computes a scalar that represents the relationship between the query feature $ {\boldsymbol{x}}_p^k $ and key feature $ {\boldsymbol{\hat{x}}}_j^k $. We consider three instantiations of $ {\boldsymbol{f}} $ as follows:

      Embedded Gaussian. First, we adopt the self-attention form of Ref. 18. In the manner of non-local networks19, $ {\boldsymbol{f}} $ can be defined as an extension of the Gaussian function by computing the similarity in an embedding space:

      $$ \begin{array}{*{20}{l}} {\boldsymbol{f}}({\boldsymbol{x}}_p^k, {\boldsymbol{\hat{x}}}_j^k) = \dfrac{e^{{\boldsymbol{\theta}}({\boldsymbol{x}}_p^k)^\top {\boldsymbol{\phi}}_j( {\boldsymbol{\hat{x}}}_j^k)}}{\sum_{\forall{j}}e^{{\boldsymbol{\theta}}({\boldsymbol{x}}_p^k)^\top {\boldsymbol{\phi}}_j( {\boldsymbol{\hat{x}}}_j^k)}} \end{array} $$ (2)

      where $ {\boldsymbol{\theta}}(\cdot) $ is a linear embedding implemented by a 1 × 1 convolution with learnable weight matrix $ {\boldsymbol{W}}_{\theta} $: $ {\boldsymbol{\theta}}({\boldsymbol{x}}_p^k)={\boldsymbol{W}}_{\theta}{\boldsymbol{x}}_p^k $.

      Sigmoid. We also consider fusing multiple reference features through a gating mechanism by performing $ {\boldsymbol{f}} $ in sigmoid form:

      $$ \begin{array}{*{20}{l}} {\boldsymbol{f}}({\boldsymbol{x}}_p^k, {\boldsymbol{\hat{x}}}_j^k) = \dfrac{1}{ 1+ e^{- {\boldsymbol{\theta}}({\boldsymbol{x}}_p^k)^\top {\boldsymbol{\phi}}_j( {\boldsymbol{\hat{x}}}_j^k)}} \end{array} $$ (3)

      The per-level feature gating (sigmoid) differs from the embedded Gaussian (softmax) in that it allows the incorporation of the information from every reference feature map without causing inter-level competition.

      Dot product. One can also define $ {\boldsymbol{f}} $ as an embedded dot-product similarity19 by removing the softmax activation from the embedded Gaussian form:

      $$ \begin{array}{*{20}{l}} {\boldsymbol{f}}({\boldsymbol{x}}_p^k, {\boldsymbol{\hat{x}}}_j^k) = {\boldsymbol{\theta}}({\boldsymbol{x}}_p^k)^\top {\boldsymbol{\phi}}_j( {\boldsymbol{\hat{x}}}_j^k) \end{array} $$ (4)

      The computed pairwise correlation value is further normalised by the number of reference feature maps N to facilitate gradient computation.

      The multiple available instantiation forms of $ {\boldsymbol{f}} $ demonstrate the flexibility of cross-fusion when fusing multi-level features.
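
      To make the three instantiations concrete, the minimal sketch below (our own illustration, assuming PyTorch; tensor names and shapes are not from the paper) turns a stack of per-point logits $ {\boldsymbol{\theta}}({\boldsymbol{x}}_p^k)^\top {\boldsymbol{\phi}}_j( {\boldsymbol{\hat{x}}}_j^k) $ into fusion weights using each of Eqs. (2)-(4):

```python
# Minimal sketch of the three instantiations of f (Eqs. 2-4), assuming PyTorch.
# logits[j, k] holds the per-point score theta(x_p^k)^T phi_j(x_hat_j^k) for level j.
import torch

def embedded_gaussian(logits):        # Eq. 2: softmax over the N levels (attention)
    return torch.softmax(logits, dim=0)

def sigmoid_gate(logits):             # Eq. 3: per-level gate, no inter-level competition
    return torch.sigmoid(logits)

def dot_product(logits):              # Eq. 4: raw similarity, normalised by N
    return logits / logits.shape[0]

N, H, W = 3, 25, 25                   # three reference levels, 25 x 25 target resolution
logits = torch.randn(N, H, W)
for f in (embedded_gaussian, sigmoid_gate, dot_product):
    weights = f(logits)               # weights[j, k] multiplies phi_j(x_hat_j^k) in Eq. 1
    assert weights.shape == (N, H, W)
```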

    • We implemented the cross-fusion operation as a plug-in block that conducts residual learning. It can hence be flexibly plugged into arbitrary pre-trained backbones to lift their learned feature representation:

      $$ \begin{array}{*{20}{l}} {{\boldsymbol{x}}'_p} = {\boldsymbol{\alpha}}({\boldsymbol{y}}_p) + {\boldsymbol{x}}_p \end{array} $$ (5)

      where $ {\boldsymbol{\alpha}}(\cdot) $ is a point-wise linear embedding implemented by a 1 × 1 convolution with learnable weight matrix $ {\boldsymbol{W}}_{\alpha} $.

      Fig. 2 shows an example of a block architecture implementing $ {\boldsymbol{f}} $ as an embedded Gaussian. The correlation computation by $ {\boldsymbol{f}} $ was implemented as matrix multiplication. Following the bottleneck design in ResNet14, the channel number of the embedded features $ {\boldsymbol{\theta}}(\cdot) $ and $ {\boldsymbol{\phi}}_{j}(\cdot) $ is half that of $ {\boldsymbol{x}}_p $. $ {\boldsymbol{\alpha}}(\cdot) $ restores the channel number of $ {\boldsymbol{y}}_p $ to the same value as that of $ {\boldsymbol{x}}_p $. We initialised $ {\boldsymbol{W}}_{\alpha} $ to zero, so that the entire block performs identity mapping at the start of training without affecting the initial behaviour of pre-trained backbones.

      Fig. 2  Workflow of the cross-fusion operation. A target feature map $ {\boldsymbol{x}}_p $ is refined in a feature hierarchy by capturing the long-range dependencies between multi-level features in a point-wise manner. The boxes represent feature maps with their shapes marked nearby. Operations are specified along the arrow line when necessary. $ \bigotimes $ and $ \bigoplus $ denote matrix multiplication and element-wise summation, respectively.
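
      Fig. 2 translates almost line by line into code. The following minimal sketch is our own re-implementation under stated assumptions, not the authors' released code; the per-level channel counts, bilinear resizing, and toy shapes are illustrative. It resizes the reference maps to the target resolution, applies dilated max pooling, embeds the features with 1 × 1 convolutions, fuses them point-wise with embedded-Gaussian weights (Eqs. 1-2), and adds the result back residually through a zero-initialised $ {\boldsymbol{\alpha}} $ (Eq. 5):

```python
# Minimal sketch of the cross-fusion block (Eqs. 1, 2, 5), assuming PyTorch.
# Channel counts, resizing mode, and the toy shapes below are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F


class CrossFusion(nn.Module):
    """Point-wise cross-layer attention that refines a target map from N reference maps."""

    def __init__(self, target_channels, ref_channels, kernel_size=5, dilation=3):
        super().__init__()
        inter = target_channels // 2                    # bottleneck: half the channels
        pad = dilation * (kernel_size - 1) // 2         # keeps resolution at stride 1
        self.pool = nn.MaxPool2d(kernel_size, stride=1, padding=pad, dilation=dilation)
        self.theta = nn.Conv2d(target_channels, inter, 1)             # query embedding
        self.phis = nn.ModuleList([nn.Conv2d(c, inter, 1) for c in ref_channels])
        self.alpha = nn.Conv2d(inter, target_channels, 1)             # restores channels
        nn.init.zeros_(self.alpha.weight)     # identity mapping at the start of training
        nn.init.zeros_(self.alpha.bias)

    def forward(self, x_p, refs):
        h, w = x_p.shape[-2:]
        keys = []
        for x_j, phi in zip(refs, self.phis):
            x_hat = F.interpolate(x_j, size=(h, w), mode="bilinear", align_corners=False)
            keys.append(phi(self.pool(x_hat)))          # enlarge receptive field, embed
        keys = torch.stack(keys, dim=1)                 # [B, N, C/2, H, W]
        query = self.theta(x_p).unsqueeze(1)            # [B, 1, C/2, H, W]
        logits = (query * keys).sum(dim=2)              # per-point dot products, [B, N, H, W]
        weights = logits.softmax(dim=1)                 # embedded-Gaussian form (Eq. 2)
        y_p = (weights.unsqueeze(2) * keys).sum(dim=1)  # weighted sum over levels (Eq. 1)
        return x_p + self.alpha(y_p)                    # residual refinement (Eq. 5)


# Toy usage with shapes loosely mimicking ResNet-50 stage outputs (channels are assumed).
refs = [torch.randn(1, 1024, 50, 50),   # stand-in for a lower-level reference map
        torch.randn(1, 2048, 25, 25),   # stand-in for r4_F
        torch.randn(1, 2048, 25, 25)]   # stand-in for r4_L (also the target)
block = CrossFusion(target_channels=2048, ref_channels=[1024, 2048, 2048])
out = block(refs[-1], refs)
assert out.shape == refs[-1].shape      # same shape; backbone forward continues as usual
```

      Because $ {\boldsymbol{W}}_{\alpha} $ is zero-initialised, the block performs identity mapping when first inserted into a pre-trained backbone, matching the behaviour described above.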

    • A cross-fusion block can be integrated into arbitrary backbones. Taking ResNet14 as an example, we obtain $ {\boldsymbol{x}} $ by collecting the feature maps before the last residual block in Stages 2 to 4, denoted r $ ^\text{L}_2 $, r $ ^\text{L}_3 $, and r $ ^\text{L}_4 $, respectively. Considering that Stage 4 contains the most residual blocks, we also collect the feature map from its first residual block, denoted r $ ^\text{F}_4 $. We use r $ ^\text{L}_4 $ as the target feature map to be refined and obtain the reference feature maps $ {\boldsymbol{\hat{x}}} $ by downsampling each feature map in $ {\boldsymbol{x}} $ to the resolution of r $ ^\text{L}_4 $. The block outputs a refined feature map for r $ ^\text{L}_4 $, upon which subsequent feed-forward computations in the backbone proceed as usual.

    • We used the DJI Mavic 2 Enterprise Advanced (M2EA) drone as the inspection platform, as illustrated in Fig. 3. Equipped with a 48 MP high-resolution RGB camera that supports ultra-zoom, the M2EA can capture small surface details to support accurate defect detection. Its RTK module provides centimetre-level positioning accuracy, which helps to precisely determine the real-world location of any detected surface defect, and its light weight minimises the risk of damage to the reflector in the event of a flight accident.

      Fig. 3  Visualisation of the DJI M2EA drone platform (left) and its on-site inspection process (right).

    • The inspectors flew the drone along a predetermined route, as depicted in the right subfigure of Fig. 3. The route comprised multiple circular paths around the reflector centre with gradually increasing radius. The drone flew along each circular path in sequence at a constant speed of 4.0 miles per hour and a constant height of 1.0 m above the reflector surface. During the flight, distributed measuring bases were used as operating platforms for temporarily recovering and launching the drone (left subfigure of Fig. 4). The well-planned flight route allowed the captured video data to cover the entire reflector surface; a simple waypoint-generation sketch is given after Fig. 4.

      Fig. 4  Distributed measuring bases (left) and the predetermined flight route (right).
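
      As a concrete illustration of such a route (a hypothetical sketch only: the maximum radius, ring spacing, and waypoint density are our assumptions, not the actual flight plan), waypoints can be generated ring by ring around the reflector centre:

```python
# Hypothetical sketch of waypoint generation for concentric circular flight paths.
# Maximum radius, radial step, and waypoint spacing are illustrative assumptions.
import math

def circular_route(max_radius_m=250.0, radial_step_m=5.0, waypoint_spacing_m=2.0):
    """Yield (x, y) waypoints in a reflector-centred frame, ring by ring."""
    n_rings = int(max_radius_m // radial_step_m)
    for ring in range(1, n_rings + 1):
        radius = ring * radial_step_m
        n_pts = max(8, int(2 * math.pi * radius / waypoint_spacing_m))
        for i in range(n_pts):
            angle = 2 * math.pi * i / n_pts
            yield radius * math.cos(angle), radius * math.sin(angle)

waypoints = list(circular_route())
path_length_m = sum(2 * math.pi * ring * 5.0 for ring in range(1, 51))  # total ring length
speed_mps = 4.0 * 1609.344 / 3600.0                                     # the reported 4.0 mph
print(f"{len(waypoints)} waypoints, ~{path_length_m / speed_mps / 3600:.1f} h of flight time")
```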

    • We divided the surface defects into two classes: dent and hole. All dents deeper than 0.12 inches and longer than 0.4 inches, and all holes with a diameter larger than 0.2 inches, were considered defects to be detected. We extracted 1 out of every 30 frames of the captured videos (a minimal extraction sketch is given after Fig. 5) and retained the frames containing at least one defect, resulting in a total of 2,582 frames. We used the annotation tool LabelImg to manually label a bounding box and class label for each defect. A total of 34,239 bounding boxes covering approximately 3,000 unique defects were labelled. Fig. 5 illustrates the distribution of defect sizes in the collected frames. Both surface dents and holes show large size variations, which statistically supports our design motivation. We split the labelled data into training and test sets of 2,442 and 140 frames, respectively; the frames in the two sets cover different areas of the reflector surface and do not intersect.

      Fig. 5  Distribution of defect size on the captured drone imagery.
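
      The frame-extraction step (“1 out of every 30 frames”) can be reproduced with a few lines of OpenCV. The sketch below is our own illustration; the file paths are placeholders, and the subsequent filtering of defect-free frames and the labelling with LabelImg were done manually as described above.

```python
# Sketch of the frame-extraction step: keep one frame out of every 30.
# Paths are placeholders; defect filtering and labelling were done manually.
import os
import cv2  # opencv-python

def extract_frames(video_path, out_dir, keep_every=30):
    os.makedirs(out_dir, exist_ok=True)
    cap = cv2.VideoCapture(video_path)
    idx = saved = 0
    while True:
        ok, frame = cap.read()
        if not ok:                       # end of video (or read error)
            break
        if idx % keep_every == 0:
            cv2.imwrite(os.path.join(out_dir, f"frame_{idx:06d}.jpg"), frame)
            saved += 1
        idx += 1
    cap.release()
    return saved

# Example (hypothetical file name):
# n = extract_frames("survey_pass_01.mp4", "frames/pass_01")
```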

    Results
    • We first report the results of our method on FAST surface defect and pavement distress detection, and then provide evaluations on VisDrone20 and COCO21 for drone and general object detection, respectively. Extensive ablation studies were conducted to validate our design choices and parameter settings.

    • Setups. We built our defect detector on top of the recently developed Libra R-CNN22 by incorporating cross-fusion into the ResNet-50 (R50) backbone14. The weight layers of the inserted cross-fusion block were initialised from a zero-mean Gaussian with a 0.01 standard deviation. We randomly cropped patches of size 1024 × 1024 from the original frame image and resized them to a maximum of 1333 × 800 at constant aspect ratio. We trained the detector over 12 epochs using the SGD optimiser with an initial learning rate of 0.01 and a batch size of 8. For inference, we extracted patches in a sliding-window manner from the entire frame image (see the sketch below). We used COCO-style box Average Precision (AP) and AP at IoU thresholds of 0.50 (AP50) and 0.75 (AP75) as evaluation metrics.
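
      As a rough illustration of the sliding-window inference step (the patch overlap below is our assumption; it is not specified in the text), a full frame can be tiled into 1024 × 1024 patches whose detections are then mapped back to frame coordinates and merged:

```python
# Sketch of sliding-window patch extraction for inference on a full video frame.
# The overlap value is an assumption; per-patch detections are shifted back by
# (x0, y0) and merged (e.g. with NMS) to obtain frame-level results.
import numpy as np

def sliding_windows(frame, patch=1024, overlap=128):
    """Yield (x0, y0, patch_array) tiles covering the whole frame."""
    h, w = frame.shape[:2]
    stride = patch - overlap
    ys = list(range(0, max(h - patch, 0) + 1, stride))
    xs = list(range(0, max(w - patch, 0) + 1, stride))
    if ys[-1] + patch < h:
        ys.append(h - patch)             # make sure the bottom edge is covered
    if xs[-1] + patch < w:
        xs.append(w - patch)             # make sure the right edge is covered
    for y0 in ys:
        for x0 in xs:
            yield x0, y0, frame[y0:y0 + patch, x0:x0 + patch]

frame = np.zeros((3000, 4000, 3), dtype=np.uint8)    # e.g. a 4000 x 3000 still frame
patches = list(sliding_windows(frame))
print(len(patches), patches[0][2].shape)             # each patch is 1024 x 1024 x 3
```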

      Main results. Table 1 shows that incorporating cross-fusion significantly improves the Libra R-CNN baseline by 1.7% mAP, with 1.2% and 2.1% AP improvements on dent and hole, respectively. Notably, AP50 is boosted by 2.4%. This indicates that cross-fusion improves localisation accuracy, which contributes to the detection of small defects that are critical to reflector surface inspection. Compared to the YOLO series10,23-25, which has been widely used for defect detection on drone imagery5,12, and to the recent efficient TOOD26, our model clearly performs better, surpassing the popular YOLOv524 by a large margin of 3.4% AP. These results confirm the effectiveness of our cross-fusion design.

      Method                    Dent                     Hole                     Overall
                                AP    AP50   AP75        AP    AP50   AP75        AP    AP50   AP75
      YOLOv3 [10]               19.8  58.3    8.0        32.4  81.3   19.7        26.1  69.8   13.9
      YOLOX [23]                25.4  66.9   13.2        31.2  78.2   18.1        28.3  72.5   15.7
      YOLOv5 [24]               29.1  74.0   15.2        46.6  91.6   41.8        37.9  82.8   28.5
      YOLOv7 [25]               31.9  79.8   16.3        45.8  91.4   38.1        38.9  85.6   27.2
      TOOD [26]                 28.8  73.6   14.5        44.1  88.4   34.1        36.4  81.0   24.3
      Faster R-CNN [27]         29.6  73.4   17.6        45.0  93.3   38.8        37.8  83.4   28.2
      CARAFE [28]               31.1  76.0   19.6        47.1  93.3   42.8        39.1  84.6   31.2
      Grid R-CNN [29]           29.5  74.1   15.9        45.9  94.7   41.2        37.7  84.4   28.5
      Libra R-CNN [22]          32.3  77.3   19.4        47.0  92.9   40.4        39.6  85.1   29.9
      Cross-Pooling (Ours)      33.5  80.0   20.3        49.1  94.9   45.5        41.3  87.5   32.9

      Table 1.  Results for FAST surface defect detection. AP/AP50/AP75 (%) are reported.

      Qualitative results are shown in Fig. 6. Directly applying Libra R-CNN to defect detection tends to miss small defects (first column) and generate false positives (second column). Moreover, some defects were incorrectly classified because of their similar visual appearance (last three columns). In contrast, incorporating cross-fusion improves the accuracy of bounding boxes for defects at various scales, particularly small ones. Notably, our model successfully detected surface holes as small as 0.2 inches in diameter, which can easily be missed by manual visual inspection. In addition, different types of defects are better distinguished, a critical benefit for initiating timely repairs. These results indicate that cross-fusion adaptively integrates fine-grained details and strong semantics point-wise, depending on the local content, thus enabling accurate defect detection across the entire drone image. They also demonstrate the superiority of our automatic optical inspection over manual inspection, which we attribute to drone-powered data acquisition and the dedicated cross-fusion design.

      Fig. 6  Qualitative results of defect detection (dent, hole). Incorporating cross-fusion successfully retrieves missed small defects (first column) and eliminates false positives (second column) and wrongly classified predictions (last three columns) by the Libra R-CNN baseline.

    • Setups. We also tested our method on the automatic optical inspection of pavement, where the aim is to detect different types of distress on large-scale pavement in images captured from UAVs. We used the pavement image dataset established by Zhu et al.5 and divided its publicly available 2,401 images into training and test sets with proportions of 0.8 and 0.2, respectively. All distress instances are divided into three categories: Crack, Repair, and Pothole. We built our model on top of the popular Grid R-CNN and trained it over 12 epochs using the SGD optimiser with an initial learning rate of 0.01 and a batch size of 8. We report the mean Average Precision (mAP) over IoU thresholds of 0.5:0.95.

      Main results. As listed in Table 2, our model achieves an mAP of 54.6%, significantly outperforming the Grid R-CNN baseline by 2.3% as well as various well-established detectors, including YOLOv5, TOOD, and CARAFE. In particular, our model improves on Grid R-CNN by 1.3% and 3.5% AP on Crack and Repair, respectively, indicating its superiority in identifying distress instances with similar appearance. For the Pothole category, where small distress is common, AP is improved by 1.9%, demonstrating the superior capability of cross-fusion in locating small distress.

      Method                    Crack   Repair   Pothole   mAP
      YOLOX [23]                 27.6     55.5      40.9   41.3
      TOOD [26]                  32.6     55.8      53.2   47.2
      Libra R-CNN [22]           36.1     63.9      50.7   50.2
      YOLOv5 [24]                37.0     64.0      53.2   51.4
      CARAFE [28]                36.7     60.3      58.5   51.8
      Grid R-CNN [29]            37.6     62.4      57.0   52.3
      Cross-Pooling (Ours)       38.9     65.9      58.9   54.6

      Table 2.  Results for pavement distress detection. AP (%) averaged over IoU thresholds 0.50:0.05:0.95 is reported.

      Qualitative results are presented in Fig. 7. Our model identifies and locates different types of pavement distress under various scenarios with high accuracy, demonstrating its clear superiority over previous models. The results confirm that cross-fusion benefits defect detection in drone imagery. More broadly, drones backed by cross-fusion-based deep models are valuable for the automatic optical inspection of the surfaces of large-scale equipment and facilities, providing high accuracy and accessibility while lowering labour cost and inspection time.

      Fig. 7  Qualitative results of pavement distress detection (Crack, Repair, Pothole). Cross-fusion helps to eliminate misses (first column) and misidentifications (second column) by the Grid R-CNN baseline.

    • Setups. Because of the large variance of object scales inherent to drone-derived datasets, we further validated our cross-fusion method on the public drone dataset VisDrone20, comprising 10,209 images of 2000 × 1500 resolution across 10 object categories, captured by various drone platforms. We built our detector on top of the recently successful UFPMP-Det30 by simply inserting a single cross-fusion block into the ResNet-50 backbone.

      Main results. Table 3 shows our method outperforming the previously best-performing UFPMP-Det. This indicates that incorporating cross-fusion effectively enriches the learned feature representation with strong semantics and fine-grained details, which are crucial for detecting objects at various scales. Fig. 8 provides qualitative results. The UFPMP-Det baseline misses some small-scale objects (the person in the first column) and generates false alarms (last two columns). In contrast, incorporating cross-fusion improves both these results.

      Method                    AP     AP50   AP75
      Faster R-CNN [27]         21.4   40.7   19.9
      ClusDet [31]              26.7   50.6   24.4
      DMNet [32]                28.2   47.6   28.9
      GLSAN [33]                30.7   55.4   30.0
      AMRNet [34]               31.7   52.7   33.1
      UFPMP-Det* [30]           35.8   56.7   38.3
      Cross-Pooling (Ours)      36.2   56.9   38.6

      Table 3.  Results for drone object detection on VisDrone val. AP/AP50/AP75 (%) are reported. *: Reproduced results.

      Fig. 8  Qualitative results on VisDrone (Person, Motor, Car, Pedestrian). Incorporating cross-fusion successfully retrieves missed small objects (indicated by arrows) and eliminates false alarms (indicated by dotted boxes) by the UFPMP-Det baseline.

    • Setups. Our cross-fusion method also generalises to general object detection. We plugged it into well-established general object detectors with various backbones, ranging from ResNet-50/101 (R50/101)14 to the lightweight MobileNetV2 (MNet)13. We trained on COCO train2017 and report the final and ablation results on test2017 and val2017, respectively. The images were resized to a maximum of 1333 × 800 pixels at constant aspect ratio. The models were trained over 12 epochs using the SGD optimiser with an initial learning rate of 0.01 and a batch size of 8.

      Main results. Table 4 compares the best-performing methods. Without resorting to data augmentation or multi-scale testing, incorporating cross-fusion significantly boosts the Faster R-CNN baseline on different backbones, by up to 1.7% AP. Our method performs comparably to recent preeminent detectors, such as GRoIE35 and YOLOF36. It even surpasses FPN15, which constructs a feature pyramid with independent predictions made at each level, and it outperforms other fusion-based detectors including TDM41, RefineDet42, and M2Det43. These results clearly demonstrate the merit of cross-fusion in fusing multi-level features for accurate general object detection.

      Backbone   Model                          AP            AP50   AP75
      MNet       SSDLite [13]                   22.1          -      -
                 Faster R-CNN*                  23.3          40.7   23.9
                 Cross-fusion (Faster R-CNN)    24.4 (↑1.1)   42.3   25.0
      R50        Faster R-CNN*                  35.3          56.3   38.1
                 RetinaNet-FPN*                 35.9          55.8   38.4
                 Faster R-CNN-FPN*              36.6          58.8   39.6
                 YOLOF* [36]                    37.5          57.0   40.4
                 GRoIE [35]                     37.5          59.2   40.6
                 CARAFE [28]                    38.1          60.7   41.0
                 Libra R-CNN* [22]              38.3          59.5   41.9
                 Cross-fusion (Faster R-CNN)    37.0 (↑1.7)   58.3   39.9
                 Cross-fusion (YOLOF)           38.0 (↑0.5)   56.9   41.1
                 Cross-fusion (Libra R-CNN)     38.6 (↑0.3)   59.5   42.2
      R101       TDM [41]                       35.2          55.3   38.1
                 RefineDet [42]                 36.4          57.5   39.5
                 Faster R-CNN*                  38.7          59.5   41.8
                 M2Det [43]                     38.8          59.4   41.7
                 RetinaNet-FPN [39]             39.1          59.1   42.3
                 Faster R-CNN-FPN*              39.2          61.2   42.6
                 Regionlets [40]                39.3          59.8   -
                 Cross-fusion (Faster R-CNN)    39.5 (↑0.8)   60.8   42.7

      Table 4.  Results for general object detection on COCO test2017. AP/AP50/AP75 (%) are reported. *: Higher results reproduced using MMDetection37.

      In addition to Faster R-CNN, our method also improves other preeminent detectors, including Libra R-CNN and YOLOF. This indicates that cross-fusion applies to various detection frameworks, ranging from the two-stage Faster R-CNN and the FPN-based Libra R-CNN to the one-stage, FPN-free YOLOF.

      Additional overheads. Our cross-fusion method is computationally efficient. We report the additional parameters, FLOPs, and memory footprint introduced by cross-fusion relative to the Faster R-CNN baseline with MobileNetV2 (5.05 M, 96.76 GMac, 1.81 GB), ResNet-50 (33.58 M, 837.07 GMac, 3.92 GB), and ResNet-101 (52.52 M, 913.14 GMac, 4.03 GB). Table 5e shows that a single cross-fusion block introduces negligible additional overhead (e.g., 0.1% FLOPs, 0.4% parameters, and 0.7% memory footprint on MNet) and hardly affects the inference speed.

      (a) Ablation on correlation functions (model: R50).
          Function         AP     AP50   AP75
          baseline         34.9   55.7   37.7
          dot product      36.1   57.2   38.8
          sigmoid          36.6   57.8   39.4
          Gaussian         36.7   57.7   39.8

      (b) Ablation on fusion strategies (model: R50).
          Strategy              AP     AP50   AP75
          concatenation         36.1   57.0   38.9
          addition              36.2   57.2   39.1
          cross-fusion (ours)   36.7   57.7   39.8

      (c) Ablation on reference features (model: R50). References among r $ ^\text{L}_2 $, r $ ^\text{L}_3 $, r $ ^\text{F}_4 $, and r $ ^\text{L}_4 $ are added incrementally (see "Impact of reference features").
          Reference features    AP     AP50   AP75
          res4                  34.9   55.5   37.4
                                35.1   55.9   38.1
                                35.7   56.8   38.3
                                36.7   57.7   39.8

      (d) Ablation on the kernel sizes and dilation rates of dilated max pooling (model: R50).
          Operation       Kernel   Dilation   AP     AP50   AP75
          none            -        -          34.7   55.4   37.3
          max pooling     3×3      3          35.8   56.7   38.7
          max pooling     5×5      2          36.2   57.2   39.1
          max pooling     5×5      3          36.7   57.7   39.8
          max pooling     5×5      4          36.6   57.9   39.7
          convolution     5×5      3          36.7   57.9   39.3

      (e) Ablation on backbones. Relative numbers of parameters, FLOPs, and memory footprint are reported; inference latency and memory are tested with batch size 1 on a GTX 1080 Ti GPU.
          Backbone   Model      #Params   FLOPs    mem      lat (ms)   AP     AP50   AP75
          MNet       baseline   -         -        -        114.9      22.9   39.8   23.5
                     + block    1.004×    1.001×   1.007×   116.3      24.1   41.5   24.6
          R50        baseline   -         -        -        350.9      34.9   55.7   37.7
                     + block    1.070×    1.011×   1.015×   355.9      36.7   57.7   39.8
          R101       baseline   -         -        -        381.7      38.6   59.3   41.6
                     + block    1.050×    1.012×   1.005×   383.1      39.3   60.4   42.4

      Table 5.  Ablation studies. Models are trained using Faster R-CNN on COCO $\mathrm{train2017}$, tested on $\mathrm{val2017}$.

      Impact of correlation function. We tested the three instantiations of the pairwise correlation function listed in Table 5a. Cross-fusion with any of the three functions improves performance, by up to 1.8% AP. Specifically, the embedded Gaussian and sigmoid functions perform similarly, and both better than the dot product. This indicates that the point-wise dependencies among multi-level features are modelled better by the attention (softmax) or gating (sigmoid) mechanisms. Our experiments used the embedded Gaussian.

      Superiority over conventional fusion strategies. We compared cross-fusion with conventional multi-level feature fusion strategies, i.e., layer-wise addition and concatenation. For a fair comparison, we also applied dilated max pooling to these counterpart models. Table 5b shows that cross-fusion outperforms both alternatives. This indicates that point-wise attentional feature fusion highlights informative features while suppressing less useful ones based on the content at each position, leading to a more powerful feature representation across the entire input image.

      Impact of reference features. We investigated the impact of individual reference features by incorporating them incrementally. Table 5c shows that incorporating more reference features (starting from r $ ^\text{L}_3 $) improves performance. Because no improvement is observed by introducing lower-level features, such as r $ ^\text{L}_2 $, we used r $ ^\text{L}_3 $, r $ ^\text{F}_4 $ and r $ ^\text{L}_4 $ as reference features in our experiments.

      Effect of enlarging the receptive field. We introduced dilated max pooling with stride 1 to enlarge the receptive fields of the reference features. Table 5d shows that removing this operation produces a clear drop in AP. Notably, dilated max pooling performs on par with dilated convolution while being parameter-free and more computationally efficient. Larger kernel sizes and dilation rates are beneficial only up to a point. We used a kernel size of 5 × 5 and a dilation rate of 3.
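
      For reference, a stride-1 window with kernel size $ k $ and dilation $ d $ spans $ d(k-1)+1 $ pixels per side, i.e., $ 3\times(5-1)+1=13 $ for the chosen setting, so symmetric padding of 6 keeps the reference feature maps at their original resolution (the padding value is our inference from standard convolution arithmetic, not stated explicitly in the text).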

      Results on different backbones. We tested backbones of various depths, e.g., MNet and R101. For MNet, we used the bottom 14 convolutional layers as the feature extractor and the remaining layers as the classifier head; we inserted cross-fusion before layer 13 and used the feature maps of layers 7-10 as references. Table 5e shows that cross-fusion improves MNet and R101 by 1.2% and 0.7% AP, respectively, suggesting that capturing point-wise multi-level dependencies complements increasing network depth.

    Conclusion and Discussions
    • This work presents an automated optical inspection of the reflector surface of FAST, the world’s largest single-dish radio telescope, by exploiting advances in drone technology and deep-learning techniques. To tackle the challenges of surface defects in drone imagery exhibiting large-scale variation and high inter-class similarity, we introduced a simple yet effective cross-fusion operation that aggregates multi-level features in a point-wise selective manner to help detect defects of various scales and types.

      Our cross-fusion method is lightweight and computationally efficient, particularly valuable features for onboard drone applications. Currently, we process the video data captured by the drone camera offline on a ground station, which increases the operational complexity. Future work will implement the algorithm on embedded hardware platforms to process captured videos onboard the drone, to make the inspection system more autonomous and more robust.

    Acknowledgements
    • This work was financially supported by the National Natural Science Foundation of China (No. 62101032), the Postdoctoral Science Foundation of China (Nos. 2021M690015, 2022T150050), and the Beijing Institute of Technology Research Fund Program for Young Scholars (No. 3040011182111).
