Transmission Line Defect Detection Algorithm Based on Improved RT-DETR Model
Download PDF

Keywords

RT-DETR
Defect detection
Efficient vision attention
C2f
Small object detection

DOI

10.26689/jera.v10i1.13324

Submitted : 2026-01-14
Accepted : 2026-01-29
Published : 2026-02-13

Abstract

This paper addresses the urgent need for high-precision and high-efficiency visual perception technologies in power equipment operation and maintenance under the background of rapid development of smart grids. It points out the performance limitations of the existing real-time target detection framework RT-DETR when handling small targets, dense targets, and complex backgrounds in power inspection scenarios. To overcome this bottleneck, this study proposes an improved backbone network model, DETR-EVA, based on an efficient visual attention mechanism (EVA). This model innovatively designs an attention computation structure with linear complexity by deeply integrating the EVA mechanism with the C2f module in the RT-DETR backbone network, and combines local detail perception and global dependency modeling capabilities. Its core lies in the introduction of a gated fusion mechanism, which significantly enhances the model’s ability to model long-distance contextual relationships and the adaptive adjustment efficiency of feature weights while retaining the advantages of multi-branch feature extraction and fusion of the C2f module. Experiments were conducted on an inspection image dataset containing typical power equipment targets. The results show that compared with the original RT-DETR model, DETR-EVA improves the overall accuracy index mAP50-95 by 2.5%, reduces computational complexity by 14%, and reduces the number of model parameters by 27%. This effectively verifies that the proposed method can significantly improve the detection accuracy of small targets and complex scenes while maintaining real-time detection speed, providing a better visual solution for intelligent operation and maintenance of power equipment.

References

Tao T, Li Z, 2026, A Review of Deep Learning Application in Transmission Line Defect Detection. Electric Power Systems Research, 2026(250): 112193.

Girshick R, Donahue J, Darrell T, et al., 2014, Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 580–587.

Girshick R, 2015, Fast r-cnn. Proceedings of the IEEE International Conference on Computer Vision, 1440–1448.

Ren S, He K, Girshick R, et al., 2016, Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(6): 1137–1149.

Redmon J, Divvala S, Girshick R, et al., 2016, You Only Look Once: Unified, Real-Time Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 779–788.

Redmon J, Farhadi A, 2017, YOLO9000: Better, Faster, Stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 7263–7271.

Redmon J, Farhadi A, 2018, Yolov3: An Incremental Improvement, arXiv preprint, https://doi.org/10.48550/arXiv.1804.02767

Khanam R, Hussain M, 2024, Yolov11: An Overview of the Key Architectural Enhancements, arXiv preprint, https://doi.org/10.48550/arXiv.2410.17725

Liu W, Anguelov D, Erhan D, et al., 2016, Ssd: Single Shot Multibox Detector. European Conference on Computer Vision, Springer International Publishing, 21–37.

Carion N, Massa F, Synnaeve G, et al., 2020, End-to-End Object Detection with Transformers. European Conference on Computer Vision, Springer International Publishing, 213–229.

Dosovitskiy A, 2020, An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. arXiv preprint, https://doi.org/10.48550/arXiv.2010.11929

Lv Y, Chen W, Chen B, et al., 2023, RT-DETR: DETRs Beat YOLOs on Real-Time Object Detection. Proc. of the IEEE/CVF Int. Conf. on Computer Vision Workshops (ICCVW), 2420–2430.

Li D, Yang P, Zou Y, 2024, Optimizing Insulator Defect Detection with Improved DETR Models. Mathematics, 12(10): 1507.

Bai X, Ji H, Wang L, et al., 2024, Detection Method for Structural Defects of U-Shaped Hanging Ring for Power Fittings Based on Magnetic Leakage Principle. Engineering Research Express, 6(2): 025339.

Huang X, Jia M, Tai X, et al., 2024, Federated Knowledge Distillation for Enhanced Insulator Defect Detection in Resource Constrained Environments. IET Computer Vision, 18(8): 1072–1086.

Xie Z, Dong C, Zhang K, et al., 2024, Power-DETR: End-to-End Power Line Defect Components Detection Based on Contrastive Denoising and Hybrid Label Assignment. IET Generation, Transmission & Distribution, 18(20): 3264–3277.

Chen W, Li S, Han X, 2025, IDD-DETR: Insulator Defect Detection Model and Low-Carbon Operation and Maintenance Application Based on Bidirectional Cross-Scale Fusion and Dynamic Histogram Attention. Sensors, 25(18): 5848.

Vaswani A, Shazeer N, Parmar N, et al., 2017, Attention is All You Need. Advances in Neural Information Processing Systems, 2017: 30.

Hu J, Shen L, Sun G, 2018, Squeeze-and-Excitation Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 7132–7141.

Woo S, Park J, Lee J, et al., 2018, Cbam: Convolutional Block Attention Module. Proceedings of the European Conference on Computer Vision (ECCV), 3–19.