For the resource constraints and real-time requirements of edge computing scenarios, this paper systematically studies the optimization and implementation method of a lightweight neural network architecture. From the constraints of edge device computing power, power consumption constraints and the diversity of deployment environment, this paper discusses the architecture design strategy based on convolution decomposition, efficient embedding of attention mechanism and width depth collaborative optimization, and then proposes a comprehensive optimization scheme of training perception quantization, pruning and neural architecture search combined optimization, reasoning engine collaborative adaptation and end-to-end dynamic optimization. Through the deep integration of algorithm innovation and hardware characteristics, a complete technical framework from model design to edge deployment is constructed, aiming to achieve a good balance between accuracy and efficiency under resource constraints, and provide theoretical support and practical guidance for the model selection and deployment optimization of edge intelligent system.
Cai T, 2025, Design of a Lightweight Intrusion Detection Algorithm for Smart Factories Oriented to Industrial Edge Computing. Cybersecurity and Informatization, 2025(6): 127–129.
Lin C, Yan W, Ji G, et al., 2024, A Survey of Lightweight Methods for Deep Neural Network Parameters. Journal of China Academy of Electronics and Information Technology, 19(4): 350–363+379.
He W, Liu Z, Liu H, et al., 2024, Electromagnetic Field Simulation Method for Transformers Based on a Lightweight Hybrid-Gate Neural Network. Power System Technology, 48(5): 2143–2151.
Lu M, 2021, Research on Lightweight Models under Edge Computing. Modern Computer, 27(25): 31–35.
Ma F, Wang B, Dong X, et al., 2020, Power Vision Edge Intelligence: Deep Visual Acceleration Technology Driven by Edge Computing in Power Systems. Power System Technology, 44(6): 2020–2029.
Liao F, Liu B, Huang J, et al., 2024, Research on Cloud-Edge Collaborative Application Mode of Intelligent Distribution Station Rooms Based on Lightweight Models. Automation & Instrumentation, 2024(3): 210–215.
Xu X, Zhou C, Hu Z, et al., 2024, A Survey on Lightweight Deep Neural Network Models for Edge Intelligence. Computer Science, 51(7): 257–271.
Pang S, 2025, Design of Lightweight Optimization Algorithms for System Integration in Edge Computing Scenarios. Product Reliability Report, 2025(7): 213–214.
Zhang K, Lu Z, Nie T, et al., 2023, Analysis of Intelligent Edge Computing Software Technologies for Unmanned Equipment. Acta Armamentarii, 44(9): 2611–2621.
Ling X, Jiang J, Bao J, 2025, An Optimization Method for Quantization-Aware Training inEdge Environments. Journal of China Academy of Electronics and Information Technology, 20(5): 465–474.