SSOD-MViT: A novel model for recognizing alfalfa seed pod maturity based on semi-supervised learning
Date:2025-05-07 Page Views: 10

Fuyang Tian , Yinuo Zhang ,Shakeel Ahmed Soomro ,Qiang Wang, Shuaiyang Zhang, Ji Zhang, Qinglu Yang, Yunpeng Yan, Zhenwei Yu, Zhanhua Song

Abstract

The current study was conducted to address the challenges of recognizing alfalfa seed pod maturity in complex field environments, and the significant impact of the quantity of labeled samples on the performance of object detection algorithms. A method for identifying the maturity of alfalfa seed pod clusters was proposed using an unmanned aerial vehicle (UAV) and a semi-supervised deep learning model SSOD-MViT (Semi-Supervised Object Detection based on the MViTNet). To enhance the models capability to extract key feature information, an improved lightweight general vision transformer MobileViT (Mobile Vision Transformer) was firstly employed as the backbone. The deep integration of ScConv (Spatial and Channel Reconstruction Convolution) was additionally employed to reduce redundant information within the channels, thereby decreasing the computational load of the model. Secondly, a small object detection layer was incorporated into the Neck, and the Efficient Multi-Scale Attention Module (EMA) was added to the C2f structure. The SAHI (Slicing Aided Hyper Inference) algorithm was integrated during the inference process, which improves the detection accuracy of small-sized alfalfa seed pod clusters and enhances the models resistance to interference. Finally, the concept of Consistency Regularization was incorporated into the model to reduce its dependency on sample data. The experimental results revealed that SSOD-MViT achieved a mAP@0.5 of 92.23 %. When compared to the YOLOv8 object detection model, the mAP@0.5 had improved by 12.31 %. When compared to the Faster R-CNN object detection model, the average detection time reduced by 175.81 ms. The proposed model MViTNet (MobileViT Network) had a storage size of 5.3 MB, and an average detection time of 82.34 ms, providing favorable conditions for subsequent deployment on embedded devices. This research effectively improved the detection performance of existing models in detecting alfalfa seed pod maturity in complex field environments. This advancement also aids in determining the optimal harvesting period for alfalfa seeds, thereby providing technical support to enhance productivity and reduce production costs in the alfalfa seed production industry.

Paper Linkage:https://doi.org/10.1016/j.compag.2025.110439


Copyright@SDAU
Top