Remote Sensing Image Scene Classification by Multiple Granularity Semantic Learning

oleh: Weilong Guo, Shengyang Li, Jian Yang, Zhuang Zhou, Yunfei Liu, Junjie Lu, Longxuan Kou, Manqi Zhao

Format: Article
Diterbitkan: IEEE 2022-01-01

Deskripsi

Remote sensing image scene classification faces challenges, such as the difference in semantic granularity of different scene categories and the imbalance of the number of samples, which cause the wrong features learning for deep convolutional networks (DCNs). This article proposes a multiple granularity semantic learning network (MGSN), including multiple granularity semantic learning (MGSL) and nonuniform sampling augmentation (NUA) modules. Specifically, the MGSL module makes full use of different granularities of semantic information of scenes, guiding the network to learn global and local features simultaneously. And, the relationship between semantic features of different granularity has been explored, based on which the learning of coarse-grained features helps to improve the learning of fine-grained semantic features. It shows that learning fine-grain semantics can inhibit learning coarse-grain semantic features. The NUA module combines sampling and sample augmentation to balance the sample distribution, which can avoid overfitting caused by oversampling. The proposed MGSN achieved state-of-the-art classification accuracy on two large-scale remote sensing image scene classification datasets, Million-AID and NWPU-RESISC45. Under 10<inline-formula><tex-math notation="LaTeX">$\%$</tex-math></inline-formula> and 20<inline-formula><tex-math notation="LaTeX">$\%$</tex-math></inline-formula> training samples of the NWPU-RESISC45 dataset, MGSN achieves 91.92<inline-formula><tex-math notation="LaTeX">$\%$</tex-math></inline-formula> and 94.33<inline-formula><tex-math notation="LaTeX">$\%$</tex-math></inline-formula> top-1 accuracy, respectively. In experiments conducted on the Million-AID dataset, the proposed MGSN performed best among 18 DCNs. In comparison to the baseline, FixEfficientNet, MGSN improved the accuracy of top-1 and top-5 by 10.63<inline-formula><tex-math notation="LaTeX">$\%$</tex-math></inline-formula> and 5.47<inline-formula><tex-math notation="LaTeX">$\%$</tex-math></inline-formula>, respectively, with low complexity costs.