skip to main content
Primo Search
Search in: Busca Geral
Tipo de recurso Mostra resultados com: Mostra resultados com: Índice

Pre-Trained AlexNet Architecture with Pyramid Pooling and Supervision for High Spatial Resolution Remote Sensing Image Scene Classification

Han, Xiaobing ; Zhong, Yanfei ; Cao, Liqin ; Zhang, Liangpei

Remote sensing (Basel, Switzerland), 2017-08, Vol.9 (8), p.848 [Periódico revisado por pares]

Basel: MDPI AG

Texto completo disponível

Citações Citado por
  • Título:
    Pre-Trained AlexNet Architecture with Pyramid Pooling and Supervision for High Spatial Resolution Remote Sensing Image Scene Classification
  • Autor: Han, Xiaobing ; Zhong, Yanfei ; Cao, Liqin ; Zhang, Liangpei
  • Assuntos: Architectural engineering ; Artificial neural networks ; Big Data ; Bridges ; Classification ; convolutional neural network ; Datasets ; Detection ; high spatial resolution remote sensing imagery ; Image classification ; International conferences ; Neural networks ; pre-trained AlexNet ; Remote sensing ; scene classification ; Semantics ; side supervision ; spatial pyramid pooling ; Spatial resolution ; Trends
  • É parte de: Remote sensing (Basel, Switzerland), 2017-08, Vol.9 (8), p.848
  • Descrição: The rapid development of high spatial resolution (HSR) remote sensing imagery techniques not only provide a considerable amount of datasets for scene classification tasks but also request an appropriate scene classification choice when facing with finite labeled samples. AlexNet, as a relatively simple convolutional neural network (CNN) architecture, has obtained great success in scene classification tasks and has been proven to be an excellent foundational hierarchical and automatic scene classification technique. However, current HSR remote sensing imagery scene classification datasets always have the characteristics of small quantities and simple categories, where the limited annotated labeling samples easily cause non-convergence. For HSR remote sensing imagery, multi-scale information of the same scenes can represent the scene semantics to a certain extent but lacks an efficient fusion expression manner. Meanwhile, the current pre-trained AlexNet architecture lacks a kind of appropriate supervision for enhancing the performance of this model, which easily causes overfitting. In this paper, an improved pre-trained AlexNet architecture named pre-trained AlexNet-SPP-SS has been proposed, which incorporates the scale pooling—spatial pyramid pooling (SPP) and side supervision (SS) to improve the above two situations. Extensive experimental results conducted on the UC Merced dataset and the Google Image dataset of SIRI-WHU have demonstrated that the proposed pre-trained AlexNet-SPP-SS model is superior to the original AlexNet architecture as well as the traditional scene classification methods.
  • Editor: Basel: MDPI AG
  • Idioma: Inglês

Buscando em bases de dados remotas. Favor aguardar.