skip to main content
Tipo de recurso Mostra resultados com: Mostra resultados com: Índice

Efficient Generation of Training Libraries for Image Classification Models from Photos of Herbarium Specimens

Schmidt-Lebuhn, Alexander N. ; Knerr, Nunzio

International journal of plant sciences, 2023-06, Vol.184 (5), p.387-391 [Periódico revisado por pares]

Chicago: The University of Chicago Press

Texto completo disponível

Citações Citado por
  • Título:
    Efficient Generation of Training Libraries for Image Classification Models from Photos of Herbarium Specimens
  • Autor: Schmidt-Lebuhn, Alexander N. ; Knerr, Nunzio
  • Assuntos: Annotations ; Biodiversity ; Classification ; Computer vision ; End users ; Image annotation ; Image classification ; Libraries ; Open source software ; Training
  • É parte de: International journal of plant sciences, 2023-06, Vol.184 (5), p.387-391
  • Descrição: Premise of research. Computer vision has the potential to become a transformative identification tool in biodiversity research and collections management, allowing high-throughput identification and removing the need for nonexpert end users to understand technical terminology. A major bottleneck for taxonomists is the generation of sufficient numbers of training images. Contemporary large-scale imaging projects of herbaria provide an increasing number of specimen photos, but whole-sheet images are not directly suitable for training image classification models targeted at individual taxonomically informative characters. Methodology. Here, we illustrate a time- and labor-efficient approach for generating training libraries for image classification from photos of herbarium sheets. It involves the annotation of specimen images with bounding boxes using open-source software and automated cropping of annotations with a custom script to produce the training library. We demonstrate the approach on the flower heads of a genus of Asteraceae comprising eight taxa, six species and two nontypus varieties. Pivotal results. After generating 816 training images from 33 specimen photos with a time investment of only ∼90 min, we trained an image classification model that achieved 98.2% precision and recall. Conclusions. The demonstrated approach allows taxonomists to use digitized herbarium specimens to produce training libraries for image classification models within hours. We expect that computer vision will increasingly become a part of taxonomic practice.
  • Editor: Chicago: The University of Chicago Press
  • Idioma: Inglês

Buscando em bases de dados remotas. Favor aguardar.