Insects Dataset

Dataset details

Last updated: 15 Dec 2022
Meta Album ID SM_AM.INS
Domain ID SM_AM
Domain Name Small Animals
Set Number 2
Dataset ID INS
Dataset Name Insects
Short Description Insects dataset from the SPIPOLL citizen science project
Long Description The original Insects dataset was created by the National Museum of Natural History, Paris (https://www.mnhn.fr/fr). It contains more than 290,000 images in different sizes and orientations. Its classes are hierarchical, listed from top to bottom as Order, Super-Family, Family, and Taxon. Each image shows an insect in its natural environment or habitat, i.e., either on a flower or near vegetation. The images were collected by researchers and hundreds of volunteers from the SPIPOLL citizen science project (https://www.spipoll.org/) and uploaded to a centralized server through the SPIPOLL website, the Android application, or the iOS application. The preprocessed Insects dataset is derived from the original by cropping each image from either side to make it square, then resizing the cropped image to 128x128 using OpenCV with an anti-aliasing filter.
# Classes 117
# Images 170506
Keywords insects, ecology
Data Format images
Image size 128x128
License (original data release) CC BY-NC 2.0
License URL (original data release) https://www.spipoll.org/mentions-legales
License (Meta-Album data release) CC BY-NC 2.0
License URL (Meta-Album data release) https://creativecommons.org/licenses/by-nc/2.0/
Source SPIPOLL; National Museum of Natural History, Paris
Source URL https://www.spipoll.org/
Original Author Gregoire Lois, Colin Fontaine, Jean-Francois Julien
Original contact contact@spipoll.org
Meta Album author Ihsan Ullah
Created Date 01 March 2022
Contact Name Ihsan Ullah
Contact Email meta-album@chalearn.org
Contact URL https://meta-album.github.io/
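The preprocessing summarized in the Long Description (crop to a square, then resize to 128x128 with OpenCV) could look roughly like the sketch below. It is an illustration under stated assumptions, not the Meta-Album authors' actual script: the function names are hypothetical, the crop is assumed to be centered (equal amounts removed from both sides), and `cv2.INTER_AREA` is assumed as the anti-aliasing interpolation, since the description does not name the filter.

```python
def center_square_box(width, height):
    """Return (left, top, right, bottom) of the largest centered square.

    Assumption: the crop removes equal margins from both sides of the
    longer dimension; the dataset description does not spell this out.
    """
    side = min(width, height)
    left = (width - side) // 2
    top = (height - side) // 2
    return left, top, left + side, top + side


def preprocess(image, size=128):
    """Crop an image array (height x width x channels) to a square and resize it."""
    # Imported lazily so center_square_box can be used without OpenCV installed.
    import cv2

    height, width = image.shape[:2]
    left, top, right, bottom = center_square_box(width, height)
    square = image[top:bottom, left:right]
    # INTER_AREA averages source pixels when shrinking, which limits aliasing.
    # It is an assumed choice, not one confirmed by the dataset description.
    return cv2.resize(square, (size, size), interpolation=cv2.INTER_AREA)
```

An image loaded with `cv2.imread(path)` can be passed directly to `preprocess`; the result is a 128x128 array with the same number of channels.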

Download Dataset from OpenML

Dataset Version   OpenML ID
Micro             44276
Mini              44306
Extended          44340

Code to download dataset using OpenML API

      import openml

      # DATASET_ID is the OpenML ID from the table above (44276 is the Micro version)
      DATASET_ID = 44276

      # Download the dataset with the given OpenML ID
      dataset = openml.datasets.get_dataset(DATASET_ID)

      # Display the dataset name
      print(dataset.name)
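
To avoid hard-coding an ID, the three versions listed in the table can be kept in a small lookup. This is only a convenience sketch: the names `INSECTS_OPENML_IDS`, `insects_openml_id`, and `fetch_insects` are hypothetical and not part of the OpenML or Meta-Album APIs. The IDs are copied from the table above.

```python
# OpenML IDs for the three Insects versions, copied from the table above.
INSECTS_OPENML_IDS = {"micro": 44276, "mini": 44306, "extended": 44340}


def insects_openml_id(version):
    """Return the OpenML ID for a version name such as 'Micro' or 'mini'."""
    key = version.strip().lower()
    if key not in INSECTS_OPENML_IDS:
        valid = ", ".join(sorted(INSECTS_OPENML_IDS))
        raise ValueError(f"Unknown version {version!r}; expected one of: {valid}")
    return INSECTS_OPENML_IDS[key]


def fetch_insects(version="micro"):
    """Fetch the chosen version's dataset object from OpenML (needs network access)."""
    # Imported lazily so the ID lookup works without the openml package installed.
    import openml

    return openml.datasets.get_dataset(insects_openml_id(version))
```

For example, `fetch_insects("mini").name` would print the name of the Mini version, assuming the `openml` package is installed and the server is reachable.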
              

Cite this dataset

@article{insects, 
    title={Data quality and participant engagement in citizen science: comparing two approaches for monitoring pollinators in France and South Korea}, 
    author={Serret, Hortense and Deguines, Nicolas and Jang, Yikweon and Lois, Gregoire and Julliard, Romain}, 
    journal={Citizen Science: Theory and Practice}, 
    volume={4}, 
    number={1}, 
    pages={22}, 
    year={2019} 
}
              

Cite Meta-Album

  @inproceedings{meta-album-2022,
    title={Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification},
    author={Ullah, Ihsan and Carrion, Dustin and Escalera, Sergio and Guyon, Isabelle M and Huisman, Mike and Mohr, Felix and van Rijn, Jan N and Sun, Haozhe and Vanschoren, Joaquin and Vu, Phan Anh},
    booktitle={Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track},
    url = {https://meta-album.github.io/},
    year = {2022}
  }
              