Plankton Dataset

Dataset details

Last updated: 15 Dec 2022
Meta Album ID SM_AM.PLK
Domain ID SM_AM
Domain Name Small Animals
Set Number 0
Dataset ID PLK
Dataset Name Plankton
Short Description Plankton dataset from WHOI
Long Description The Plankton dataset is created by researchers at the Woods Hole Oceanographic Institution (https://www.whoi.edu/). Imaging FlowCytobot (IFCB) was used for the data collection. The Complete process and mechanism are described in the paper [31]. Each image in the dataset contains one or multiple planktons. The images are captured in a controlled environment and have different orientations based on the flow of the fluid in which the images are captured and the size and shape of the planktons. The preprocessed plankton dataset is prepared from the original WHOI Plankton dataset. The preprocessing of the images is done by creating a background squared image by either duplicating the top and bottom-most 3 rows or the left and right most 3 columns based on the orientation of the original image to match the width or height of the image respectively. A Gaussian kernal of size 29x29 is applied to the background image to blur the image. Finally, the original plankton image is pasted on the background image at the center of the image. The squared background image with the original plankton image on top of it as one image is then resized into 128x128 with anti-aliasing.
# Classes 102
# Images 473273
Keywords plankton, ecology
Data Format images
Image size 128x128
License
(original data release)
MIT License
License URL
(original data release)
https://github.com/hsosik/WHOI-Plankton/blob/master/LICENSE
License
(Meta-Album data release)
MIT License
License URL
(Meta-Album data release)
https://github.com/hsosik/WHOI-Plankton/blob/master/LICENSE
Source Woods Hole Oceanographic Institution
Source URL https://github.com/hsosik/WHOI-Plankton
Original Author Heidi M. Sosik, Emily E. Peacock, Emily F. Brownlee, Eric Orenstein
Original contact hsosik@whoi.edu
Meta Album author Ihsan Ullah
Created Date 01 March 2022
Contact Name Ihsan Ullah
Contact Email meta-album@chalearn.org
Contact URL https://meta-album.github.io/

Download Meta-data files

Download Dataset from OpenML

Dataset Version OpenML ID
Micro 44238 Download
Mini 44282 Download
Extended 44317 Download

Code to download dataset using OpenML API

      # import openml
      import openml
  
      # download dataset with DATASET_ID. DATASET_ID is OpenML ID
      dataset = openml.datasets.get_dataset(DATASET_ID)
  
      # display dataset info
      print(dataset.name)
              

Sample Images

Cite this dataset

@misc{whoiplankton,
      title={Annotated Plankton Images - Data Set for Developing and Evaluating Classification Methods.}, 
      author={Heidi M. Sosik, Emily E. Peacock, Emily F. Brownlee},
      year={2015},
      DOI = {10.1575/1912/7341},
      url={https://hdl.handle.net/10.1575/1912/7341}
}
              
Download as bib

Cite Meta-Album

  @inproceedings{meta-album-2022,
    title={Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification},
    author={Ullah, Ihsan and Carrion, Dustin and Escalera, Sergio and Guyon, Isabelle M and Huisman, Mike and Mohr, Felix and van Rijn, Jan N and Sun, Haozhe and Vanschoren, Joaquin and Vu, Phan Anh},
    booktitle={Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track},
    url = {https://meta-album.github.io/},
    year = {2022}
  }
              
Download as bib