Meta Album ID | OCR.MD_6 |
---|---|
Domain ID | OCR |
Domain Name | Optical Character Recognition |
Set Number | 2 |
Dataset ID | MD_6 |
Dataset Name | OmniPrint-MD-6 |
Short Description | Character images with a specific set of nuisance parameters |
Long Description | OmniPrint-MD-6 dataset consists of 28 120 images (128x128, RGB) from 703 categories. The images are synthesized with OmniPrint, no further processing was done. The OmniPrint synthesis parameters are stated as follows: font size is 192, image size is 128, the strength of random perspective transformation is 0.04, left/right/top/bottom margins are all 20% of the image size, the strength of pre-rasterization elastic transformation is 0.035, random translation is activated both horizontally and vertically, image blending method is Poisson Image Editing, rotation is within -60 and 60 degrees, horizontal shear is within -0.5 and 0.5, both foreground and background are images taken from a personal mobile phone. |
# Classes | 703 |
# Images | 28120 |
Keywords | ocr |
Data Format | images |
Image size | 128x128 |
License (original data release) |
CC BY 4.0 |
License URL (original data release) |
https://creativecommons.org/licenses/by/4.0/ |
License (Meta-Album data release) |
CC BY 4.0 |
License URL (Meta-Album data release) |
https://creativecommons.org/licenses/by/4.0/ |
Source | OmniPrint |
Source URL |
https://github.com/SunHaozhe/OmniPrint |
Original Author | Haozhe Sun |
Original contact | sunhaozhe275940200@gmail.com |
Meta Album author | Haozhe Sun |
Created Date | 25 June 2021 |
Contact Name | Haozhe Sun |
Contact Email | meta-album@chalearn.org |
Contact URL | https://meta-album.github.io/ |
# import openml import openml # download dataset with DATASET_ID. DATASET_ID is OpenML ID dataset = openml.datasets.get_dataset(DATASET_ID) # display dataset info print(dataset.name)
@inproceedings{sun2021omniprint, title={OmniPrint: A Configurable Printed Character Synthesizer}, author={Haozhe Sun and Wei-Wei Tu and Isabelle M Guyon}, booktitle={Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 1)}, year={2021}, url={https://openreview.net/forum?id=R07XwJPmgpl} }Download as bib
@inproceedings{meta-album-2022, title={Meta-Album: Multi-domain Meta-Dataset for Few-Shot Image Classification}, author={Ullah, Ihsan and Carrion, Dustin and Escalera, Sergio and Guyon, Isabelle M and Huisman, Mike and Mohr, Felix and van Rijn, Jan N and Sun, Haozhe and Vanschoren, Joaquin and Vu, Phan Anh}, booktitle={Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track}, url = {https://meta-album.github.io/}, year = {2022} }Download as bib