1d0073d5cb | ||
---|---|---|
.github | ||
images | ||
LICENSE | ||
README.md | ||
cli.py | ||
epoch.py | ||
main.py | ||
plantnet_300k_env.yml | ||
utils.py |
README.md
PlantNet-300K
This repository contains the code used to produce the benchmark in the paper "Pl@ntNet-300K: a plant image dataset with high label ambiguity and a long-tailed distribution". You can find a link to the paper here. In order to train a model on the PlantNet-300K dataset, you first have to download the dataset here. If you are looking for the hyperparameters used in the paper, you can find them in the supplementary material here.
If you use this work for your research, please cite the paper:
@inproceedings{plantnet-300k,
author = {C. Garcin and A. Joly and P. Bonnet and A. Affouard and \JC Lombardo and M. Chouet and M. Servajean and T. Lorieul and J. Salmon},
booktitle = {NeurIPS Datasets and Benchmarks 2021},
title = {{Pl@ntNet-300K}: a plant image dataset with high label ambiguity and a long-tailed distribution},
year = {2021},
}
Dataset Version // Meta-data files
Make sure you download the latest version of the dataset in Zenodo (version 1.1 as in the link above, not 1.0). The difference lies in the metadata files, the images are the same. If you wish to download ONLY the metadata files (not possible in Zenodo), you will find them here.
Pre-trained models
You can find the pre-trained models here.
Requirements
Only pytorch, torchvision are necessary for the code to run. If you have installed anaconda, you can run the following command:
conda env create -f plantnet_300k_env.yml
Training a model
In order to train a model on the PlantNet-300K dataset, run the following command:
python main.py --lr=0.01 --batch_size=32 --mu=0.0001 --n_epochs=30 --epoch_decay 20 25 --k 1 3 5 10 --model=resnet18 --pretrained --seed=4 --image_size=256 --crop_size=224 --root=path_to_data --save_name_xp=xp1
You must provide in the "root" option the path to the train val and test folders. The "save_name_xp" option is the name of the directory where the weights of the model and the results (metrics) will be stored. You can check out the different options in the file cli.py.