SnakeCLEF: Gradient Boosting for the Visual Classification of Fungi Species, 2022

Scripts, figures and working notes for the participation in SnakeCLEF 2022, part of the LifeCLEF labs at the 13th CLEF Conference, 2022.

Implementation Stack: Python, Keras/TensorFlow, XGBoost, scikit-learn.

Quick Links

The following references will help in reproducing this implementation and in extending the experiments for further analysis.

Cite Us

Link to the Research Paper

If you find our work useful in your research, don't forget to cite us:

@article{palaniappan2022deep,
  url = {https://ceur-ws.org/Vol-3180/paper-173.pdf},
  title={Deep Learning and Gradient Boosting Ensembles for Classification of Snake Species},
  author={Palaniappan, Mirunalini and Desingu, Karthik and Bharathi, Haricharan and Chodisetty, Eeswara Anvesh and Bhaskar, Anirudh},
  keywords={Ensemble Learning, Convolutional Neural Networks, Gradient Boosting Ensemble, Metadata-aided Classification, Image Classification, Transfer Learning},
  journal={Conference and Labs of the Evaluation Forum},
  publisher={Conference and Labs of the Evaluation Forum},
  year={2022},
  ISSN={1613-0073},  
  copyright = {Creative Commons Attribution 4.0 International}
}

Key Highlights

Proposed Prediction Workflow

Each observation in the dataset consists of several fungus photographs together with contextual geographic information, such as the country and the exact area where the photograph was taken (recorded on four layers), along with specific attributes such as substrate and habitat.
Each image in an observation is preprocessed and then passed through the two feature extraction networks, producing two 4096-element representation vectors.
These vectors are combined with the numerically encoded country, location at three-level precision, substrate, and habitat metadata for the image to produce a final vector of size 8198 (2 × 4096 = 8192 image features plus 6 metadata features).
The boosting ensemble classifier takes all 8198 features as input and generates a probability distribution over all candidate fungi species classes.
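
The feature-assembly step described above can be sketched as follows. This is a minimal illustration rather than the code in this repository: the metadata field names, the integer encoding scheme, and the random stand-ins for the network outputs are all assumptions made for the example. Only the vector sizes (two 4096-element vectors plus six metadata values, giving 8198 features) come from the description.

```python
import numpy as np

FEATURE_DIM = 4096  # length of each network's representation vector

# Hypothetical field names: country, three location levels, substrate, habitat.
METADATA_FIELDS = ["country", "level0", "level1", "level2", "substrate", "habitat"]


def build_encoders(records):
    """Map every distinct value of each metadata field to an integer code."""
    encoders = {}
    for field in METADATA_FIELDS:
        values = sorted({rec[field] for rec in records})
        encoders[field] = {value: code for code, value in enumerate(values)}
    return encoders


def encode_metadata(record, encoders):
    """Return the six integer codes for one image's metadata."""
    codes = [encoders[field][record[field]] for field in METADATA_FIELDS]
    return np.array(codes, dtype=np.float32)


def assemble_features(vec_a, vec_b, record, encoders):
    """Concatenate both representation vectors with the encoded metadata."""
    assert vec_a.shape == (FEATURE_DIM,) and vec_b.shape == (FEATURE_DIM,)
    return np.concatenate([vec_a, vec_b, encode_metadata(record, encoders)])


# Made-up metadata records; real values would come from the dataset.
records = [
    {"country": "DK", "level0": "A", "level1": "A1", "level2": "A1x",
     "substrate": "soil", "habitat": "forest"},
    {"country": "SE", "level0": "B", "level1": "B1", "level2": "B1y",
     "substrate": "wood", "habitat": "bog"},
]
encoders = build_encoders(records)

# Random vectors stand in for the outputs of the two feature extraction networks.
rng = np.random.default_rng(0)
features = np.stack([
    assemble_features(
        rng.random(FEATURE_DIM, dtype=np.float32),
        rng.random(FEATURE_DIM, dtype=np.float32),
        rec,
        encoders,
    )
    for rec in records
])
print(features.shape)  # (2, 8198)
```

A matrix of this shape could then be passed to a gradient boosting classifier, for example `xgboost.XGBClassifier`, whose `predict_proba` method returns one probability per class for each row.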

This workflow is depicted below,

Conclusions and Future Scope

The ensembling approach proved effective for the data-intensive, high-complexity image classification tasks that are commonly released at LifeCLEF.
The inclusion of contextual information improved the classification results: the F1-scores for the best models rose from 61.72% and 41.95% to 63.88% and 42.74%, respectively (gains of 2.16 and 0.79 percentage points).
We further conjecture that training the individual models to convergence and then applying the boosting ensembler with hyperparameter tuning will yield superior prediction performance that fully exploits the potential of the proposed architectures and methodology.
In addition, approaches involving variations in input image resolution, the use of alternative pre-trained weights [A. Joly et al.], and the addition of custom trainable layers to the frozen base model during transfer learning [M. Zhong et al.] can greatly improve the quality of feature extraction.
