Abstract
Butterflies are one of the most vibrant and diverse species, making them a crucial subject of study in various fields such as biology, conservation, and environmental science. Their classification based on visual features is an essential task for ecological monitoring and biodiversity research, as the diversity and distribution of butterfly species can serve as indicators of environmental health. Manual classification, however, is not only labor-intensive but also prone to errors, particularly when large datasets are involved or when species exhibit subtle visual differences. With advancements in artificial intelligence, particularly deep learning, automated image classification systems have gained significant traction. Convolutional Neural Networks (CNNs) have emerged as powerful tools for image recognition, excelling in tasks requiring the analysis of complex patterns, textures, and colors. In this research, we explore the task of butterfly image classification using multiple machine learning techniques, including CNNs, Random Forest, K-Nearest Neighbors (KNN), and Support Vector Machine (SVM). The dataset comprises images of various butterfly species, and the goal is to accurately classify them based on their visual features. We employ several data augmentation techniques to make the model more robust to variations in image quality and orientation.
Furthermore, the models are evaluated based on accuracy scores, confusion matrices, and other performance metrics such as precision, recall, and F1-score. The study highlights the superior performance of CNNs in complex image classification tasks and discusses potential improvements, such as incorporating transfer learning, to enhance model accuracy and generalizability in future research.