Deep learning models demonstrate a high degree of accuracy in image classification. The task of distinguishing between various sources of red fox images—such as authentic photographs, game-captured images, hand-drawn illustrations, and AI-generated images—raises important considerations regarding realism, texture, and style. This study conducts an evaluation of three deep learning architectures: Inception V3, VGG-16, and VGG-19, utilizing images of red foxes. The research employs Silhouette Graphs, Multidimensional Scaling (MDS), and t-Distributed Stochastic Neighbor Embedding (t-SNE) to assess clustering and classification efficiency. Support Vector Machines (SVM) and Logistic Regression are utilized to compute the Area Under the Curve (AUC), Classification Accuracy (CA), and Mean Squared Error (MSE). The MDS plots and t-SNE data clearly demonstrate the capability of the three deep learning models to distinguish between the image categories. For game-captured images, VGG-16 and VGG-19 demonstrate quite outstanding performance with silhouette scores of 0.398 and 0.315, respectively. This study explores the enhancement of classification accuracy in logistic regression and support vector machines (SVM) through the refinement of decision boundaries for overlapping categories. Utilizing Inception V3, an artificial intelligence-generated image silhouette score of 0.244 was achieved, demonstrating proficiency in image classification. The research highlights the challenges posed by diverse datasets and the efficacy of deep learning models in the classification of red fox images. The findings suggest that integrating deep learning with machine learning classifiers, such as logistic regression and SVM, may improve classification accuracy.
Copyrights © 2025