Acne is a common inflammatory skin condition that can affect an individual’s psychological well-being and overall quality of life. The inability to independently recognize specific types of acne often leads to the use of inappropriate skincare products. This situation highlights the need for an image-based classification system that can provide accurate visual identification. The self-supervised learning method Distillation with NO Labels, version 2 (DINOv2), is employed as a feature extractor to classify four types of acne—Acne fulminans, Acne nodules, Papules, and Pustules—using the “skin-90” dataset. The fine-tuning process is conducted through a Parameter-Efficient Fine-Tuning (PEFT) approach using Low-Rank Adaptation (LoRA) to adjust the model’s visual representations to the acne domain without updating all parameters in full, followed by integration with a classification head. The results show that the model achieves an accuracy of 90.70%, with precision, recall, and F1-score values of 90.64%, 90.68%, and 90.57%, respectively. The findings suggest that the proposed architectural design and training configuration are suitable for capturing relevant visual patterns of acne, while further validation is required to assess robustness across more diverse data distributions.