Coffee is a leading Indonesian commodity with a diversity of aromas and flavors influenced by variety and region of origin. However, the process of identifying and classifying coffee types is still often carried out conventionally through sensory testing, which is subjective, time-consuming, and dependent on panelist expertise. This situation encourages the need for a more objective and consistent automated approach based on sensor technology and machine learning. This study aims to compare the performance of several machine learning algorithms, namely Logistic Regression, Support Vector Classifier (SVC), and Random Forest, in classifying Indonesian coffee types using multisensor Electronic Nose and Electronic Tongue data. The data used comes from gas, temperature, and pH sensors with a total of 1,503 samples representing ten coffee classes. The preprocessing stage includes data cleaning using the Interquartile Range (IQR) method to remove outliers and noise reduction using the Moving Average method. The results show that the application of data cleaning and noise reduction significantly improves the performance of all classification models. Among the algorithms tested, Random Forest showed the most stable and superior performance in classifying coffee types. These findings confirm that the combination of appropriate data preprocessing and appropriate algorithm selection plays a crucial role in improving the accuracy of machine learning-based coffee classification systems.
Copyrights © 2026