Journal Collabits
Vol 3, No 1 (2026)

COMPARATIVE ANALYSIS OF LINEAR REGRESSION AND RANDOM FOREST FOR USED CAR PRICE PREDICTION

Syamsudi, Muhammad Faris Adjil (Unknown)
Daffa, Bimo Arya (Unknown)
Jarodi, Wisnu (Unknown)



Article Info

Publish Date
22 Feb 2026

Abstract

Manual estimation is often subjective and prone to human bias because the used car market has a complex pricing structure with non-linear depreciation. Objective: This study conducted a comparative analysis between Linear Regression and Random Forest algorithms to develop a more objective pricing model. Methods: The Kaggle dataset contains 5,000 entries indicating features such as manufacturer, model, engine size, and mileage for this study. The methodology included data cleaning, feature engineering, and outlier removal using the IQR method. For training and testing, the data was split 80:20. Results: "Year of Manufacture" was identified as the feature that most significantly influences price, and the evaluation results showed a significant difference in performance. Linear Regression achieved 82.33% accuracy, while Random Forest achieved 99.60% accuracy. Conclusion: Random Forest captures non-linear patterns and complex relationships in used car pricing better than Linear Regression, although it remains quite reliable for general trends.

Copyrights © 2026






Journal Info

Abbrev

collabits

Publisher

Subject

Computer Science & IT Engineering

Description

Journal Collabits adalah jurnal yang membahas strategi keamanan cyber untuk meningkatkan kinerja dan keandalan dalam implementasi teknologi kecerdasan buatan (AI), kecerdasan bisnis (BI), dan sains data, yang di kelola oleh Fakultas Ilmu Komputer (FASILKOM) terdiri dari dua prodi yaitu Teknik ...