The abundance of tourism information in Yogyakarta often overwhelms tourists due to non-standard text data. This research develops a tourism recommendation system using Content-Based Filtering by integrating TF-IDF and Cosine Similarity algorithms, enhanced with a Word Normalizer stage. The research method involves data preprocessing including case folding, filtering, stopword removal, and stemming combined with word normalization to standardize irregular spellings. Text feature representation is calculated using TF-IDF weighting, followed by measuring similarity between destinations through vector-based Cosine Similarity. The query testing of Pantai Parangtritis against Pantai Ngandong yielded the highest similarity score of 0.9397. System performance evaluation showed a Precision@5 of 0.84, Recall@5 of 0.10, and Mean Average Precision (MAP) of 0.81. In conclusion, strengthening the method with a Word Normalizer significantly improves the validity of top-ranked recommendations, enabling tourists to accurately find relevant attractions according to their preferences.