Disaster management is a strategic issue that has been widely studied as a form of mutual responsibility in reducing victims and losses due to disasters. Today, one of the sources of disaster information is spread on Twitter social media. This research explains the implementation of the C4.5 algorithm to classify and map the disaster information that delivers using social media Twitter from the official BNPB_Indonesia account. The disaster data was retrieved and then processed to display in the geographic information system form. The words combination of disaster events, victims, and disaster locations process is carried out using n-gram by divided into unigram, bigram, and trigram to obtain the vocabulary accordance with the database. The C4.5 algorithm in this research was used to classify the disaster information with several categories. The results shows that the C4.5 algorithm can be used to classified the category of disaster and could identified the disaster informasion such as type of disaster, victims, and locations. The result can provide real time information on the distribution of disaster events and their locations using geographical information system. However, for location such as name of provinces only which has many geo-position possibilities (district or sub-district). The determination of the disaster location could be difficult. In addition, to determine the information obtained from post-disaster conditions such as the number of victims, damage, and losses. The comparison of n-gram with predetermined keywords is still constrained by noise of data.
Copyrights © 2022