This study presents a new approach for constructing a medical knowledge graph using Named Entity Recognition (NER) to identify entities such as diseases, drugs, or medical procedures, alongside part-of-speech (POS) tagging and dependency parsing to determine words that function as verbs and roots. These extracted words are then used as relations between entities, forming triplets in the format (entity, relation, entity). While the knowledge graph provides a structured representation of medical information, the evaluation primarily reflects the performance of the underlying NLP pipeline (NER, POS tagging, and dependency parsing) used to generate the triplets. Quantitative evaluation was performed using metrics such as precision, recall, and F1-score to assess the accuracy and completeness of entity and relation extraction. The qualitative evaluation involved medical domain experts to assess the relevance and validity of the relationships derived. The results indicate that fine-tuning a pre-trained model for NER and leveraging a pre-trained model for POS tagging and dependency parsing can effectively generate accurate triplets for constructing a medical knowledge graph. This approach demonstrated strong performance, achieving high evaluation scores in both quantitative and qualitative evaluations.
Copyrights © 2025