Improving the BERT model for long text sequences in question answering domain
Vijayan Ramaraj; Mareeswari Venkatachala Appa Swamy; Ephzibah Evan Prince; Chandhan Kumar
International Journal of Advances in Applied Sciences Vol 13, No 1: March 2024
Publisher : Institute of Advanced Engineering and Science

DOI: 10.11591/ijaas.v13.i1.pp106-115

Abstract

The text-based question-answering (QA) system aims to answer natural language questions by querying an external knowledge base. It can be applied to real-world collections such as medical documents, research papers, and crime-related documents. With this system, users do not have to go through the documents manually; the system understands the knowledge base and finds the answer based on the text and the question given to it. Earlier state-of-the-art models for natural language processing (NLP) were recurrent neural networks (RNNs) and long short-term memory (LSTM) networks. However, these models are hard to parallelize and poor at retaining contextual relationships across long text inputs. Today, bidirectional encoder representations from transformers (BERT) is the contemporary approach for NLP. BERT, however, cannot handle long text sequences: it processes only 512 tokens at a time, which makes long contexts difficult. To address this limitation, smooth inverse frequency (SIF) and the BERT model are incorporated together. A BERT model trained on the Stanford question answering dataset (SQuAD), combined with SIF, demonstrates robustness and effectiveness on long text sequences from different domains. Experimental results suggest that the proposed approach is a promising solution for QA on long text sequences.
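The abstract does not spell out the exact pipeline, but one way such a SIF-plus-BERT combination can work is to split a long document into chunks that fit within BERT's 512-token limit, rank the chunks by SIF sentence-embedding similarity to the question, and run a SQuAD-fine-tuned BERT QA model on the best-matching chunk. The sketch below is an illustration of that idea, not the authors' implementation; the embedding source (`word_vectors`, `word_freq`), the chunking strategy, and the Hugging Face model name are assumptions.

```python
# Minimal sketch: SIF-based passage selection + BERT QA on long documents.
# Assumes pretrained word vectors and unigram frequencies are available;
# the QA model name is an illustrative choice, not the paper's model.
import numpy as np
from transformers import pipeline


def sif_embeddings(sentences, word_vectors, word_freq, a=1e-3):
    """Smooth inverse frequency (SIF) embeddings: weight each word vector by
    a / (a + p(word)), average per sentence, then remove the first principal
    component shared across sentences."""
    dim = len(next(iter(word_vectors.values())))
    embs = []
    for sent in sentences:
        words = [w for w in sent.lower().split() if w in word_vectors]
        if not words:
            embs.append(np.zeros(dim))
            continue
        weights = np.array([a / (a + word_freq.get(w, 0.0)) for w in words])
        vecs = np.array([word_vectors[w] for w in words])
        embs.append(weights @ vecs / len(words))
    embs = np.array(embs)
    # Remove the common component (first right singular vector).
    u = np.linalg.svd(embs, full_matrices=False)[2][0]
    return embs - np.outer(embs @ u, u)


def answer_long_text(question, document, word_vectors, word_freq, chunk_size=400):
    """Chunk the document to fit BERT's 512-token limit, pick the chunk whose
    SIF embedding is closest to the question, and run BERT QA on it."""
    words = document.split()
    chunks = [" ".join(words[i:i + chunk_size])
              for i in range(0, len(words), chunk_size)]
    embs = sif_embeddings([question] + chunks, word_vectors, word_freq)
    q, c = embs[0], embs[1:]
    sims = c @ q / (np.linalg.norm(c, axis=1) * np.linalg.norm(q) + 1e-9)
    best_chunk = chunks[int(np.argmax(sims))]
    qa = pipeline(
        "question-answering",
        model="bert-large-uncased-whole-word-masking-finetuned-squad",
    )
    return qa(question=question, context=best_chunk)
```

In this reading, SIF serves as a lightweight retrieval step that narrows a long document down to a BERT-sized passage, which is consistent with the abstract's claim that the combined model handles long text sequences that BERT alone cannot.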