Large Language Models (LLMs) have experienced rapid development and have been established as the dominant paradigm in modern Natural Language Processing (NLP), with high performance demonstrated across various language understanding and generation tasks. Increasing architectural complexity has led to the need for a structured conceptual framework to explain how architectural design, training paradigms, and inference mechanisms are collectively associated with model behavior. A conceptual and analytical review of LLMs is presented in this article through an examination of the relationship between Transformer-based architectures, multi-stage training processes, and the resulting capabilities and limitations. Encoder-only, decoder-only, and encoder–decoder architectural variants are examined in relation to structural characteristics and functional implications. The roles of pretraining, supervised fine-tuning, and instruction tuning are analyzed to clarify how output characteristics are shaped during model development. This study emphasizes how architectural and training strategies causally influence generative capabilities and inherent limitations. Fundamental issues, including hallucination, bias, data dependency, computational cost, and evaluation challenges, are critically examined as consequences of the probabilistic modeling paradigm adopted in LLMs. This review contributes a structured analytical perspective for evaluating LLMs design choices and their operational consequences, supporting more informed development and deployment practices.
Copyrights © 2026