Castelli, Mauro
Unknown Affiliation

Published : 5 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 5 Documents
Search

Leveraging Feature Sets and Machine Learning for Enhanced Energy Load Prediction: A Comparative Analysis Almeida, Fernando Pedro Silva; Castelli, Mauro; Côrte-Real, Nadine
Emerging Science Journal Vol 8, No 6 (2024): December
Publisher : Ital Publication

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.28991/ESJ-2024-08-06-01

Abstract

Accurate cooling consumption forecasts are crucial for optimizing energy management, storage, and overall efficiency in interconnected HVAC systems. Weather conditions, building characteristics, and operational parameters significantly impact prediction accuracy. Since meteorological conditions highly influence cooling demand, leveraging external air data and user metrics offers a promising approach to estimate a building's hourly cooling energy usage. This study addresses the gap in existing research by comprehensively analyzing the performance of various machine learning algorithms, including ensemble learning and deep learning models, to improve prediction accuracy. By leveraging weather conditions, building characteristics, and operational parameters, we aim to predict cooling consumption across multiple systems (Cooling Ceiling, Ventilation, Free Cooling, and Total Cooling). Data from four weather stations, encompassing diverse features relevant to the European Central Bank (ECB) building's cooling consumption in Frankfurt, were employed. Our methodology includes the use of K-Nearest Neighbor, Decision Tree, Support Vector Regression, Linear Regression, Random Forest, Gradient Boosting, XGBoost, Adaboost, Long-Short-Term Memory, and Gated Recurrent Unit. Models. The results consistently demonstrate the superiority of the Random Forest model across different weather stations and feature sets. This model achieved a Mean Squared Error of approximately 0.002-0.003, Mean Absolute Error of around 0.031-0.034, and Root Mean Squared Error of about 0.052-0.069. These findings contribute to improved building cooling load management, promoting insights into optimal energy utilization and sustainable building practices. Doi: 10.28991/ESJ-2024-08-06-01 Full Text: PDF
Zero-Shot Prompting Strategies for Table Question Answering with a Low-Resource Language Jannuzzi, Marcelo; Perezhohin, Yuriy; Peres, Fernando; Castelli, Mauro; Popovič, Aleš
Emerging Science Journal Vol 8, No 5 (2024): October
Publisher : Ital Publication

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.28991/ESJ-2024-08-05-020

Abstract

This work explores the application of zero-shot prompting strategies for table question answering (TQA) in Portuguese, focusing specifically on the Text2SQL task. This task involves translating questions posed in natural language into Structured Query Language (SQL) queries, which can be executed against a database to answer the original question. Given the popularity of relational databases across various domains, advancements in this field can substantially impact the accessibility and democratization of data as simpler and more intuitive interfaces for database interaction are developed. Despite this significant potential, progress in developing Portuguese TQA solutions remains limited. The proposed approach leverages Large Language Models (LLMs)—specifically the GPT-3.5 and GPT-4 models—through zero-shot prompting. The primary objectives are to assess the effectiveness of such LLMs in this task and to identify the most suitable prompt styles. These are evaluated using a Portuguese translation of the popular Spider Text2SQL benchmark. Results reveal that the proposed approach can generate adequate SQL queries to answer Portuguese language questions about various databases, mainly when using GPT-4. The findings suggest that including schema information and database content in the prompts is critical for satisfactory outcomes. Doi: 10.28991/ESJ-2024-08-05-020 Full Text: PDF
Retrieval-Augmented Generation Assistant for Anatomical Pathology Laboratories Pires, Diogo; Perezhohin, Yuriy; Castelli, Mauro
Emerging Science Journal Vol. 9 No. 6 (2025): December
Publisher : Ital Publication

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.28991/ESJ-2025-09-06-013

Abstract

Accurate and efficient access to laboratory protocols is essential in Anatomical Pathology (AP), where up to 70% of medical decisions depend on laboratory diagnoses. However, static documentation such as printed manuals or PDFs is often outdated, fragmented, and difficult to search, creating risks of workflow errors and diagnostic delays. This study proposes and evaluates a Retrieval-Augmented Generation (RAG) assistant tailored to AP laboratories, designed to provide technicians with context-grounded answers to protocol-related queries. We curated a novel corpus of 99 AP protocols from a Portuguese healthcare institution and constructed 323 question-answer pairs for systematic evaluation. Ten experiments were conducted, varying chunking strategies, retrieval methods, and embedding models. Performance was assessed using the RAGAS framework (faithfulness, answer relevance, context recall) alongside top-k retrieval metrics. Results show that recursive chunking and hybrid retrieval delivered the strongest baseline performance. Incorporating a biomedical-specific embedding model (MedEmbed) further improved answer relevance (0.74), faithfulness (0.70), and context recall (0.77), showing the importance of domain-specialized embeddings. Top-k analysis revealed that retrieving a single top-ranked chunk (k=1) maximized efficiency and accuracy, reflecting the modular structure of AP protocols. These findings highlight critical design considerations for deploying RAG systems in healthcare and demonstrate their potential to transform static documentation into dynamic, reliable knowledge assistants, thus improving laboratory workflow efficiency and supporting patient safety.
Deep Learning in Predicting High School Grades: A Quantum Space of Representation Costa-Mendes, Ricardo; Cruz-Jesus, Frederico; Oliveira, Tiago; Castelli, Mauro
Emerging Science Journal Vol. 6 (2022): Special Issue "Current Issues, Trends, and New Ideas in Education"
Publisher : Ital Publication

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.28991/ESJ-2022-SIED-012

Abstract

This paper applies deep learning to the prediction of Portuguese high school grades. A deep multilayer perceptron and a multiple linear regression implementation are undertaken. The objective is to demonstrate the adequacy of deep learning as a quantitative explanatory paradigm when compared with the classical econometrics approach. The results encompass point predictions, prediction intervals, variable gradients, and the impact of an increase in the class size on grades. Deep learning's generalization error is lower in the student grade prediction, and its prediction intervals are more accurate. The deep multilayer perceptron gradient empirical distributions largely align with the regression coefficient estimates, indicating a satisfactory regression fit. Based on gradient discrepancies, a student's mother being an employer does not seem to be a positive factor. A benign paradigm shift concerning the balance between home and career affairs for both genders should be reinforced. The deep multilayer perceptron broadens the spectrum of possibilities, providing a quantum solution hinged on a universal approximator. In the case of an academic achievement-critical factor such as class size, where the literature is neither unanimous on its importance nor its direction, the multilayer perceptron formed three distinct clusters per the individual gradient signals. Doi: 10.28991/ESJ-2022-SIED-012 Full Text: PDF
Mathematics and Mother Tongue Academic Achievement: A Machine Learning Approach Nunes, Catarina; Beatriz-Afonso, Ana; Cruz-Jesus, Frederico; Oliveira, Tiago; Castelli, Mauro
Emerging Science Journal Vol. 6 (2022): Special Issue "Current Issues, Trends, and New Ideas in Education"
Publisher : Ital Publication

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.28991/ESJ-2022-SIED-010

Abstract

Academic achievement is of great interest to education researchers and practitioners. Several academic achievement determinants have been described in the literature, mostly identified by analyzing primary (sample) data with classic statistical methods. Despite their superiority, only recently have machine learning methods started to be applied systematically in this context. However, even when this is the case, the ability to draw conclusions is greatly hampered by the "black-box" effect these methods entail. We contribute to the literature by combining the efficiency of machine learning methods, trained with data from virtually every public upper-secondary student of a European country, with the ability to quantify exactly how much each driver impacts academic achievement on Mathematics and mother tongue, through the use of prototypes. Our results indicate that the most important general academic achievement inhibitor is the previous retainment. Legal guardian's education is a critical driver, especially in Mathematics; whereas gender is especially important for mother tongue, as female students perform better. Implications for research and practice are presented. Doi: 10.28991/ESJ-2022-SIED-010 Full Text: PDF