Claim Missing Document
Check
Articles

Found 5 Documents
Search

Hybrid Cloud Architecture for Efficient and Cost-Effective Large Language Model Deployment Xin, Qi
Journal of Information System and Informatics Vol 7 No 3 (2025): September
Publisher : Universitas Bina Darma

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.51519/journalisi.v7i3.1170

Abstract

Large Language Models (LLMs) have achieved remarkable success across natural language tasks, but their enormous computational requirements pose challenges for practical deployment. This paper proposes a hybrid cloud–edge architecture to deploy LLMs in a cost-effective and efficient manner. The proposed system employs a lightweight on-premise LLM to handle the bulk of user requests, and dynamically offloads complex queries to a powerful cloud-hosted LLM only when necessary. We implement a confidence-based routing mechanism to decide when to invoke the cloud model. Experiments on a question-answering use case demonstrate that our hybrid approach can match the accuracy of a state-of-the-art LLM while reducing cloud API usage by over 60%, resulting in significant cost savings and a ~40% reduction in average latency. We also discuss how the hybrid strategy enhances data privacy by keeping sensitive queries on-premise. These results highlight a promising direction for organizations to leverage advanced LLM capabilities without prohibitive expense or risk, by intelligently combining local and cloud resources.
Uncertainty-Aware Late Fusion for 3D Perception (Confidence Calibration + Fusion Rule Learning) Xin, Qi
Journal of Technology Informatics and Engineering Vol. 4 No. 1 (2025): APRIL | JTIE : Journal of Technology Informatics and Engineering
Publisher : University of Science and Computer Technology

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.51903/jtie.v4i1.485

Abstract

Late fusion remains attractive for multi-sensor 3D perception because it preserves independent sensor pipelines, enables modular upgrades, and supports rigorous ablation experiments. This paper presents an uncertainty-aware late-fusion framework that combines per-modality confidence calibration with learning a fusion rule. We conduct full experimental evaluations on a PandaSet-style LiDAR+camera subset comprising 10 multi-frame sequences and 2,200 synchronized frames, with 49,549 annotated 3D objects across the Car, Pedestrian, and Cyclist classes. The framework calibrates LiDAR and camera confidence using temperature scaling and isotonic regression, estimates uncertainty-conditioned localization variance, and fuses associated candidates using multiple rules (max, mean, product/odds, and Dempster-Shafer) as well as a learned fusion rule (logistic regression trained on association features). On the test split, isotonic calibration reduces LiDAR Expected Calibration Error from 0.260 to 0.006 and Negative Log-Likelihood from 0.410 to 0.110, and it similarly improves camera confidence quality. Although mean Average Precision (mAP) remains similar to a LiDAR-only baseline in this controlled setting, calibrated late fusion provides substantially better decision reliability at fixed confidence thresholds and maintains conservative high-precision behavior under camera dropout. These results support an engineering conclusion: confidence calibration is the highest-leverage upgrade for late fusion in safety-critical stacks, and fusion rule choice can be tuned to downstream risk preferences.
LiDAR–Camera Object-Level Fusion for Multi-Target Tracking Using JPDA and EKF: A Reproducible Empirical Study on a PandaSet-Parameterised Five-Sequence Dataset Xin, Qi
Journal of Technology Informatics and Engineering Vol. 5 No. 1 (2026): APRIL | JTIE : Journal of Technology Informatics and Engineering
Publisher : University of Science and Computer Technology

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.51903/jtie.v5i1.486

Abstract

Multi-target tracking in cluttered scenes is essential for automated driving, where downstream planning requires stable object identities and accurate state estimates. This paper provides a fully reproducible empirical and sensitivity study of a classical object-level LiDAR–camera fusion tracker that combines Joint Probabilistic Data Association (JPDA) with an Extended Kalman Filter (EKF) under a constant-velocity state model. Because the MathWorks PandaSet subset is distributed as a ZIP archive that cannot be ingested into our execution environment, we generate a PandaSet-parameterised five-sequence synthetic dataset with explicitly specified sampling rates, measurement noise, detection probabilities, and Poisson clutter, and report end-to-end results with fixed random seeds. Using sequential fusion (LiDAR JPDA–EKF update followed by a camera bearing update), we obtain a mean MOTA of 0.880 and a mean position RMSE of 0.361 m, compared with LiDAR-only JPDA–EKF MOTA of 0.883 and RMSE of 0.395 m. Fusion, therefore, improves localization accuracy while sometimes reducing MOTA due to additional association ambiguity introduced by camera clutter; this trade-off is discussed in terms of downstream use cases that prioritize state accuracy. Sensitivity sweeps show that probabilistic association degrades more gracefully than hard nearest-neighbor assignment as clutter increases and delineate regimes where camera information is beneficial. A camera-only bearing tracker is included as a diagnostic baseline (not as a competitive approach); as expected, given the observability limits, it is not reliable under the studied clutter conditions. The dataset specification, parameters, and reporting artefacts form a reproducible template for diagnosing JPDA/EKF tracking and object-level fusion.
Self-Supervised Customer Representation Learning for Segmentation and Next-Purchase Prediction on UCI Online Retail Xin, Qi
J-INTECH ( Journal of Information and Technology) Vol 14 No 01 (2026): Journal of Information and Technology
Publisher : LPPM Universitas Bhinneka Nusantara

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.32664/j-intech.v14i01.2229

Abstract

Customer analytics in financial retail, payments, and bank marketing frequently relies on segmentation and propensity prediction, but transactional logs are sparse, high-dimensional, and only weakly labeled. This paper presents a fast and reproducible self-supervised learning pipeline that converts raw e-commerce transactions into customer representations and evaluates them on two downstream tasks: customer segmentation and next-purchase prediction. We conduct full experimental evaluation on the UCI Online Retail dataset (541,909 invoice-line transactions from 2010-12-01 to 2011-12-09). After deterministic cleaning (removing cancellations and non-positive prices/quantities), 397,884 valid line items remain, spanning 4,338 customers, 18,532 invoices, 3,665 products, and 37 countries. For each customer we construct an ordered invoice sequence and define a canonical item per invoice (the item with the largest aggregated quantity). For each invoice transition we build a dual-view customer state vector that concatenates a lifetime purchase count view and a recent-window view (30 days), then learn embeddings via TF-IDF reweighting and truncated SVD. To increase robustness we introduce a denoising ridge projection (DRP) objective: a linear denoising model trained to map corrupted TF-IDF state vectors back to clean SVD embeddings without using labels, which yields denoised customer embeddings for downstream models. Our main contribution is an applied, computationally light integration of TF-IDF+SVD embeddings with a denoising linear projection for reuse across segmentation and next-purchase prediction, rather than a fundamentally new learning paradigm. In next-purchase prediction restricted to the 200 most frequent target items, a multinomial logistic model trained on DualDRP embeddings achieves Hit@20=0.587, outperforming MostPopular (Hit@20=0.327) and Markov (Hit@20=0.291). In segmentation we apply k-means clustering and analyze cluster-level RFM statistics and dominant products, showing that the learned embeddings recover actionable segments such as high-value frequent buyers and low-activity long-tail customers. All results, tables, and figures are generated with fixed random seeds and are reproducible in this environment
Self-Supervised Log Anomaly Detection with LogBERT-Style Transformers: Full Empirical Evaluation on a Reproducible SynHDFS Benchmark Xin, Qi
JEECS (Journal of Electrical Engineering and Computer Sciences) Vol. 11 No. 1 (2026): JEECS (Journal of Electrical Engineering and Computer Sciences) - In press
Publisher : Fakultas Teknik Universitas Bhayangkara

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.54732/jeecs.v11i1.3

Abstract

Log-based anomaly detection is a core problem in AIOps because system logs provide fine-grained evidence of failures, performance regressions, and security incidents. Recent work has shown that self-supervised sequence modeling substantially improves generalization compared with purely frequency-based detectors, especially when labeled anomalies are scarce. This paper presents a LogBERT-style transformer framework for session-level log anomaly detection and reports a complete, reproducible experimental evaluation. Due to download constraints of large archived log datasets in this environment, we construct a faithful fallback benchmark, SynHDFS-6k, which mimics HDFS-style block workflows by composing normal execution patterns and injecting five realistic anomaly types. SynHDFS-6k contains 6000 sessions with a fixed 5.0% anomaly rate and a vocabulary of 20 event templates. We train a two-layer transformer encoder with masked language modeling on normal sessions only and derive an anomaly score using pseudo log-likelihood (PLL) computed by masking each token position once. We compare against unigram and bigram probabilistic models, PCA reconstruction error, one-class SVM, isolation forest, a DeepLog-style GRU next-event predictor, and a supervised logistic regression upper bound. On the SynHDFS-6k test split, the proposed LogBERT-PLL achieves Precision=0.615, Recall=0.533, F1=0.571, ROC-AUC=0.898, and PR-AUC=0.594. We additionally analyze transformer scoring strategies (PLL mean, PLL top-k, PLL max, random masking, and CLS Mahalanobis), report runtime and model capacity trade-offs, and quantify per-anomaly-type detection behavior. The study provides an end-to-end blueprint for transformer-based self-supervised log anomaly detection under a fully specified protocol, and it highlights strengths and limitations that inform deployment on real-world HDFS logs.