Enhancing Causal Text Detection Using Uncertainty-Weighted Machine Learning Ensembles

Publication Type : Journal Article

Publisher : MDPI AG

Source : Informatics

Url : https://doi.org/10.3390/informatics13030037

Campus : Coimbatore

School : School of Artificial Intelligence

Year : 2026

Abstract : Causal inference in text data has been a demanding objective in natural language processing, mainly because the ambiguity and context sensitivity inherent in text induce uncertainty. Reducing this uncertainty is essential for identifying reliable causal connections and improving predictive consistency. In this research, we introduce an uncertainty-aware ensemble architecture that combines multiple text embedding schemes with both linear and nonlinear classifiers to improve causal text detection. Both sparse and neural-level embeddings were employed and combined with an ensemble weighting approach based on two uncertainty estimation techniques: entropy-based and KL divergence-based. Unlike conventional ensemble methods with uniform or fixed voting strategies, our approach assigns weights inversely proportional to classifier uncertainty, so that confident models exert greater influence on the final decisions. Our results show that TF-IDF, through its effective word frequency weighting scheme, consistently outperforms the other embedding techniques across both linear and nonlinear classifiers on both datasets (News Corpus and CausalLM–Adjective group). The experimental results also show that the uncertainty-aware ensemble approach improves both calibration and prediction confidence. Entropy-based weighting improves confidence for linear classifiers, with accuracy, F1-score, entropy and prediction confidence values of 94.3%, 94.0%, 0.382 and 0.774, respectively, while for nonlinear classifiers the KL divergence-based weighting performs better, with an accuracy of 97.6%, an F1-score of 97.2%, a mean KL divergence of around 0.055 and a log loss of 0.221.
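
The weighting idea described in the abstract (weights inversely proportional to classifier uncertainty) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the use of mean Shannon entropy as the per-classifier uncertainty score, the smoothing constant and the toy probabilities are all assumptions made for illustration. The KL divergence-based variant is not shown, since the abstract does not state which reference distribution it uses.

```python
import math

def entropy(p, eps=1e-12):
    """Shannon entropy (in nats) of a single probability vector."""
    return -sum(q * math.log(q + eps) for q in p)

def inverse_entropy_weights(probs_per_model, eps=1e-6):
    """Return one weight per classifier, inversely proportional to its
    mean predictive entropy (assumed uncertainty score), normalised to sum to 1."""
    mean_entropies = [
        sum(entropy(p) for p in probs) / len(probs)
        for probs in probs_per_model
    ]
    raw = [1.0 / (h + eps) for h in mean_entropies]  # eps avoids division by zero
    total = sum(raw)
    return [r / total for r in raw]

def weighted_ensemble(probs_per_model, weights):
    """Combine per-classifier class probabilities by a weighted average."""
    n_samples = len(probs_per_model[0])
    n_classes = len(probs_per_model[0][0])
    return [
        [
            sum(w * probs[i][c] for w, probs in zip(weights, probs_per_model))
            for c in range(n_classes)
        ]
        for i in range(n_samples)
    ]

# Hypothetical outputs of two classifiers on two sentences
# (columns: P(non-causal), P(causal)).
confident = [[0.95, 0.05], [0.10, 0.90]]
uncertain = [[0.55, 0.45], [0.50, 0.50]]

weights = inverse_entropy_weights([confident, uncertain])
combined = weighted_ensemble([confident, uncertain], weights)
labels = [max(range(len(row)), key=row.__getitem__) for row in combined]
```

In this toy setup the lower-entropy classifier receives the larger weight, so the combined probabilities follow it more closely than a uniform vote would.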

Cite this Research Publication : Sivachandra K B, Neethu Mohan, Mithun Kumar Kar, Sikha O K, Sachin Kumar S, Enhancing Causal Text Detection Using Uncertainty-Weighted Machine Learning Ensembles, Informatics, MDPI AG, 2026, https://doi.org/10.3390/informatics13030037
