COMPARATIV ANALYSIS OF TEXTUAL MODELS BERT, BART AND XLNET FOR PREDICTING THE POPULARITY OF YOUTUBE VIDEOS
DOI:
https://doi.org/10.24867/30BE01JovicKeywords:
BERT, BART, XLNet, Fine-tuning, YoutubeAbstract
This paper talks about a comparative analysis of three text models: BERT, BART, and XLNet. The data for training and validation were taken from the website kaggle.com. There are two datasets: the initial one and an expanded one, created in an attempt to improve the recall metric. After downloading, preprocessing was done, including tokenization and stemming. The dataset was split in a ratio of 80:20 for training and validation. Different values for learning rate and batch size were tested, and the best result was achieved using the BERT large model with a learning rate of 1e-5 and a batch size of 8. The other two models produced somewhat worse results than BERT. All experiments and results will be presented below.
References
[1] Kumar, Ashok; Trueman, Tina Esther; Cambria, Erik. Fake news detection using XLNet fine-tuning model. In: 2021 International Conference on Computational Intelligence and Computing Applications (ICCICA). IEEE, 2021. p. 1-4.
[2] W. Y. Wang, „Liar, liar, pants on fire“ : A new benchmark dataset for fake news detection, in Proceedings of 55th Annual Meeting of Association for Computational Linguistics 2017, pp 2931-2937
[3] Spristav, Gaurav; Kant, Shri; Spristava, Durgesh. Design of an AI-Driven Feedback and Decision Analysis in Online Learning with Google BERT. International Journal of Intelligent Systems and Applications in Engineering, 2024, 12.10s: 629–643-629–643
[4] Chae, Youngjin; Davidson, Thomas. Large language models for text classification: From zero-shot learning to fine-tuning. Open Science Foundation, 2023.
[2] W. Y. Wang, „Liar, liar, pants on fire“ : A new benchmark dataset for fake news detection, in Proceedings of 55th Annual Meeting of Association for Computational Linguistics 2017, pp 2931-2937
[3] Spristav, Gaurav; Kant, Shri; Spristava, Durgesh. Design of an AI-Driven Feedback and Decision Analysis in Online Learning with Google BERT. International Journal of Intelligent Systems and Applications in Engineering, 2024, 12.10s: 629–643-629–643
[4] Chae, Youngjin; Davidson, Thomas. Large language models for text classification: From zero-shot learning to fine-tuning. Open Science Foundation, 2023.
Downloads
Published
2025-03-03
Issue
Section
Electrotechnical and Computer Engineering