EXPERIMENTS WITH TEXT EMBEDDING METHODS FOR EXTRACTIVE SUMMARIZATION

Authors

  • Aleksandar Kovačević Fakultet tehničkih nauka Supervisor
  • Aleksa Šaršanski Autor

DOI:

https://doi.org/10.24867/28BE39Sarsanski

Keywords:

Artificial Intelligence, Deep learning, Extractive summarization, NLP, FastText, BERT, GPT

Abstract

In this paper is implemented a solution to the problem of extractive summarization of user-written hotel accommodation reviews. The solution uses  FastText, BERT, and GPT-3.5 deep learning models. The paper provides a detailed description of how each of these models works, as well as their application methods in addressing this problem. The dataset used in this solution consists of publicly available user comments obtained through the web scraping process. The effectiveness of all three mentioned approaches in solving the problem is compared during the evaluation of the results.

References

[1] Rada Mihalcea and Paul Tarau, 2004. TextRank: Bringing Order into Texts https://aclanthology.org/W04-3252.pdf
[2] Abhishek Kumar, 2023. EXTRACTIVE TEXT SUMMARIZATION http://14.139.251.106:8080/jspui/bitstream/repository/20079/1/Abhishek%20Kumar%20Mtech.pdf
[3] Complete Guide to TensorFlow for Deep Learning with Python by Jose Portilla – Udemy course - https://www.udemy.com/course/complete-guide-to-tensorflow-for-deep-learning-with-python/
[4] What is natural language processing (NLP)? - https://www.ibm.com/topics/natural-language-processing

Published

2024-09-06

Issue

Section

Electrotechnical and Computer Engineering