Exaggerated news titles are used to deceive news readers and spread misleading information. This paper presents a new natural language processing (NLP) technique that identifies exaggerated news titles. The technique uses Jaccard similarity as a pre-processing step to filter out unrelated articles. The technique then applies text summarisation on the content of the news article to create a new title. Lastly, the technique applies cosine similarity to compare similar articles between the article title and the newly generated titles. The output is the classification of the news articles using the output of cosine similarity. This technique performed well in major South African news articles.
Reference:
Sefara, T.J. & Rangata, M.R. 2023. A natural language processing technique to identify exaggerated news titles. http://hdl.handle.net/10204/13147 .
Sefara, T. J., & Rangata, M. R. (2023). A natural language processing technique to identify exaggerated news titles. http://hdl.handle.net/10204/13147
Sefara, Tshephisho J, and Mapitsi R Rangata. "A natural language processing technique to identify exaggerated news titles." Lecture Notes in Networks and Systems, 757 (paper presented at ICICCT 2023: Inventive Communication and Computational Technologies) (2023): http://hdl.handle.net/10204/13147
Sefara TJ, Rangata MR, A natural language processing technique to identify exaggerated news titles; 2023. http://hdl.handle.net/10204/13147 .