Publication: Classification of Turkish tweets by document vectors and investigation of the effects of parameter changes on classification success
No Thumbnail Available
Date
2020-06-13
Authors
Authors
Bilgin, Metin
Journal Title
Journal ISSN
Volume Title
Publisher
Yıldız Teknik Üniversitesi
Abstract
Natural language processing is an artificial intelligence field which is gaining in popularity in recent years. To make an emotional deduction from texts related to an issue, or classify documents are of great importance considering the increasing data size in today's world. Understanding and interpreting written texts is a feature that pertains to people. But, it is possible to deduce from texts or classify texts using natural language processing which is a sub-branch of machine learning and artificial intelligence. In this study, both text classification was made on Turkish tweets, and text classification success of method parameter changes was investigated using two different methods of the algorithm mentioned as document vectors in the literature. It was found in the study that as well as higher accuracy values were obtained by the DBoW (Distributed Bag of Words) method than DM (Distributed Memory) method; higher accuracy values were also obtained by DBoW-NS (Negative Sampling) architecture than others.
Description
Keywords
Text classification, Natural language processing, Document vectors, Doc2vec, Sentiment analysis, Deep learning, Engineering