Show simple item record

dc.contributor.authorBayrak Ş.H.
dc.contributor.authorTakçi H.
dc.contributor.authorEminli M.
dc.date.accessioned2019-07-27T12:10:23Z
dc.date.accessioned2019-07-28T09:32:40Z
dc.date.available2019-07-27T12:10:23Z
dc.date.available2019-07-28T09:32:40Z
dc.date.issued2013
dc.identifier.issn1303-0914
dc.identifier.urihttps://hdl.handle.net/20.500.12418/5598
dc.descriptionIstanbul Universityen_US
dc.description.abstractThe rising opportunities of communication provided us with many documents in many different languages. Language identification has a key role for these documents to be understandable and to study natural language identification procedures. The increasing number of the documents and international communication requirements make new works on language identification obligatory. Until today, there have been a great number of studies on solving language identification problem about document based language identification. In these studies, characters, words and n-gram sequences have been used with machine learning techniques. In this study, sequence of n-gram frequencies will be used and using of the five different classification algorithms' accuracy performances will be analyzed via different sizes of documents belonging to 15 different languages. N-gram based feature method will be used to extract feature vector belonging to languages. The most appropriate method for the problem of language identification will be identified by comparing the performances of the Support Vector Machines, Multilayer Perceptron, Centroid Classifier, k-Means and Fuzzy C Means methods. During the experiments, trainining and testing data will be selected from ECI multilingual corpus.en_US
dc.language.isoengen_US
dc.rightsinfo:eu-repo/semantics/closedAccessen_US
dc.subjectDocument based language identificationen_US
dc.subjectECI corpusen_US
dc.subjectMachine learning algorithmsen_US
dc.subjectN-gram feature extraction methoden_US
dc.titleLanguage identification based on n-gram feature extraction method by using classifiersen_US
dc.typearticleen_US
dc.relation.journalIstanbul University - Journal of Electrical and Electronics Engineeringen_US
dc.contributor.departmentBayrak, Ş.H., Halic University, Department of Computer Engineering, Istanbul, Turkey -- Takçi, H., Cumhuriyet University, Department of Computer Engineering, Sivas, Turkey -- Eminli, M., Halic University, Department of Computer Engineering, Istanbul, Turkeyen_US
dc.identifier.volume13en_US
dc.identifier.issue2en_US
dc.identifier.endpage1639en_US
dc.identifier.startpage1629en_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record