A Study on Text Classification for Webmining Based Spatio Temporal Analysis of the Spread of Tropical Diseases

Wulandini, Fatimah and Nugroho, Anto Satriyo (2009) A Study on Text Classification for Webmining Based Spatio Temporal Analysis of the Spread of Tropical Diseases. Bachelor thesis, Swiss German University.

[img]
Preview
Text
Fatimah Wulandini 1-2105-052 TOC.pdf

Download (114kB) | Preview
[img] Text
Fatimah Wulandini 1-2105-052 1.pdf
Restricted to Registered users only

Download (150kB)
[img] Text
Fatimah Wulandini 1-2105-052 2.pdf
Restricted to Registered users only

Download (432kB)
[img] Text
Fatimah Wulandini 1-2105-052 3.pdf
Restricted to Registered users only

Download (613kB)
[img] Text
Fatimah Wulandini 1-2105-052 4.pdf
Restricted to Registered users only

Download (994kB)
[img] Text
Fatimah Wulandini 1-2105-052 5.pdf
Restricted to Registered users only

Download (101kB)
[img]
Preview
Text
Fatimah Wulandini 1-2105-052 Ref.pdf

Download (103kB) | Preview

Abstract

Tropical diseases such as Dengue Fever, Malaria and Bird Flu have become epidemic and particular problem in Indonesia. As the number of such cases increases, the availability of information regarding these diseases is important to facilitate experts in taking proper actions. Meanwhile, web mining is one of significant technologies applied to extract information from the web. By using web mining, spatio-temporal information of tropical diseases will be collected from the internet. This study aims to create a text classification system which classified the web document using several learning methods including naive Bayes, nearest neighbor, decision tree and support vector machine (SVM). The classification is intended to construct a spatio temporal analysis for documents classified into health. The result shows that naive Bayes and SVM achieve good performance (naïve Bayes: 95% and SVM: 92%). Multinomial distribution of naive Bayes is able to normalize the length of document while SVM performs well in high-dimensional data.

Item Type: Thesis (Bachelor)
Subjects: Q Science > QA Mathematics > QA76 Computer software > > QA76.91 Data mining
Q Science > QA Mathematics > QA76 Computer software > > QA76.93 Computer networks--Security measures
T Technology > T Technology (General) > T58.5 Information technology
Divisions: Faculty of Engineering and Information Technology > Department of Information Technology
Depositing User: Astuti Kusumaningrum
Date Deposited: 04 Nov 2020 02:40
Last Modified: 04 Nov 2020 02:40
URI: http://repository.sgu.ac.id/id/eprint/994

Actions (login required)

View Item View Item