Professional Documents
Culture Documents
37
International Journal of Computer Applications (0975 8887)
Volume 28 No.2, August 2011
additional pre-processing is required to remove / modify HTML retrieving metadata for classification and choice of machine
and other script tags [13]. learning technique for text classification.
38
International Journal of Computer Applications (0975 8887)
Volume 28 No.2, August 2011
39
International Journal of Computer Applications (0975 8887)
Volume 28 No.2, August 2011
[15] Deerwester S., Dumais S. T., Landauer T. K., Furnas G. IEEE Transactions on Knowledge and Data Engineering,
W., and Harshman R.. 1990. Indexing by Latent Semantic Vol. 20, No. 9, pp. 1264 1272.
Analysis. Journal of American Society of Information [22] Rujiang B., and Junhua L.. 2009. A novel conception based
Science, 41(6), pp. 391-407. text classification method. In proceedings of the IEEE
[16] Church K. W., and Hanks P. 1990. Word association international e-conference on Advanced Science and
norms, mutual information and lexicography. Technology, pp. 30 34.
Computational Linguistics, Vol. 16, No. 1, pp. 22-29. [23] Wang Z., Sun X., Zhang D., Li X. 2006. An optimal SVM-
[17] Meena M. J., and Chandran K. R. 2009. Nave bayes text based text classification algorithm. In proceedings of the
classification with positive features selected by statistical 5th IEEE international conference on Machine Learning
method. In proceedings of the IEEE international and Cybernetics, pp. 1378 1381.
conference on Advanced Computing, pp. 28 33. [24] Zhang M., and Zhang D.. 2008. Trained SVMs based rules
[18] Zhang W., Yoshida T., and Tang X.. 2008. TF-IDF, LSI extraction method for text classification. In proceedings of
and Multi-word in information retrieval and text the IEEE international symposium on IT in medicine and
categorization. In proceedings of the IEEE international Education, pp. 16 19.
conference on Systems, Man and Cybernetics, pp. 108 [25] Yuan P., Chen Y., Jin H., and Huang L. 2008. MSVM-kNN
113. : Combining SVM and k-NN for multi-class text
[19] Jones K. S. 2004. IDF term weighting and IR research classification. IEEE international workshop on Semantic
lessons. Journal of Documentation, Vol. 60, No. 5, pp. 521- Computing and Systems, pp. 133 140.
523. [26] Zhang B., Su J., and Xu X. 2006. A class-incremental
[20] Wang Z., He Y., and Jiang M.. 2006. A comparison among learning method for multi-class support vector machines in
three neural networks for text classification. In proceedings text classification. In proceedings of the 5th IEEE
of the IEEE 8th international conference on Signal international conference on Machine Learning and
Processing. Cybernetics, pp. 2581 2585.
[21] Isa D., Lee L. H., Kallimani V. P., and RajKumar R. [27] Quinlan J. R. 1986. Induction of Decision Trees. Machine
2008. Text document pre-processing with the Bayes Learning, pp. 81-106.
formula for classification using the support vector machine. [28] Quinlan J. R. 1993. C4.5: Programs for Machine Learning.
Morgan Kaufmann Publishers.
40