a performance of machine learning algorithm

J.SHARMILA,A.SUBRAMANI

Published in International Journal of Advanced Research in Computer Science Engineering and Information Technology

ISSN: 2321-3337          Impact Factor:1.521         Volume:3         Issue:3         Year: 11 November,2014         Pages:375-380

International Journal of Advanced Research in Computer Science Engineering and Information Technology

Abstract

Machine learning, a branch of Artificial Intelligence is about the construction and study of systems that can learn from data. It focuses on prediction, based on known properties learned from the training data. Data mining focuses on the discovery of unknown properties on the data. The Machine learning also employs data mining methods as unsupervised learning or as a preprocessing step to improve learner accuracy. The performance of Machine Learning Algorithm is usually evaluated with respect to the ability to reproduce known knowledge, while in Knowledge Discovery and Data Mining (KDD) the key task is the discovery of previously unknown knowledge. Evaluated with respect to known knowledge, an uninformed (unsupervised) method will easily be outperformed by supervised methods, while in a typical KDD task, supervised methods cannot be used due to the unavailability of training data...

Kewords

Machine Learning Algorithm – Data Mining – Learning.

Reference

Amati, G. and Crestani, F. 1999. Probabilistic learning for selective dissemination of information. Information Processing and Management 35, 5, 633–654. Androutsopoulos, I., Koutsias, J., Chandrinos, K. V., and Spyropoulos, C. D. 2000.An experimental comparison of naive Bayesian and keyword-based anti-spam filtering with personal e-mail messages. In Proceedings of SIGIR-00, 23rd ACM International Conference on Research and Development in Information Retrieval (Athens, GR, 2000), pp. 160–167. Apt´e, C., Damerau, F. J., and Weiss, S. M. 1994. Automated learning of decision rules for text categorization. ACM Transactions on Information Systems 12, 3, 233–251. Attardi, G., Di Marco, S., and Salvi, D. 1998. Categorization by context. Journal of Universal Computer Science 4, 9, 719–736. Baker, L. D. and McCallum, A. K. 1998. Distributional clustering of words for text classification. In Proceedings of SIGIR-98, 21st ACM International Conference on Research and Development in Information Retrieval (Melbourne, AU, 1998), pp. 96–103. Belkin, N. J. and Croft, W. B. 1992. Information filtering and information retrieval: two sides of the same coin? Communications of the ACM 35, 12, 29–38. Biebricher, P., Fuhr, N., Knorz, G., Lustig, G., and Schwantner, M. 1988. The automatic indexing system AIR/PHYS. From research to application. In Proceedings of SIGIR-88, 11th ACM International Conference on Research and Development in Information Retrieval (Grenoble, FR, 1988), pp. 333–342. Also reprinted in [Sparck Jones and Willett 1997], pp. 513–517. Borko, H. and Bernick, M. 1963. Automatic document classification. Journal of the Association for Computing Machinery 10, 2, 151–161. Caropreso, M. F., Matwin, S., and Sebastiani, F. 2001. A learner-independent evaluation of the usefulness of statistical phrases for automated text categorization. In A. G. Chin Ed., Text Databases and Document Management: Theory and Practice. Hershey, US: Idea Group Publishing. Forthcoming.