measuring semantic similarity by internet based knowledge

K.Anandhi,M.Karthick,K.Deepak kumar

Published in International Journal of Advanced Research in Computer Science Engineering and Information Technology

ISSN: 2321-3337          Impact Factor:1.521         Volume:4         Issue:3         Year: 30 March,2016         Pages:545-550

International Journal of Advanced Research in Computer Science Engineering and Information Technology

Abstract

Measuring the semantic similarity between words is an important component in various tasks on the web are relation extraction, community mining, document clustering, and automatic metadata extraction.The usefulness of semantic similarity measures in these applications, accurately measuring semantic similarity between two words remains a challenging task. The project proposes an empirical method to estimate semantic similarity using page counts and text snippets retrieved from a web search engine for two words. Specifically, it defines various word co-occurrence measures using page counts and integrates those with lexical patterns extracted from text snippets. A multidimensional ontology mining method, Specificity and Exhaustivity, is also introduced in the proposed model for analyzing specified concepts. Data mining an interdisciplinary subfield of computer science, is the computational process of discovering patterns in a large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics, and database systems. The overall aim of the data mining process is to extract information from a data set and transform it into an understandable structure for further usage. Aside from the raw analysis step, it involves database and data management aspects, data pre-processing, model and the inference considerations, interestingness metrics, complexity considerations, post-processing of discovered structures, visualization, and in online updating Data mining involves six common classes of tasks: Anomaly detection the identification of unusual data records, that is interesting or data errors that require further investigation.

Kewords

Dependency modeling

Reference

[1] AlaaQasim and Mohammed Salin, “Towards Exploring Semantic Similarity based on Wordnet Semantic Dictionary,” In Proc. of the 18th Intl Joint Conf. on Artif.Intell., pp. 805-810, 2003. [2] AlekhAgarwal and Pushpak Bhattacharyya, “Augmenting Wordnet with Polarity Information on Objectives,”Comput.Linguist., 32(1), pp.13-47, 2006. [3] Anna Sinapalnikova and PavelSmrz“Knowing a word Vs. Accessing a word wordnet and Word Association Norms as Interface to Electronic Dictionaries”IEEETrans. Knowl.Data Eng., 19(3), pp.370-383, 2007. [4] AutonioSanifilippo and Stephen Tratz and MichucGreqory, “Automatic Ontological Annotation with Wordnet”, MIT Press,2004.