Published in International Journal of Advanced Research in Computer Science Engineering and Information Technology
ISSN: 2321-3337 Impact Factor: 1.521 Volume: 5 Issue: 3 Year: 22 March, 2015 Pages: 384-389
The Internet has become an inseparable part of daily life throughout the world. It gives us access to many kinds of information and makes everyday tasks such as shopping easier, and we can look up the exact details of the products we intend to purchase online. Often, however, because of various issues, users visit an online store without purchasing any product. To improve their business, e-commerce companies therefore need to keep complete details about the activity on their website. Using Hadoop and MongoDB, the required details about a website can be obtained in a short time. Hadoop provides Map-Reduce, one of the most widely used programming models for parallel processing. This paper proposes a new methodology to improve the business of e-commerce companies: all relevant information related to their website is kept using MongoDB and Hadoop, and a final aggregated result is obtained that supports decisions to improve the business.
Hadoop, MongoDB, Map-Reduce, Sharding, Data Aggregation
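As an illustration of the kind of aggregation the abstract describes, the following is a minimal single-process sketch of the Map-Reduce pattern in plain Python. It is not the paper's implementation and uses neither Hadoop nor MongoDB; the event records, their field names (`product`, `action`), and the function names are assumptions made purely for illustration. The map step emits one key-value pair per website event, the shuffle step groups pairs by product, and the reduce step sums views and purchases per product.

```python
from collections import defaultdict

# Hypothetical click-stream records; the field names are illustrative assumptions.
EVENTS = [
    {"product": "laptop", "action": "view"},
    {"product": "laptop", "action": "view"},
    {"product": "laptop", "action": "purchase"},
    {"product": "phone", "action": "view"},
    {"product": "phone", "action": "view"},
    {"product": "phone", "action": "view"},
]


def map_event(event):
    """Map step: emit (product, (views, purchases)) for a single event."""
    if event["action"] == "purchase":
        value = (0, 1)
    else:
        value = (1, 0)
    yield event["product"], value


def shuffle(pairs):
    """Shuffle step: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(key, values):
    """Reduce step: sum the view and purchase counts for one product."""
    views = sum(v for v, _ in values)
    purchases = sum(p for _, p in values)
    return key, {"views": views, "purchases": purchases}


def run_job(events):
    """Run map, shuffle, and reduce in sequence on an in-memory list of events."""
    pairs = (pair for event in events for pair in map_event(event))
    return dict(reduce_counts(k, vs) for k, vs in shuffle(pairs).items())


if __name__ == "__main__":
    print(run_job(EVENTS))
```

In this toy data set the aggregated result shows that "phone" was viewed three times but never purchased, which is the sort of signal an e-commerce company could use when deciding what to improve. In a real deployment the map and reduce functions would run in parallel across a cluster rather than in one process.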