
Clustering Using Big Data

Varsha Bansal, Neetu Sharma

Abstract


Clustering deals with finding structure in a collection of unlabeled data. Data stream mining is the process of extracting knowledge structures from continuous, rapid data records. Because big data refers to terabytes of data and clustering algorithms come with high computational costs, the question is how to apply clustering techniques to big data and obtain results in a reasonable time. Clustering helps in analyzing data visually and assists in decision making. In this paper, we discuss some clustering techniques for big data mining and provide a comparison among them.



DOI: https://doi.org/10.37628/ijocspl.v3i2.325
