Managing Big Data With Hadoop MapReduce For Solving The Problems Of Traditional RDBMS
  • Author(s): HTU RA
  • Paper ID: 1701904
  • Page: 89-94
  • Published Date: 15-02-2020
  • Published In: Iconic Research And Engineering Journals
  • Publisher: IRE Journals
  • e-ISSN: 2456-8880
  • Volume/Issue: Volume 3 Issue 8 February-2020
Abstract

According to the usage of Internet is extremely increasing, the amount of data being generated by everywhere makes traditional database technologies unable to store and process efficiently and effectively. Moreover, Relational Database Management System (RDBMS) is hardly possible to manage extremely large amount of data called ?Big Data? in structured, semi-structured and unstructured forms from diverse data sources. In this paper, Hadoop, distributed big data processing platform, is applied for managing big data to overcome the storage and processing issues of traditional RDBMS. For experimentation, ?Bag of Words? dataset from UCI Machine Learning Repository is utilized as unstructured big text data tested on Apache Hadoop Map Reduce Multi Node Cluster. According to the outcomes of experimentation, applying Hadoop offers not only faster execution time but also better data scalability performance for managing and processing big text data.

Keywords

Bag of Words, Big Data, Hadoop, Map Reduce, RDBMS

Citations

IRE Journals:
HTU RA "Managing Big Data With Hadoop MapReduce For Solving The Problems Of Traditional RDBMS" Iconic Research And Engineering Journals Volume 3 Issue 8 2020 Page 89-94

IEEE:
HTU RA "Managing Big Data With Hadoop MapReduce For Solving The Problems Of Traditional RDBMS" Iconic Research And Engineering Journals, 3(8)