A novel cloud based elastic framework for big data preprocessing

[thumbnail of 2014 A_Novel_Cloud_Based_Elastic_Framework_fo.pdf]
Preview
Text - Accepted Version
· Please see our End User Agreement before downloading.
| Preview

Please see our End User Agreement.

It is advisable to refer to the publisher's version if you intend to cite from this work. See Guidance on citing.

Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email

Dawelbeit, O. and McCrindle, R. (2014) A novel cloud based elastic framework for big data preprocessing. In: 6th Computer Science and Electronic Engineering Conference (CEEC), 2014, September 25-26, Essex, UK. doi: 10.1109/CEEC.2014.6958549

Abstract/Summary

A number of analytical big data services based on the cloud computing paradigm such as Amazon Redshift and Google Bigquery have recently emerged. These services are based on columnar databases rather than traditional Relational Database Management Systems (RDBMS) and are able to analyse massive datasets in mere seconds. This has led many organisations to retain and analyse their massive logs, sensory or marketing datasets, which were previously discarded due to the inability to either store or analyse them. Although these big data services have addressed the issue of big data analysis, the ability to efficiently de-normalise and prepare this data to a format that can be imported into these services remains a challenge. This paper describes and implements a novel, generic and scalable cloud based elastic framework for Big Data Preprocessing (BDP). Since the approach described by this paper is entirely based on cloud computing it is also possible to measure the overall cost incurred by these preprocessing activities.

Altmetric Badge

Item Type Conference or Workshop Item (Paper)
URI https://reading-clone.eprints-hosting.org/id/eprint/69738
Identification Number/DOI 10.1109/CEEC.2014.6958549
Refereed Yes
Divisions Life Sciences > School of Biological Sciences > Department of Bio-Engineering
Uncontrolled Keywords Big data, Cloud computing, Program processors, Google, Runtime, Educational institutions, Computer science
Download/View statistics View download statistics for this item

Downloads

Downloads per month over past year

University Staff: Request a correction | Centaur Editors: Update this record

Search Google Scholar