Apache Flume Cookbook Pdf Free Download

Article PDF Available

Analyzing Social Media through Big Data using InfoSphere BigInsights and Apache Flume

Abstract and Figures

Social Media provides organizations ability to survey feelings towards the contents and events associated to them in real time. Moreover, the first demarche of the sentiment analysis is the pre-processing of data collected from Social Media. Most of existing research works that deals with social media analysis based on extracting new features related to sentiment. This paper presents the usage of Twitter in a number of proposed subjects, which is the largest social networking website where Twitter data is in increasing at higher rates every day that considers it as Big Data Source. Then, describing in detail the way in which Big data technology, such as, InfoSphere BigInsights enables processing of this data, which are primarily collected from social networks by Apache Flume and stored in Hadoop storage. In addition, we have investigated a Big Data platform for collecting social media data based on Apache Flume and analyzing this data using InfoSphere BigInsights. Moreover, our paper integrates the visualization of these analysis results using BigSheets. To that end, evaluation through analysis of results confirms that the proposed Big Data platform produces better results in terms of social media analysis.

Flume configuration files for Twitter data.

Content may be subject to copyright.

Discover the world's research

20+ million members
135+ million publications
700k+ research projects

Join for free

Content may be subject to copyright.

ScienceDirect

Available online at www.sciencedirect.com

Procedia Computer Science 113 (2017) 280–285

Peer-review under responsibility of the Conference Program Chairs.

10.1016/j.procs.2017.08.299

Peer-review under responsibility of the Conference Program Chairs.

1877-0509

Available online at www.sciencedirect.com

ScienceDirect

Procedia Computer Science 00 (2017) 000 – 000

www.elsevier.com/locate/procedia

Peer-review under responsibility of the Conference Program Chairs.

The 8th International Conference on Emerging Ubiquitous Systems and Pervasive Networks

(EUSPN 2017)

Analyzing Social Media through Big Data using InfoSphere

BigInsights and Apache Flume

Marouane Birjalia,

, Abderrahim Beni-Hssanea, Mohammed Erritalib

aLAROSERI Laboratory, Department of Computer Sciences, University of Chouaib Doukkali, Faculty of Sciences, El Jadida, Morocco

bTIAD Laboratory, University of Sultan Moulay Slimane, Faculty of Sciences and Technologies, Béni Mellal, Morocco

Abstract

Social Media provides organizations ability to survey feelings towards the contents and events associated to them in real time.

Moreover, the first demarche of the sentiment analysis is the pre-processing of data collected from Social Media. Most of existing

research works that deals with social media analysis based on extracting new features related to sentiment. This paper presents the

usage of Twitter in a number of proposed subjects, which is the largest social networking website where Twitter data is in increasing

at higher rates every day that considers it as Big Data Source. Then, describing in detail the way in which Big data technology,

such as, InfoSphere BigInsights enables processing of this data, which are primarily collected from social networks by Apache

Flume and stored in Hadoop storage. In addition, we have investigated a Big Data platform for collecting social media data based

on Apache Flume and analyzing this data using InfoSphere BigInsights. Moreover, our paper integrates the visualization of these

analysis results using BigSheets. To that end , e valuation through analysis of results confirms that the proposed Big Data platform

produces better results in terms of social media analysis.

Peer-review under responsibility of the Conference Program Chairs.

Keywords: Big Data; Hadoop; Infosphere BigInsights; Social Media Analysis; BigSheets; Twitter Data; Apache Flume.

1. Introduction

Today, the companies face growing challenges from their commercial perspective. In particular, their adding value

should be produced from huge amount of data generated and also on the data complexity that can be in structured,

* Corresponding author.

E-mail address: birjali.marouane@gmail.com

Available online at www.sciencedirect.com

ScienceDirect

Procedia Computer Science 00 (2017) 000 – 000

www.elsevier.com/locate/procedia

Peer-review under responsibility of the Conference Program Chairs.

The 8th International Conference on Emerging Ubiquitous Systems and Pervasive Networks

(EUSPN 2017)

Analyzing Social Media through Big Data using InfoSphere

BigInsights and Apache Flume

Marouane Birjalia, *, Abderrahim Beni-Hssanea, Mohammed Erritalib