Big data Analytics study using advanced cluster computing framework of APACHE SPARK.

Project Abstract / Summary : This project is analyzing huge amount of data using "APACHE SPARK" , an advanced cluster computing frame work for BIG DATA. Big data analytics using Apache Spark is being famous among many companies as it provides faster analysis of huge amount of data generated from various sources and to get the desired results . Apache Spark has become famous due to its "in memory cluster computing" which provides 100x faster computing than the existing big data platforms such as HADOOP. It has also ecosystems which include iterative algorithms for machine learning, graph processing libraries and especially "Live Stream Processing " support
[​IMG]
[​IMG]

which is not provided by HADOOP and which is very useful for processing of live streaming data.

[​IMG]


Our interest in this project is to take a real world huge DATA-SETS and to run them on SPARK , and to extract useful information from the huge data sets, in typically a lesser processing time. what information we extract depends on the data set we consider , and currently we are learning and analyzing how to work with the APACHE SPARK.

Why did you choose to work on this project topic : As in the today's world huge amount of data is being generated day by day, it is needed to analyze such huge data for large companies to estimate consumer behaviour, also in Medical field, and many other fields, data analytics based on big data is an important advanced research in the data anlytics, where research aims at developing tools which are efficient and which takes less processing time in processing huge amounts of data.
Hence , the advanced big data framework of APACHE SPARK, which is already being implemented and are supported for development and research by many big companies around the world such as IBM, AMAZON, GOOGLE etc.

Project Highlights : Big data analytics has been linked with our day to day life from "placing of relevant ads by GOOGLE on the web pages based on the user's browsing", to "Wheather forecasting ". We don't say that our project has to win this contest, but we as "computer science graduates", We want to learn & contribute to big data analytics which is going to play a considerable impact on our lives in the future and as "APACHE SPARK", the advanced and effecient big data frame work has very less developers & Contributors as of now and hence we want to learn and contribute to it.


Project Category : CS / IT / Networking
------------------------------------------------------
Institute/College Name: National institute of technology, Raipur
City: raipur
State: chattisgarh
Participating Team From: Final Year

Replies

  • Irfan Amin
    Irfan Amin
    I am also interested in Big Data Analysis can you suggest me how can i adopt this path. Currently I learnt about architectures like Fermi, Kepler etc
    Looking for your kind response.
    -Irfan Amin

You are reading an archived discussion.

Related Posts

A group of scientists from the Brigham and Women's Hospital (BWH) has engineered an integrated organ-on-chip system powered by the much hyped, Google glass. In an open source paper, the...
SanDisk, one of the global leaders among digital storage solution providers has announced a new 200 GB wireless flash drive in India, called 'Sandisk Connect Wireless Stick'. Priced at Rs....
Can Aerospace engineer fill the form of IES paper ?
I made a small Effort to bring the list of some useful books, which you find easily on market and on internet. 1.The ICJ has published a compendium of articles...
The period of an undamped mass supported on a spring is equal to 2 .pie sq.rt of M/K A useful approximation for buildings with a regular distribution of mass and...