How to become a data scientist? - Books, Videos, Tutorials, Resources?
Intrigued by the subject of 'data science', I began reading about it. Here are my findings so far -
What is Data Science? What is the job of a Data Scientist?
Data science is a broad term that encompasses any technology, technique or theory involving the extraction of knowledge from data. It is not limited to computer science specialists: data science is a field that brings IT, mathematics, machine learning, pattern recognition, statistics, programming etc. all under one roof. The job of a data scientist is to investigate a given complex problem on the basis of his/her expertise in any one of the above fields. That said, we can easily say that a Data Scientist cannot work alone. He/She has to work in a team of engineers, mathematicians, programmers etc.
In very crude terms, we could say that the role of a Data Scientist involves making sense of a large amount of data, without caring for the limitations of either hardware or software, building visualizations/models, and communicating his/her findings in such a way that they are understandable to others.
Online Courses to learn concepts in Data Science -
[In no particular order of importance]
1. Johns Hopkins University - 'Data Science' on Coursera
#-Link-Snipped-#
2. Data Science Certificate from Harvard Extension School
#-Link-Snipped-#
3. Introduction to Data Science - Udacity
#-Link-Snipped-#
4. Online Data Science course at Berkeley
#-Link-Snipped-#
^ All these are broad certification courses available over the internet.
Now, moving on to the topics that one has to learn (read: to master) in order to build a career as a Data Scientist -
- Python
- Hadoop
- Machine learning
- Data analysis
- Data mining
- SQL
- Statistics
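To give a rough feel for how a few of these topics (Python, SQL, statistics) fit together in practice, here is a minimal sketch using only the Python standard library. The `orders` table, the sample rows and the helper function names are all made up for illustration; a real project would work with far larger data and most likely with dedicated libraries.

```python
import sqlite3
import statistics

# Hypothetical sample data: (customer, order_amount). Invented for illustration.
ORDERS = [
    ("alice", 20.0),
    ("bob", 35.0),
    ("alice", 50.0),
    ("carol", 15.0),
    ("bob", 30.0),
]


def load_orders(rows):
    """Create an in-memory SQLite database and load the sample rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (customer TEXT, amount REAL)")
    conn.executemany("INSERT INTO orders VALUES (?, ?)", rows)
    return conn


def total_per_customer(conn):
    """SQL step: sum the order amounts for each customer."""
    query = (
        "SELECT customer, SUM(amount) FROM orders "
        "GROUP BY customer ORDER BY customer"
    )
    return dict(conn.execute(query).fetchall())


def summarize(values):
    """Statistics step: mean and median of a list of numbers."""
    return {
        "mean": statistics.mean(values),
        "median": statistics.median(values),
    }


conn = load_orders(ORDERS)
totals = total_per_customer(conn)
summary = summarize(list(totals.values()))
conn.close()

print(totals)
print(summary)
```

The SQL query does the grouping, and Python then computes summary statistics on the result. This is the "making sense of data" part of the role described above in miniature; visualization and communication of the findings would follow from here.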