Skills required for data science
Skills required for data science xGXPy
There are 3 overarching topical skill sets that make-up data science (well, actually many more, but 3 that are the root topics): “Pure” Math (Calculus, Linear Algebra)Statistics (technically math, but it’s a more applied version)Programming (Generally in Python/R)
Start with learning the math (2–3 months)Move into programming tutorials purely on the language you’re using… don’t get caught up in the machine learning side of coding until you feel confident writing ‘regular’ code (1 month)Start jumping into machine learning codes, following tutorials. Kaggle is an excellent resource for some great tutorials (see the Titanic data set). Pick an algorithm you see in tutorials and look up how to write it from scratch. Really dig into it. Follow along with tutorials using pre-made datasets like this: Tutorial To Implement k-Nearest Neighbors in Python From Scratch (1–2 months)Really jump into one (or several) short term project(s) you are passionate about, but that aren’t super complex. Don’t try to cure cancer with data (yet)… maybe try to predict how successful a movie will be based on the actors they hired and the budget. Maybe try to predict all-stars in your favorite sport based on their stats (and the stats of all the previous all stars). (1+ month)
There are 3 overarching topical
skill
sets that
make
-up data science (well, actually
many
more,
but
3 that are the root topics): “Pure” Math (Calculus, Linear Algebra)Statistics (
technically
math,
but
it’s a more applied version)Programming (
Generally
in Python/R)
Start
with learning the math (2–3 months)
Move
into programming
tutorials
purely
on the language you’re using… don’t
get
caught up in the machine learning side of coding until you feel confident writing ‘regular’ code (1 month)
Start
jumping into machine learning codes, following
tutorials
.
Kaggle
is an excellent resource for
some
great
tutorials
(
see
the Titanic data set). Pick an algorithm you
see
in
tutorials
and look up how to write it from scratch.
Really
dig into it. Follow along with
tutorials
using pre-made datasets like this: Tutorial To Implement k-Nearest Neighbors in Python From Scratch (1–2 months)
Really
jump into one (or several) short term project(s) you are passionate about,
but
that aren’t super complex. Don’t try to cure cancer with data (
yet
)… maybe try to predict how successful a movie will
be based
on the actors they hired and the budget. Maybe try to predict all-stars in your favorite sport based on their stats (and the stats of all the previous all stars). (1+ month)
Do not write below this line