DataTaunew | comments | leaders | submitlogin
Steps to becoming a Data Scientist - breaking it down (datacamp.com)
4 points by rajitdasgupta 3449 days ago | 13 comments


2 points by jack 3448 days ago | link

I'm so sick of these info-graphics and guides. We need to address how much maths and stats is required for this field rather than just saying statements like first year linear algebra and calculus.

Figuring out where to go after 'Stats101' is one of the problems I believe 'data scientists' like myself are facing at the moment...

-----

1 point by achompas 3448 days ago | link

If you have stats 101, linear algebra, and calculus, study some machine learning basics. Perceptron and decision trees/random forests, for example, will add some CS-motivated algorithms to your toolbelt.

After that, just work on problems. Try Kaggle, develop your own projects, explore a subdomain (numerical optimization, graphical models, stochastic calculus, deep learning) and implement algorithms. The field is absurdly broad, and it takes a long time to familiarize yourself with the basics.

EDIT: Also note that the list is heavy on tech/software because the role requires engineering competence. Also also, upon reviewing the graphic, it's really topical. This stuff gets you a junior role unless you have academic experience with research.

-----

1 point by Quetelet 3446 days ago | link

Just wondering, how does one get started with numerical optimization?

-----

1 point by jack 3446 days ago | link

http://stanford.edu/~boyd/cvxbook/

or

http://intelligent-optimization.org/LIONbook/

-----

1 point by achompas 3446 days ago | link

See Nocedal and Wright or take Botd's MOOC on convex optimization. Then solve the classic meal-planning problem!

-----

1 point by jack 3448 days ago | link

I'm already in what I would consider a pretty good data role, perhaps I would be considered as a glorified data analyst.

I guess machine learning is something I need to work on. I was looking at doing a stats post grad degree but I have seen a few times that stats degrees are outdated and machine learning is the new modern data analysis. I'm just not 100% sold on investing in learning ML as recommenders are not really revenant to my work now, but maybe there is more I can get out of it.

Certainly there is a lot to think about!

-----

1 point by jonan 3448 days ago | link

I don't understand this distinction between stats and ML. ML is firmly under the umbrella of Statistics.

-----

2 points by achompas 3448 days ago | link

firmly is a strong word here, since the fields differ historically.

I like this post on stats.stackexchange, inspired indirectly by Breiman's classic paper on modeling. One commenter describes the difference as generative v. discriminative, which feels right--core ML algorithms (SVMs, neural networks) don't infer the data generating process.

http://stats.stackexchange.com/questions/6/the-two-cultures-...

-----

1 point by jack 3446 days ago | link

When I say Stats I think Classical stats, not modern (bayesian?) techniques

-----

1 point by achompas 3448 days ago | link

I hear you!

For what it's worth, ML is about so much more than recommenders. Would love to discuss via email if you'd like.

-----

1 point by jack 3446 days ago | link

Email is on my profile I believe!

-----

1 point by achompas 3446 days ago | link

Don't see it. My email is in my profile though!

-----

1 point by orangejuice 3446 days ago | link

The percentages in the 'Typical Background' section adds up to 101%...

-----




RSS | Announcements