-
Random Forest: "a collection of decision trees with controlled variance"
Wiki: https://en.wikipedia.org/wiki/Random_forest
Python: http://blog.yhat.com/posts/random-forests-in-python.html
paper: https://www.stat.berkeley.edu/~breiman/randomforest2001.pdf
Notes: Yali Amit, a co-creator, is head of Stats at UChicago
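Sketch: what "a collection of decision trees with controlled variance" can look like in code, standard library only. Assumptions, not Breiman's exact algorithm: each tree is a one-split stump, each tree sees a bootstrap sample and one randomly chosen feature, and the forest predicts by majority vote. All function names and the toy data are made up for illustration.

```python
import random
from collections import Counter


def majority(labels):
    """Most common label in a non-empty list."""
    return Counter(labels).most_common(1)[0][0]


def fit_stump(rows, labels, feature):
    """Fit a one-split tree on one feature: (feature, threshold, left, right)."""
    values = sorted(set(row[feature] for row in rows))
    default = majority(labels)
    # Fallback: a constant stump, used if no valid split exists.
    best = (len(labels) + 1, (feature, values[0], default, default))
    for lo, hi in zip(values, values[1:]):
        threshold = (lo + hi) / 2
        left = [y for row, y in zip(rows, labels) if row[feature] <= threshold]
        right = [y for row, y in zip(rows, labels) if row[feature] > threshold]
        if not left or not right:
            continue
        left_label, right_label = majority(left), majority(right)
        errors = sum(y != left_label for y in left)
        errors += sum(y != right_label for y in right)
        if errors < best[0]:
            best = (errors, (feature, threshold, left_label, right_label))
    return best[1]


def predict_stump(stump, row):
    feature, threshold, left_label, right_label = stump
    return left_label if row[feature] <= threshold else right_label


def fit_forest(rows, labels, n_trees=25, seed=0):
    """Train n_trees stumps, each on a bootstrap sample and a random feature."""
    rng = random.Random(seed)
    n, n_features = len(rows), len(rows[0])
    forest = []
    for _ in range(n_trees):
        sample = [rng.randrange(n) for _ in range(n)]  # draw with replacement
        feature = rng.randrange(n_features)            # random feature per tree
        forest.append(fit_stump([rows[i] for i in sample],
                                [labels[i] for i in sample],
                                feature))
    return forest


def predict_forest(forest, row):
    """Majority vote; combining many randomized trees is what damps variance."""
    return majority([predict_stump(stump, row) for stump in forest])


# Toy data (hypothetical): two well-separated clusters in two features.
data_rng = random.Random(1)
rows = [[data_rng.random(), data_rng.random()] for _ in range(20)]
rows += [[2 + data_rng.random(), 2 + data_rng.random()] for _ in range(20)]
labels = [0] * 20 + [1] * 20

forest = fit_forest(rows, labels)
accuracy = sum(predict_forest(forest, r) == y for r, y in zip(rows, labels)) / len(rows)
print("training accuracy:", accuracy)
```

For real work the linked Python post and the paper are the better starting points; this only shows the bagging-plus-voting idea.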
-
Lasso
Wiki: https://en.wikipedia.org/wiki/Lasso_(statistics)
Python: https://www.analyticsvidhya.com/blog/2016/01/complete-tutorial-ridge-lasso-regression-python/
-
Cross Validation / Stratified Cross Validation
Wiki: https://en.wikipedia.org/wiki/Cross-validation_(statistics)
R: https://machinelearningmastery.com/how-to-estimate-model-accuracy-in-r-using-the-caret-package/
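Sketch: how a stratified k-fold split can be built, standard library only. The idea is that each fold keeps roughly the same class proportions as the full data set. The function names and the toy labels are hypothetical; this is one simple way to do it (shuffle within each class, then deal indices round-robin), not the method used by caret or any other library.

```python
import random
from collections import defaultdict


def stratified_k_fold(labels, k, seed=0):
    """Return k lists of indices, each with roughly the class mix of `labels`."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    folds = [[] for _ in range(k)]
    for label in sorted(by_class):
        indices = by_class[label]
        rng.shuffle(indices)
        # Deal each class's indices round-robin so every fold gets its share.
        for position, idx in enumerate(indices):
            folds[position % k].append(idx)
    return folds


def train_test_splits(labels, k, seed=0):
    """Yield (train_indices, test_indices) pairs, one per fold."""
    folds = stratified_k_fold(labels, k, seed)
    for i, test in enumerate(folds):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield sorted(train), sorted(test)


# Toy labels (hypothetical): an imbalanced 8-vs-4 problem.
toy_labels = ["a"] * 8 + ["b"] * 4
for train, test in train_test_splits(toy_labels, k=4):
    print(test, [toy_labels[i] for i in test])
```

With 8 "a" and 4 "b" labels and k=4, every test fold holds two "a" and one "b", which a plain unstratified split does not guarantee.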
-
Data cleaning
R: https://cran.r-project.org/doc/contrib/de_Jonge+van_der_Loo-Introduction_to_data_cleaning_with_R.pdf
-
XGBoost
-
Neural Network
https://help.github.com/articles/basic-writing-and-formatting-syntax/
kdnuggets
https://www.datatau.com
https://news.ycombinator.com
https://learnxinyminutes.com/docs/r/