Solved – Random Forests for predictor importance (Matlab)

I'm working with a dataset of approximately 150,000 observations and 50 features, using SVM for the final model. To trim down the feature count, I decided to look into using RF so SVM optimization doesn't take too long. I'm currently using the TreeBagger implementation in Matlab and had a few questions. When investigating feature importances, … Read more

Solved – Random Forest and Decision Tree Algorithm

A random forest is a collection of decision trees following the bagging concept. When we move from one decision tree to the next decision tree then how does the information learned by last decision tree move forward to the next? Because, as per my understanding, there is nothing like a trained model which gets created … Read more

Solved – randomForest vs randomForestSRC discrepancies

There are two popular R packages to build random forests introduced by Breiman (2001): randomForest and randomForestSRC. I am noticing small, yet significant discrepancies in terms of accuracy between the two packages, even when I try to use the same input parameters. I understand we would expect a slightly different random forest, but in example … Read more

Solved – randomForest vs randomForestSRC discrepancies

There are two popular R packages to build random forests introduced by Breiman (2001): randomForest and randomForestSRC. I am noticing small, yet significant discrepancies in terms of accuracy between the two packages, even when I try to use the same input parameters. I understand we would expect a slightly different random forest, but in example … Read more

Solved – Regression random forest and highly skewed response distribution

There is a great deal of information on how unbalanced data sets may impact predictive accuracy in classification problems. Several solutions have been proposed (see here). My questions are: Can a highly skewed target distribution (i.e. when the response variable is continuous and not categorical) create similar problems in a regression random forest? The response … Read more

Solved – Regression random forest and highly skewed response distribution

There is a great deal of information on how unbalanced data sets may impact predictive accuracy in classification problems. Several solutions have been proposed (see here). My questions are: Can a highly skewed target distribution (i.e. when the response variable is continuous and not categorical) create similar problems in a regression random forest? The response … Read more

Solved – Combining randomForests in R, why are the err.rate, mse and rsq components NULL

Closed. This question is off-topic. It is not currently accepting answers. This question appears to be off-topic because EITHER it is not about statistics, machine learning, data analysis, data mining, or data visualization, OR it focuses on programming, debugging, or performing routine operations within a statistical computing platform. If the latter, you could try the … Read more

Solved – Combining randomForests in R, why are the err.rate, mse and rsq components NULL

Closed. This question is off-topic. It is not currently accepting answers. This question appears to be off-topic because EITHER it is not about statistics, machine learning, data analysis, data mining, or data visualization, OR it focuses on programming, debugging, or performing routine operations within a statistical computing platform. If the latter, you could try the … Read more