The following are screenshots of the steps performed on the dataset:
- How many leaves are in the optimal tree?
- Which variable was used for the first split? What were the competing splits for this first split?
- Add a second Decision Tree node to the diagram and connect it to the Data Partition node.
- How many leaves are in the optimal tree?
The tree with three-way splits has 33 leaves.
- Based on the average square error, which of the decision tree models appears to be better?
Based on the outputs above, Decision Tree 2 is the better model: with three-way splits, it has the lower validation average square error.
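
The comparison above amounts to computing the average square error of each tree on the validation partition and choosing the smaller value. A minimal Python sketch of that rule follows. The target values, the predictions, and the names `average_squared_error`, `tree1_pred`, and `tree2_pred` are hypothetical and are not taken from the Enterprise Miner output; they only illustrate the calculation.

```python
def average_squared_error(actual, predicted):
    """Mean of squared differences between targets and predictions."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)


# Hypothetical validation targets and predicted probabilities (illustrative only).
actual = [1, 0, 1, 1, 0]
tree1_pred = [0.6, 0.4, 0.7, 0.5, 0.3]  # stand-in for the first tree
tree2_pred = [0.8, 0.2, 0.9, 0.6, 0.1]  # stand-in for the three-way-split tree

ase = {
    "Decision Tree": average_squared_error(actual, tree1_pred),
    "Decision Tree 2": average_squared_error(actual, tree2_pred),
}

# The model with the lowest validation average square error is preferred.
better = min(ase, key=ase.get)
print(better)
```

Comparing on the validation partition rather than the training partition matters here: a tree with more leaves can always fit the training data more closely, so only held-out error indicates whether the extra splits generalize.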