where $N$ is the total number of examples, $N_v$ is the number of examples reaching inner node $v$ and $\Delta i(v)$ is the impurity decrease achieved at node $v$. Here, the second sum in equation (1) runs over all inner nodes $v$ in tree $t$ where feature dimension $d$ is selected as split feature.

What is your opinion on the summarized work? Or do you know related work that is of interest? Let me know your thoughts in the comments below: