Hey, can someone help me out with working out the gain for decision trees, please? I'm looking at the exam paper example for 2010, question 5.2. Firstly, how do we get the 0, 1 for the first I, and then how do we go from the fractions in brackets to the next line?
i.e. how does 5/14 * I(3/5, 2/5) end up being 5/14 * 0.98?
I think I'm being really doff
The 0 and 1 in the first line are because there are 4 entries where Outlook is Overcast, and all four of them are YES. So the fractions are 0/4 = 0 and 4/4 = 1.
Then you have 5/14 * I(3/5, 2/5). You take the p value, which is 3/5 = 0.6, go to the table in Appendix A (at the end of the tut), find 0.6, and it gives you the value for I, which is 0.98.
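If you want to check the table lookup yourself, here is a minimal sketch in Python, assuming I(p, q) is the usual entropy in bits, i.e. -p log2(p) - q log2(q), with a zero fraction contributing nothing. The function name `info` is just made up for this example. Computed directly, I(3/5, 2/5) comes out at about 0.971, so the 0.98 from the table is presumably that value as the appendix rounds it.

```python
import math

def info(*fractions):
    """I(p1, p2, ...) in bits. A fraction of 0 contributes 0 to the sum."""
    # p * log2(1/p) is the same as -p * log2(p), written to avoid a "-0.0" result.
    return sum(p * math.log2(1 / p) for p in fractions if p > 0)

# Overcast: 4 entries, all YES, so the fractions are 0/4 and 4/4.
overcast = info(0 / 4, 4 / 4)

# The branch from the question: 5 of the 14 entries, split 3 to 2.
branch = info(3 / 5, 2 / 5)
weighted = 5 / 14 * branch

print(overcast, round(branch, 3), round(weighted, 3))
```

A pure branch like Overcast always gives I = 0, which is why that term drops out of the sum.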