Resolved: Improving the Model

Question

So it seems in all these years no one has attempted to improve the model. That's a bummer. I was hoping to see what others had tried.

What I came up with as an improvement, which is still no where near the 90%+ accuracy that Ilya was able to achieve, was a combination of a change to the preprocessing step along with a new parameter in the fit() method we did not learn about.

I did not undersample the data. I left all the data in tact and unbalanced. I then used a new parameter in the fit method called class_weight, which allows me to apply a higher penalty to the minority class. This way, an error for the class that is in the minority is penalized higher than the the majority one. This allows us to keep the entire dataset and not throw away information while still ensuring there is a balancing effect happening. I had to use Gemini to help me figure this out as there is no way I would have come up with this on my own as I'm still largely ignorant to what all is possilbe here.

I was able to move from an 81.25% accuracy on the undersampled dataset to an 85.88% accuracy with the class_weights dataset. I think that's pretty solid. Let me know what you think! Is there an even better way to approach this?

-Justin

Answer 1

Hi Justin,

The dataset is sorted by the targets. Moreover, shuffling before balancing does affect its accuracy.

The main reason for the accuracy issue is not that it is sorted by 1s, but because after the 1s it is sorted by some other feature, probably automatically (not sure), so in the preprocessing lecture (not exercise) it picked up some 0s that mesh nicely with the 1s to produce 91%. We have also tried running the algorithm without balancing (with some extra fixes) and we reach 91% accuracy. Therefore, I believe that there are patterns in the data that could be explained at 90+% using this particular neural network.

However, the preprocessing exercise does introduce some harmful randomness to our results. You can reach 85% validation accuracy but not more than that.

***

Note that the videos are based on the dataset with only shuffling after balancing (not on the preprocessing exercise). Therefore 91% is easily reached.

Sorry for the inconvenience!

Answer 2

Ned, thank you so much for taking the time to answer this question. I really appreciate it. You've been very helpful not only here, but also in other forums. I'll attempt what you suggested and see if that works.

-Justin

Answer 3

Hey Ned,

I can confirm I have executed the lesson with the dataset that has been shuffled only AFTER balancing. I still cannot reach that 91% accuracy.

Could the issue be the hardware? I am on an M1 Macbook Pro. I have heard that the math executed on intel vs. arm chipsets can be different due to floating point errors and such. Is this true? Is it possible this could cause the discrepency between my results and the instructors?

Here is my latest attempt with the dataset as you described.

valication accuracy: 81.88%

Resolved: Improving the Model

Submit an answer

related questions