Resolved: Reflection on Categorial Variables
Although result of applying describe() method on categorical varibles are not meaningful, we can see that the mean of sex column is 0.45700. Can we infer from this that more female visited store than male as females take the responsibility of managing of household?
thanks for reaching out! You are right that over 50% of the users in the dataset are females, however the margins are very close, around a 4% difference. So, in this case the difference in numbers isn't sufficient enough to make any conclusions, and even if the difference was significant it would be difficult to conclude what the direct reason for that is. It could be that the area where the data is pulled from has a bigger female population, or something else entirely. So inferring the cause for imbalanced data can be difficult, if we don't have so much prior information on the data. Later when we form the segments, we make some conclusion based on the data within the segments themselves, and you're welcome to form your hypothesis there and share your findings here in the Q & A.
Hope this helps!