Couldn't interpret Categorical variables

Question

How does using Categorical variables justifies the principle of PCA ?
How can we transform categorical variables using the characteristics of standard normal distribution ?
It doesn't make sense to represent the categorical variables which have no position in geometry ?
It could lead to loss of information .

Answer 1

Hi Swarntam!
Thanks for reaching out!
In this particular case, categorical variables are part of the data we're working on and we convert them into dummies in order to participate in the process of building principal components. Categorical data is not from a normal distribution since the categories don't have an order.
Hope this helps but feel free to post another question if needed. Thank you.
Best,
Ivan

Couldn't interpret Categorical variables

Submit an answer

related questions