

Preprocessing the Discrete Variables



In the lesson "Data Preparation. Preprocessing Discrete Variables: Creating dummies", it was explained that for the variable ‘addr_state’, grouping into categories for the final model has to be done according to the WoE values, taking into account the number of observations (borrowers) in each category.
But in the homework problem, when we preprocess the discrete variable ‘purpose’, shouldn’t we group the categories in the same way, into the following:
# small_business, educational, moving, house
# other
# renewable_energy, medical, wedding, vacation
# debt_consolidation
# major_purchase, car
But the solution is given as:
# We combine ‘educational’, ‘small_business’, ‘wedding’, ‘renewable_energy’, ‘moving’, ‘house’ in one category: ‘educ__sm_b__wedd__ren_en__mov__house’.
# We combine ‘other’, ‘medical’, ‘vacation’ in one category: ‘oth__med__vacation’.
# We combine ‘major_purchase’, ‘car’, ‘home_improvement’ in one category: ‘major_purch__car__home_impr’.
# We leave ‘debt_consolidation’ in a separate category.
# We leave ‘credit_card’ in a separate category.
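For reference, the grouping quoted from the solution could be expressed in pandas roughly as follows. This is only a minimal sketch: the toy data, the `purpose:` column prefix, and the helper name `group_purpose_dummies` are assumptions for illustration, not the course's actual code.

```python
import pandas as pd

# Hypothetical toy data; the real homework uses the loan dataset's 'purpose' column.
df = pd.DataFrame({
    "purpose": [
        "educational", "car", "other", "credit_card",
        "debt_consolidation", "house", "vacation", "home_improvement",
    ]
})

# Grouping quoted in the solution: combined category name -> original categories.
PURPOSE_GROUPS = {
    "educ__sm_b__wedd__ren_en__mov__house": [
        "educational", "small_business", "wedding",
        "renewable_energy", "moving", "house",
    ],
    "oth__med__vacation": ["other", "medical", "vacation"],
    "major_purch__car__home_impr": ["major_purchase", "car", "home_improvement"],
    "debt_consolidation": ["debt_consolidation"],
    "credit_card": ["credit_card"],
}


def group_purpose_dummies(frame, groups=PURPOSE_GROUPS, prefix="purpose:"):
    """Return one 0/1 dummy column per combined category (hypothetical helper)."""
    out = pd.DataFrame(index=frame.index)
    for new_name, members in groups.items():
        # A row gets 1 if its 'purpose' is any member of the combined category.
        out[prefix + new_name] = frame["purpose"].isin(members).astype(int)
    return out


dummies = group_purpose_dummies(df)
print(dummies)
```

Each row ends up with exactly one dummy set to 1, because the five combined categories do not overlap and together cover all values in the toy data.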
Could you explain on what basis the above grouping is done?
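To make the comparison concrete, the two quantities mentioned in the lesson (WoE and number of observations per category) can be tabulated side by side. The sketch below assumes a binary target column named `good_bad` (1 = good borrower, 0 = bad) and uses the usual definition WoE = ln(% of good / % of bad). The toy data and the function name `woe_table` are made up for illustration.

```python
import numpy as np
import pandas as pd


def woe_table(df, feature, target):
    """Per-category observation counts and Weight of Evidence.

    Assumes `target` is 1 for good borrowers and 0 for bad ones.
    A category with no good or no bad borrowers yields an infinite WoE.
    """
    table = df.groupby(feature)[target].agg(n_obs="count", n_good="sum")
    table["n_bad"] = table["n_obs"] - table["n_good"]
    # Share of all observations that fall in each category.
    table["prop_n_obs"] = table["n_obs"] / table["n_obs"].sum()
    # Share of all good (bad) borrowers that fall in each category.
    prop_good = table["n_good"] / table["n_good"].sum()
    prop_bad = table["n_bad"] / table["n_bad"].sum()
    table["WoE"] = np.log(prop_good / prop_bad)
    # Sorting by WoE places categories with similar risk next to each other.
    return table.sort_values("WoE")


# Hypothetical toy data, not the course dataset.
toy = pd.DataFrame({
    "purpose": ["car"] * 4 + ["small_business"] * 4,
    "good_bad": [1, 1, 1, 0, 1, 0, 0, 0],
})

table = woe_table(toy, "purpose", "good_bad")
print(table)
```

With a table like this sorted by WoE, neighbouring categories with similar WoE are the natural candidates for merging, and `n_obs` / `prop_n_obs` show which categories are too small to stand alone and which are large enough to keep separate.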

1 Answer

365 Team

Hi venkat, 
Thank you for your question, and apologies for the late response! Are you still having difficulties with the credit risk modeling lecture?
Thank you in advance!
365 Eli