Clarify Practice Exam 2 - Machine Learning Section Question 7 & 8
Question 7: I executed the code below as written in the reviewed answer and i got the answer in days. Is this correct?
diff_Days = pd.to_datetime(df_music['career_end_date'], format = '%Y-%m-%d') - pd.to_datetime(df_music['Career_start_date'], format = '%Y-%m-%d')
data:image/s3,"s3://crabby-images/64ad1/64ad1f0cc8d91cf8e203a5b4d085ae5e4dadff56" alt=""
Question 8 - I proceeded to question 8 and I got this error below: that unit 'Y' is not supported.
How do I resolve this please?
data:image/s3,"s3://crabby-images/07133/07133f800c5bf2156bffc983dd03207f337413d7" alt=""
data:image/s3,"s3://crabby-images/c9473/c94734866ae19b64f3b5350944654c576040343b" alt=""
Hi Nike!
Thanks for reaching out.
Question 7: Yes, when you subtract two `datetime` objects in Pandas, the result is a `Timedelta` object that represents the difference in days (and potentially in hours, minutes, and seconds if the time is also provided).
Question 8: The error regarding the unit 'Y' not being supported is likely because you tried to convert the `Timedelta` object into years using a method or function that doesn't recognize 'Y' as a valid unit for year. In Pandas, you can't directly convert a `Timedelta` to years because a year can vary in length (due to leap years). However, you can approximate this by dividing the number of days in the `Timedelta` by 365.25 (taking into account leap years on average):
years = diff_Days / pd.Timedelta(days=365.25)
This will give you the duration in years as a floating-point number. If you need to operate on a `Timedelta` object and want to extract the number of years, always ensure to handle it with a proper conversion like the above, since there's no direct 'Y' unit in Pandas time deltas.
Hope this helps.
Best,
Tsvetelin
Where do we get the absenteeism_module file?