Samenvatting
I am doing a project for credit risk using Python. I'd love a sanity check on my pipeline and some opinions on gaps or mistakes or anything which might improve my current modeling pipeline. Also would be grateful if you can score my current pipeline out of 100% as per your assessment :) My current pipeline Import data Missing value analysis : bucketed by % missing (0–10%, 10–20%, ..., 90–100%) Zero-variance feature removal Sentinel value handling ( -1 to Nan for categoricals) Leakage variable...