AI & Analytics

Advies over Modelpipeline en Modelleringmethodologie

Reddit r/datascience

Samenvatting

I am doing a project for credit risk using Python. I'd love a sanity check on my pipeline and some opinions on gaps or mistakes or anything which might improve my current modeling pipeline. Also would be grateful if you can score my current pipeline out of 100% as per your assessment :) My current pipeline Import data Missing value analysis : bucketed by % missing (0–10%, 10–20%, ..., 90–100%) Zero-variance feature removal Sentinel value handling ( -1 to Nan for categoricals) Leakage variable...

Lees het volledige artikel