AI & Analytics

Precision and recall > .90 on holdout data

Reddit r/datascience 6 Apr 2026, 18:41

Samenvatting

I'm running ML models (XGBoost and elastic net logistic regression) predicting a 0/1 outcome in a post period based on pre period observations in a large unbalanced dataset. I've undersampled from the majority category class to achieve a balanced dataset that fits into memory and doesn't take hours to run. I understand sampling can distort precision or recall metrics. However I'm testing model performance on a raw holdout dataset (no sampling or rebalancing). Are my crazy high precision and r...

Lees het volledige artikel

Deepen your knowledge

Knowledge Base

Precision and recall > .90 on holdout data

Samenvatting

Deepen your knowledge

AI in Power BI — Copilot, Smart Narratives and more

ChatGPT and BI — How AI is transforming data analysis

Predictive Analytics — What can it do for your business?

Precision and recall > .90 on holdout data

Samenvatting

Deepen your knowledge

AI in Power BI — Copilot, Smart Narratives and more

ChatGPT and BI — How AI is transforming data analysis

Predictive Analytics — What can it do for your business?

Gerelateerde artikelen

Building A Bulletproof Strategy For Data Recovery (Sponsored)

The Geometry Behind the Dot Product: Unit Vectors, Projections, and Intuition

AI Isn’t Coming For Your Job: Automation Is

Do MLEs actually reduce your workload in your job?