Machine Learning
Data Quality Testing for ML Pipelines: Great Expectations, Soda, and Deequ
ML models fail silently when their input data changes. A column that was never null suddenly has 30% nulls. A categorical feature gains new values the model was never trained on. A numeric distribution shifts because a upstream system changed its calculation. Data quality testing catches these problems before they