Testing
Amazon Deequ Data Quality Testing at Scale with Apache Spark
Amazon Deequ is an open-source data quality library built on Apache Spark, developed and used internally at Amazon to validate the quality of datasets at petabyte scale. Unlike SQL-based tools that work on a single database, Deequ runs as part of your Spark job — making it ideal for validating data