Tag archive

Data Transformation

4 articles tagged with "Data Transformation"

Explore all tags

The Problem of Data Cascades in Data Science

A cascade is a form of a "reinforcing feedback loop" that amplifies changes or trends in only one direction. This unidirectional nature makes it a compounding force. A "Data Cascade" is essentially a series of compounding events that lead to undesirable outcomes due to poor data quality.

Read article ->

Parquet for Data Science

A Parquet file uses less space for storage, is faster to process and is highly optimized for Data Science workflows. Should you use a Parquet file as your primary file format of choice?

Read article ->