O'Reilly logo

Learning Google BigQuery by Eric Brown, Thirukkumaran Haridass

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Sanitizing data

Most data warehousing projects follow a standard process. This process involves the extraction of data from a data source, the transformation of this data to both the standards of the data warehouse and the requirements of the end user, and the loading of data into the resulting database table. This process is more commonly known as the Extract, Transform, Load Process, or ETL for short. The transformation step is important for a few reasons:

  • Decoding of encoded values (that is, converting values of W and E to West and East)
  • Calculation of values (that is, calculating the average order value by dividing the revenue by the count of orders)
  • Splitting separated lists into individual columns
  • Aggregation
  • Data validation, either ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required