Summary

In this chapter, we learned what data munging is and why it is necessary for data science. Julia provides functionalities to facilitate data munging with the DataFrames.jl package, with features such as these:

  • NA: A missing value in Julia is represented by a specific data type, NA.
  • DataArray: DataArray provided in the DataFrames.jl provides features such as allowing us to store some missing values in an array.
  • DataFrame: DataFrame is 2-D data structure like spreadsheets. It is very similar to R or pandas's dataframes, and provides many functionalities to represent and analyze data. DataFrames has many features well suited for data analysis and statistical modeling.
  • A dataset can have different types of data in different columns.
  • Records have ...

Get Julia for Data Science now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.