Understanding the dataset

This section explains the meaning of each data attribute, which clarifies what kind of dataset we are dealing with and what preprocessing it needs. We cover the datasets in two parts:

  • Understanding the DJIA dataset
  • Understanding the NYTimes news article dataset

Understanding the DJIA dataset

The DJIA dataset has seven data attributes. They are easy to understand, so let's look at each of them in turn:

  • Date: The first column of the .csv file holds the date in the YYYY-MM-DD format.
  • Open: This indicates the price at which the market opens, so it is the opening value for the DJIA index for ...
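To make the two attributes described above concrete, here is a minimal sketch of reading them with Python's standard library. It assumes the column headers are literally Date and Open; the sample rows are made-up values for illustration only, and the function name read_djia_rows is hypothetical. The remaining attributes are left out.

```python
import csv
import io
from datetime import datetime

# Hypothetical sample rows: the numbers are invented for illustration
# and are not taken from the real DJIA dataset.
SAMPLE_CSV = """Date,Open
2016-01-04,17000.00
2016-01-05,17100.50
"""


def read_djia_rows(handle):
    """Parse Date (YYYY-MM-DD) into a date and Open into a float."""
    rows = []
    for record in csv.DictReader(handle):
        rows.append({
            "Date": datetime.strptime(record["Date"], "%Y-%m-%d").date(),
            "Open": float(record["Open"]),
        })
    return rows


# In practice you would pass an open file handle for the .csv file instead.
rows = read_djia_rows(io.StringIO(SAMPLE_CSV))
for row in rows:
    print(row["Date"].isoformat(), row["Open"])
```

Parsing the date up front means a row that does not follow the YYYY-MM-DD format raises a ValueError immediately, which is a cheap way to spot rows that need preprocessing.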
