Debugging

When you run your producer and the data does not show up in the Redshift table, remember that delivery is delayed. The delay is set when you create the Kinesis delivery stream and defaults to 3,600 seconds. Set it to the minimum of 60 seconds if you want to avoid long waits. These are the places to check when your data is not streaming into your database:

  1. Check S3: The S3 prefix corresponds to a folder in the bucket you have defined. If there are errors, you will see a new subfolder called errors or processing errors. Click through the subfolders until you reach the actual error file, make it public (there's a button), download the file, and examine it. It will sometimes contain useful information. The error subfolder ...
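The S3 check above can also be scripted rather than clicked through in the console. The following is a minimal sketch, not the book's method: it assumes boto3 is installed and configured, that error files sit under a folder whose name contains "error", and the helper names `iter_keys` and `find_error_keys` are hypothetical.

```python
from typing import Iterable, Iterator, List


def iter_keys(s3_client, bucket: str, prefix: str) -> Iterator[str]:
    """Yield every object key under `prefix`, following pagination.

    `s3_client` is expected to behave like boto3.client("s3").
    """
    paginator = s3_client.get_paginator("list_objects_v2")
    for page in paginator.paginate(Bucket=bucket, Prefix=prefix):
        # Pages with no matching objects carry no "Contents" entry.
        for obj in page.get("Contents", []):
            yield obj["Key"]


def find_error_keys(keys: Iterable[str]) -> List[str]:
    """Keep keys that live inside a folder whose name contains "error".

    Assumption: the error subfolder name contains the word "error".
    Only folder segments are inspected, not the file name itself.
    """
    matches = []
    for key in keys:
        folders = key.split("/")[:-1]
        if any("error" in folder.lower() for folder in folders):
            matches.append(key)
    return matches


# Example usage (bucket and prefix names are placeholders):
#
#   import boto3
#   s3 = boto3.client("s3")
#   keys = iter_keys(s3, "my-firehose-bucket", "my-prefix/")
#   for key in find_error_keys(keys):
#       print(key)
```

Once a key is found, the boto3 S3 client's `download_file(bucket, key, filename)` can fetch the file with your own credentials, which avoids having to make the object public first.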
