JOIN performance, here are some suggestions:
- It is advisable to perform the
JOINoperation on the biggest table first and then smaller tables
- Join subsequent tables depending on which table has the most selective filter
It is very important as there are a lot of cases that the jobs run out of memory due to a heavy JOINs operations.
Share this highlighthttp://www.safaribooksonline.com/a/learning-cloudera-impala/9910826/