Index

A

access control lists (ACLs), Data Security in Elastic MapReduce
activities, adding in data pipeline, Adding Activities
add-instance-group option, Scheduling with the CLI
Amazon Architecture Center, Amazon EMR Distributions
Amazon Cloudwatch, Amazon EMR and the Hadoop Ecosystem
Amazon Data Pipeline
adding activities, Adding Activities
adding data nodes, Adding Data Nodes
basics of, Amazon Web Services Used in This Book, Data Filtering Design Patterns and Scheduling Work
costs of, AWS Pipeline Costs
geographic availability of, Scheduling with AWS Data Pipeline
Job Flow scheduling with, Scheduling with AWS Data Pipeline
online resources for, Amazon AWS Cost Estimation Tools
pipeline creation, Creating a Pipeline
reviewing pipeline status, Reviewing Pipeline Status
scheduling pipelines, Scheduling Pipelines
Amazon Elastic Compute Cloud (EC2)
Bash script on, Simulating Syslog Data
basics of, Amazon Web Services Used in This Book
custom instance creation, Amazon EMR Distributions
key pairs in, Utilizing Pig in Amazon EMR
management console choices, Simulating Syslog Data
online resources for, Amazon AWS Online Resources
performance improvement with, Performance
pre-configured instances, AWS Best Practices and Architecture
Amazon Elastic MapReduce (EMR)
basics of, Preface, Amazon Web Services Used in This Book, Data Collection and Data Analysis with AWS
cluster interaction, Scheduling with the CLI
cluster overview, Amazon Elastic MapReduce, EMR and EC2 usage billed by the hour
cluster types, Amazon Job Flow ...

Get Programming Elastic MapReduce now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.