Index

A

  1. addAccumulator method

  2. addInPlace method

  3. Alternating least squares (ALS)

  4. awaitTermination() method

B

  1. Batch processing

  2. Big Data systems, Spark

    1. acyclic graph

    2. canonical word-count

    3. MapReduce programming model

    4. Samza messages

    5. sensor network

    6. SQL to NoSQL

    7. stream-processing system

    8. Web 2.0 applications

    9. local execution

    10. .sbt file

    11. standalone cluster mode

    12. YARN

C

  1. cache() function

  2. Call data record (CDR)

  3. Case-class method

  4. Cassandra Query Language (CQL)

  5. ChiSqSelector

  6. Chi-square selection

  7. Clickstream dataset

  8. Collaborative filtering

  9. compute() method

  10. createCombiner function

  11. Custom receiver

    1. HttpInputDStream

    2. receiver interface method

D

  1. Data frame

    1. avoid shuffling

    2. cache aggressively

    3. MLlib

    4. persistence

    5. query transformation

      1. action

      2. aggregation expression

      3. cube operation

      4. DataFrameNaFunctions

      5. DataFrameStatFunctions ...

Get Pro Spark Streaming: The Zen of Real-Time Analytics Using Apache Spark now with the O’Reilly learning platform.