Spark
Overview
Libraries
- intel-analytics/BigDL: BigDL: Distributed Deep Learning Library for Apache Spark
- GoogleCloudPlatform/spark-on-k8s-operator: Kubernetes CRD operator for specifying and running Apache Spark applications idiomatically on Kubernetes.
Cassandra
- Getting Started with Apache Spark and Cassandra | DataStax Academy: Free Cassandra Tutorials and Training
- datastax/spark-cassandra-connector: DataStax Spark Cassandra Connector
- Getting Started with Spark & Cassandra - YouTube
- (Spark + Cassandra) * Docker = · GitHub
Hive
Numba
- GPU Computing With Apache Spark And Python | Schedule | Spark Summit 2016
- Distributed Deep Learning on Multiple GPUs with Spark - Deeplearning4j: Open-source, Distributed Deep Learning for the JVM
- GPU Support In Spark And GPU:CPU Mixed Resource Scheduling At Production Scale - YouTube
REST interface
Livy
- cloudera/livy: Livy is an open source REST interface for interacting with Apache Spark from anywhere
- Building a REST Job Server for Interactive Spark as a Service
spark-jobserver
- spark-jobserver/spark-jobserver: REST job server for Apache Spark
- spark-jobserver/python-sjsclient: Python client for Spark Jobserver Rest API
Tools
Books
Applications
Similarity
Resources
- SparkHub - A Community Site for Apache Spark
- Big Data Processing with Apache Spark, Part 1: Introduction
- Introduction to Spark | Yunfeng’s Hexo Blog
- AMP Camp 4 hands-on exercises
- spark-programming-guide-zh-cn · GitBook
- Compared with Hadoop, how should we view Spark technology? - Zhihu
- Spark examples · GitBook
- Docker IV: Spark for Cassandra Data Analysis - Nico’s Blog