Developed by devhub.io
TensorFlowOnSpark brings TensorFlow programs onto Apache Spark clusters
Code examples that show how to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
Sparkling Water provides H2O functionality inside a Spark cluster
A better compressed bitset in Java
C# language binding and extensions to Apache Spark
Distributed Deep learning with Keras & Spark
Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data related infrastructure.
Apache Spark & Python (PySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
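An analysis of the principles behind Spark ML algorithms, together with a walkthrough of their concrete source code implementations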
Real Time Aggregation based on Spark Streaming
Cross-platform real-time collaboration client optimized for businesses and organizations.