serverless
Serverless Framework – Build web, mobile and IoT applications with serverless architectures using AWS Lambda, Azure Functions, Google CloudFunctions & more! –
data-science-ipython-notebooks
Recently updated with 50 new notebooks! Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various
avro-hadoop-starter
Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
tdigest
t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark
hbase-object-mapper
Java-annotation based compact utility library for HBase that helps you: [1] convert objects of your bean-like classes to HBase rows and vice-versa [2] define data access classes for random access of HBase rows
social-news-bigdata
HN/Reddit social news websites big data analytic with Hadoop Hive and MapReduce. Discover correlation between a website's content and number of votes it receives on social news websites such as Reddit and HackerNews.
EasyMapReduce
EasyMapReduce leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.
- «
- »