Python wrapper to access Hadoop HDFS REST API
-
Updated
Oct 26, 2016 - Python
Python wrapper to access Hadoop HDFS REST API
MapReduce Image Processing framework for Hadoop
Easy way to write java objects to apache orc files.
Spark Streaming via Kafka
Reproducing a bug about decommission monitor thread spending too much cpu time
Count the number of times a word occurs in 1GB (Big Data) Dataset of books using hadoop map-reduce
Ingestion pipeline to analyze soccer tweets
A debian:jessie based Spark + HadoopDFS docker container.
Apache Pig Latin script to count letters in multiple input text files, using the HortonWorks Hadoop Sandbox or Google Cloud Platform
Secure Erase utility for HDFS
Bulk I/O Dispatch, i.e. BID Schemes. We have designed and developed two contention avoidance storage solutions, collectively known as BID: Bulk I/O Dispatch, for big data environment. BID-HDD is a disk scheduling scheme. BID-Hybrid is another contention avoidance scheme using hybrid tiers of storage for improving HDD performance using SSDs. In t…
Setup hadoop cluster manually and automatically
Examples of hadoop implementations with different datasets.
Add a description, image, and links to the hadoop-filesystem topic page so that developers can more easily learn about it.
To associate your repository with the hadoop-filesystem topic, visit your repo's landing page and select "manage topics."