Skip to content

hadoop

Back to application catalog

Description

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models.

Homepage

https://hadoop.apache.org/

Available Versions on RCAC Clusters

Cluster Versions
ANVIL 3.3.0
BELL 3.4.0
NEGISHI 3.3.2

Module

You can load the module by:

module load hadoop

Note for using hadoop

Run module spider hadoop beforehand to check if this version requires any prerequisite modules.