Downloads of v 3.3.1:


Last Update:

16 Jun 2021

Package Maintainer(s):

Software Author(s):

  • Apache Software Foundation


apache admin hadoop hdfs yarn cluster mapreduce namenode datanode


  • 1
  • 2
  • 3

3.3.1 | Updated: 16 Jun 2021



Downloads of v 3.3.1:



Software Author(s):

  • Apache Software Foundation

  • 1
  • 2
  • 3
Hadoop 3.3.1

  • 1
  • 2
  • 3

This Package Contains an Exempted Check

Not All Tests Have Passed

Validation Testing Passed

Verification Testing Passed


Scan Testing Exemption for this package version only:

File too large to be scanned


To install Hadoop, run the following command from the command line or from PowerShell:


To upgrade Hadoop, run the following command from the command line or from PowerShell:


To uninstall Hadoop, run the following command from the command line or from PowerShell:


NOTE: This applies to both open source and commercial editions of Chocolatey.

1. Ensure you are set for organizational deployment

Please see the organizational deployment guide

  • Open Source or Commercial:
    • Proxy Repository - Create a proxy nuget repository on Nexus, Artifactory Pro, or a proxy Chocolatey repository on ProGet. Point your upstream to Packages cache on first access automatically. Make sure your choco clients are using your proxy repository as a source and NOT the default community repository. See source command for more information.
    • You can also just download the package and push it to a repository Download

3. Enter your internal repository url

(this should look similar to

4. Choose your deployment method:

choco upgrade hadoop -y --source="'STEP 3 URL'" [other options]

See options you can pass to upgrade.

See best practices for scripting.

Add this to a PowerShell script or use a Batch script with tools and in places where you are calling directly to Chocolatey. If you are integrating, keep in mind enhanced exit codes.

If you do use a PowerShell script, use the following to ensure bad exit codes are shown as failures:

choco upgrade hadoop -y --source="'STEP 3 URL'"

Write-Verbose "Exit code was $exitCode"
$validExitCodes = @(0, 1605, 1614, 1641, 3010)
if ($validExitCodes -contains $exitCode) {
  Exit 0

Exit $exitCode

- name: Ensure hadoop installed
    name: hadoop
    state: present
    version: 3.3.1
    source: STEP 3 URL

See docs at

chocolatey_package 'hadoop' do
  action    :install
  version  '3.3.1'
  source   'STEP 3 URL'

See docs at

    Name: hadoop,
    Version: 3.3.1,
    Source: STEP 3 URL

Requires Otter Chocolatey Extension. See docs at

cChocoPackageInstaller hadoop
   Name     = 'hadoop'
   Ensure   = 'Present'
   Version  = '3.3.1'
   Source   = 'STEP 3 URL'

Requires cChoco DSC Resource. See docs at

package { 'hadoop':
  provider => 'chocolatey',
  ensure   => '3.3.1',
  source   => 'STEP 3 URL',

Requires Puppet Chocolatey Provider module. See docs at

salt '*' chocolatey.install hadoop version="3.3.1" source="STEP 3 URL"

See docs at

5. If applicable - Chocolatey configuration/installation

See infrastructure management matrix for Chocolatey configuration elements and examples.

This package is exempt from moderation. While it is likely safe for you, there is more risk involved.

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.

The project includes these modules:

  • Hadoop Common: The common utilities that support the other Hadoop modules.
  • Hadoop Distributed File System (HDFS™): A distributed file system that provides high-throughput access to application data.
  • Hadoop YARN: A framework for job scheduling and cluster resource management.
  • Hadoop MapReduce: A YARN-based system for parallel processing of large data sets.


The package sets the HADOOP_HOME environment variable and adds HADOOP_HOME\bin to the machine path.

The package by default will unzip to C:\Hadoop. This location can be changed, see below.

Package Parameters

The following package parameters can be set:

  • /unzipLocation - Unzip to a different path than default.

These parameters can be passed to the installer with the use of -params.
For example: choco install hadoop -params '"/unzipLocation:D:\Hadoop"'.

$ErrorActionPreference = 'Stop';

$packageName    = 'hadoop'
$packageVersion = $env:chocolateyPackageVersion
$toolsDir       = "$(Split-Path -parent $MyInvocation.MyCommand.Definition)"
$hadoop_home    = "C:\Hadoop"
$mirrors        = ''
$checksum32     = 'ad770ae3293c8141cc074df4b623e40d79782d952507f511ef0a6b0fa3097bac'

# d/l from closest mirror
$get_mirror_page = Invoke-WebRequest -Uri $mirrors -UseBasicParsing
$url32 = $get_mirror_page.links | ? href -match '\.tar\.gz$' | select -First 1 -expand href

$pp = Get-PackageParameters
if ($pp.unzipLocation) { 
    $hadoop_home = $pp.unzipLocation
    Write-Host "Param: Unzipping (installing) to $hadoop_home"

$packageArgs = @{
  packageName   = $packageName
  unzipLocation = $toolsDir
  url           = $url32
  checksum      = $checksum32
  checksumType  = 'sha256'
  validExitCodes= @(0)

Install-ChocolateyZipPackage @packageArgs

# unzip tar
Get-ChocolateyUnzip -FileFullPath "$toolsDir\*.tar" -Destination $hadoop_home

Install-ChocolateyEnvironmentVariable `
    -VariableName "HADOOP_HOME" `
    -VariableValue $hadoop_home `
    -VariableType 'Machine'

Install-ChocolateyPath `
    -PathToInstall "%HADOOP_HOME%\$packageName-$packageVersion\bin" `
    -PathType 'Machine'

# Hadoop needs 8.3 path to find Java    
$sp = New-Object -ComObject Scripting.FileSystemObject
$f = $sp.GetFolder($env:JAVA_HOME)

Install-ChocolateyEnvironmentVariable `
    -VariableName "JAVA_HOME" `
    -VariableValue $f.ShortPath `
    -VariableType 'User'

# don't need tar anymore
Remove-Item $toolsDir\*.tar -ErrorAction SilentlyContinue -Force
$packageName = 'hadoop'
$packageVersion = $env:chocolateyPackageVersion
$hadoop_home = $env:HADOOP_HOME

# remove from Path
$path = [Environment]::GetEnvironmentVariable("Path", 'Machine')
$newPath = ($path.Split(';') | Where-Object { $_ -notmatch '.*hadoop.*' }) -join ';'
[Environment]::SetEnvironmentVariable("Path", $newPath, 'Machine')

Install-ChocolateyEnvironmentVariable `
    -VariableName "HADOOP_HOME" `
    -VariableValue $null `
    -VariableType 'Machine'

Install-ChocolateyEnvironmentVariable `
    -VariableName "JAVA_HOME" `
    -VariableValue $null `
    -VariableType 'User'

Remove-Item $hadoop_home -Recurse -Force

Log in or click on link to see number of positives.

In cases where actual malware is found, the packages are subject to removal. Software sometimes has false positives. Moderators do not necessarily validate the safety of the underlying software, only that a package retrieves software from the official distribution point and/or validate embedded software against official distribution point (where distribution rights allow redistribution).

Chocolatey Pro provides runtime protection from possible malware.

Version Downloads Last Updated Status
Hadoop 3.3.0 827 Friday, July 17, 2020 Approved
Hadoop 3.2.1 751 Wednesday, September 25, 2019 Approved
Hadoop 3.2.0 504 Wednesday, January 23, 2019 Approved
Hadoop 3.1.1 569 Friday, August 10, 2018 Approved
Hadoop 3.1.0 589 Monday, April 9, 2018 Approved
Hadoop 3.0.1 249 Tuesday, March 27, 2018 Approved
Hadoop 3.0.0 417 Friday, December 15, 2017 Approved
Hadoop 2.9.0 1132 Tuesday, November 21, 2017 Approved
Hadoop 2.8.2 329 Monday, November 6, 2017 Approved
Discussion for the Hadoop Package

Ground Rules:

  • This discussion is only about Hadoop and the Hadoop package. If you have feedback for Chocolatey, please contact the Google Group.
  • This discussion will carry over multiple versions. If you have a comment about a particular version, please note that in your comments.
  • The maintainers of this Chocolatey Package will be notified about new comments that are posted to this Disqus thread, however, it is NOT a guarantee that you will get a response. If you do not hear back from the maintainers after posting a message below, please follow up by using the link on the left side of this page or follow this link to contact maintainers. If you still hear nothing back, please follow the package triage process.
  • Tell us what you love about the package or Hadoop, or tell us what needs improvement.
  • Share your experiences with the package, or extra configuration or gotchas that you've found.
  • If you use a url, the comment will be flagged for moderation until you've been whitelisted. Disqus moderated comments are approved on a weekly schedule if not sooner. It could take between 1-5 days for your comment to show up.
comments powered by Disqus