

RapidMiner Radoop requires access to a variety of ports on the cluster.

Verifying port availability for RapidMiner Radoop After installing RapidMiner Radoop and creating connections, refer to networking setup for more information. Make sure that RapidMiner Radoop can connect to your Hadoop cluster. See the supported data warehouse systems. The system must be installed on a Hadoop cluster. RapidMiner Radoop supports Apache Hive or Impala. See Hadoop cluster requirements and supported Hadoop distributions. RapidMiner Radoop requires connection to a properly configured Hadoop cluster. (Note that Radoop Basic is not enough to use Radoop.) If you are interested in enabling advanced capabilities and support, contact us to purchase a RapidMiner Radoop license. Radoop free license is automatically downloaded once logged in. If necessary, see the instructions for RapidMiner Studio installation or RapidMiner Server installation. You need RapidMiner Studio, and optionally, RapidMiner Server installed.

If any of these prerequisites have not yet been met, be sure to finish them before proceeding with the installation. The installation instructions assume that you have completed the following tasks. The following instructions describe the process for installing the RapidMiner Radoop extension. Integrating RapidMiner Radoop into the RapidMiner advanced analytics suite is as easy as downloading the extension and making some configuration changes. RapidMiner Radoop runs on any platform that supports Java. It can be installed on RapidMiner Studio and/or RapidMiner Server, and provides a platform for editing and running ETL, data analytics, and machine learning processes in a Hadoop environment. RapidMiner Radoop is client software with an easy-to-use graphical interface for processing and analyzing big data on a Hadoop cluster. Installing RapidMiner Radoop on RapidMiner Studio
