MapR Cluster Administration Training
Training for system engineers who want to set up and run MapR clusters.
The training sessions are usually held in German. Please contact us if you are interested in a session in English.
Target audience: Administrators, System Engineers
Length: 3 days
Dates: available upon request
Times: 9 am – 5 pm
Number of participants: min. 3, max. 12
Price: 2,400 euros plus VAT
This training provides the knowledge required to develop big data applications based on Apache Spark 2.1.
Participants first learn how to use the Spark shell to load data sets from various sources and formats and analyse them interactively. Building on that, they develop a standalone Spark application that processes data as Datasets and DataFrames, either locally or on a cluster.
The training concludes with an introduction to Spark Streaming for processing data streams, GraphFrames for graph analysis, and the machine learning library MLlib.
Introduction to the MapR Converged Data Platform (HDFS core components, MapR-FS core components, MapR-FS versus HDFS)
Preparing for installation (security modes, planning of the service layout, preparation of cluster hardware, testing of nodes)
Installation of the MapR Converged Data Platform (MapR Installer, implementation of a manual installation, licensing of the cluster)
Verification and testing of the cluster (verification of the cluster status, post-installation benchmark tests, cluster structures)
Work with volumes (introduction to volumes, cluster topology, attributes for standard volumes, development of a volume plan, setting up and configuration of volumes)
Work with snapshots (introduction to snapshots, working with snapshots, use and management of snapshots)
Work with mirrors (introduction to mirrors, working with local mirrors, working with remote mirrors, remote mirrors and disaster recovery)
Configuration of user and cluster parameters (management of users and groups, access control expressions (ACEs), user and group quotas, configuration of topology and email notifications)
Configuration of cluster access (access to data in the cluster, virtual IP addresses for NFS access, client configuration)
Cluster monitoring and management (use of MCS and CLI, MapR Monitoring, reacting to alarms)
Disk and node maintenance (adding disks, replacing faulty disks, node maintenance, adding nodes)
Troubleshooting of cluster problems (basic troubleshooting, tools and utilities)
Installation and configuration of YARN (YARN services, YARN job execution flow, YARN configuration)
- The course fee includes training materials, a certificate of participation, lunches, drinks and snacks.
- Participants must bring their own laptop to the training sessions.
Rostislaw Krassow is a Big Data Engineer at inovex. Rostislaw has worked in the Hadoop environment since 2015 with technologies like Apache Spark, Hive, Drill, Flume and Sqoop, and is a certified MapR trainer. Before entering the big-data world, he built data platforms on the basis of classic databases such as Oracle.
Do you have any questions?
Head of inovex Academy
Spark Training as On-Site Training
Would you like to have several of your employees trained at once? Contact us to schedule a session at your site on a date of your choice.