Evolve logo The conference for data & analytics leaders | Frankfurt, June 27

About This Training

One of the most critical functions of a data-driven enterprise is the ability to manage ingest and data flow across complex ecosystems. Does your team have the tools and skill sets to succeed at this?

Apache NiFi provides this capability and our three-day Cloudera Dataflow: Flow Management with Apache Nifi course delivers the foundational training you'll need to succeed with NiFi. In addition to learning NiFi's key features and concepts, participants will gain hands-on experience creating, executing, managing, and optimizing NiFi dataflows throughout a variety of scenarios.

What Skills You Will Gain

During this course, you learn how to:

  • Navigate the NiFi user interface
  • Define, configure, organize, and manage dataflows
  • Transform and trace data as it flows to its destination
  • Track changes to dataflows with NiFi Registry
  • Use the NiFi Expression Language to control dataflows
  • Optimize dataflows for better performance and maintainability
  • Connect dataflows with other systems, such as Apache Kafka, Apache Hive, and HDFS

Who Should Take This Course?

This course is designed for developers, data engineers, administrators, and others with an interest in learning NiFi's innovative no-code, graphical approach to data ingest. Although programming experience is not required, basic experience with Linux is presumed, and previous exposure to big data concepts and applications is helpful.

Other Training That Might Interest You

 

Book the course

Course Outline

Introduction to Cloudera Flow Management

  • Overview of Cloudera Flow Management and NiFi
  • The NiFi User Interface
  • Demonstration: NiFi User Interface
  • Exercise: Build Your First Dataflow

Processors

  • Overview of Processors
  • Processor Surface Panel
  • Processor Configuration
  • Exercise: Start Building a Dataflow Using Processors

Connections

  • Overview of Connections
  • Connection Configuration
  • Connector Context Menu
  • Exercise: Connect Processors in a Dataflow

Dataflows

  • Command and Control of a Dataflow
  • Processor Relationships
  • Back Pressure
  • Prioritizers
  • Labels
  • Exercise: Build a More Complex Dataflow
  • Exercise: Creating a Fork Using Relationships
  • Exercise: Set Back Pressure Thresholds

Process Groups

  • Anatomy of a Process Group
  • Input and Output Ports
  • Exercise: Simplify Dataflows Using Process Groups

FlowFile Provenance

  • Data Provenance Events
  • FlowFile Lineage
  • Replaying a FlowFile
  • Exercise: Using Data Provenance

Dataflow Templates

  • Templates Overview
  • Managing Templates
  • Exercise: Creating, Using, and Managing Templates

Apache NiFi Registry

  • Apache NiFi Registry Overview
  • Using the Registry
  • Exercise: Versioning Flows Using NiFi Registry

FlowFile Attributes

  • FlowFile Attributes
  • Routing on Attributes
  • Exercise: Working with FlowFile Attributes

NiFi Expression Language

  • NiFi Expression Language Overview
  • Syntax
  • Expression Language Editor
  • Setting Conditional Values
  • Exercise: Using the NiFi Expression Language

Dataflow Optimization

  • Dataflow Optimization
  • Control Rate
  • Managing Compute
  • Exercise: Building an Optimized Dataflow

NiFi Architecture

  • NiFi Architecture Overview
  • Cluster Architecture
  • Heartbeats
  • Managing Clusters

Site-to-Site Dataflows

  • Site-to-Site Theory
  • Site-to-Site Architecture
  • Anatomy of a Remote Process Group
  • Adding and Configuring Remote Process Groups
  • Exercise: Building Site-to-Site Dataflows

Cloudera Edge Management and MiNiFi

  • Overview of MiNiFi
  • Example Walk-through

Monitoring and Reporting

  • Monitoring from NiFi
  • Overview of Reporting
  • Examples of Common Reporting Tasks
  • Exercise: Monitoring and Report

Controller Services

  • Controller Services Overview
  • Common Controller Services
  • Exercise: Adding Apache Hive Controller

Integrating NiFi with the Cloudera Ecosystem

  • NiFi Integration Architecture
  • NiFi Ecosystem Processors
  • A Closer Look at NiFi and Apache Hive
  • A Closer Look at NiFi and Apache Kafka
  • Exercise: Integrating Dataflows with Kafka and HDFS

NiFi Security

  • NiFi Security Overview
  • Securing Access to the NiFi UI
  • Authentication
  • Authorization
  • NiFi Registry Security
  • NiFi Security Summary

Cloudera has not only prepared us for success today, but has also trained us to face and prevail over our big data challenges in the future by using Hadoop.

Persado

Learn more

CCA Spark and Hadoop Developer Certification

This course is excellent preparation for the CCA Spark and Hadoop Developer exam. Although we recommend further training and hands-on experience before attempting the exam, this course covers many of the subjects tested. 

Certification is a great differentiator. It helps establish you as a leader in the field, providing employers and customers with tangible evidence of your skills and expertise.

Advance your career

Big data developers are among the world's most in-demand and highly-compensated technical roles. Check out some of the job opportunities currently listed that match the professional profile, many of which seek CCA qualifications.

Private training

We also provide private training at your site, at your pace, and tailored to your needs.

Your form submission has failed.

This may have been caused by one of the following:

  • Your request timed out
  • A plugin/browser extension blocked the submission. If you have an ad blocking plugin please disable it and close this message to reload the page.