Cloudera DataFlow (previously known as Hortonworks DataFlow) is a scalable, real-time streaming analytics platform that ingests, curates and analyzes data for key insights and immediate actionable intelligence. Cloudera DataFlow addresses the key challenges enterprises face with data-in-motion: real-time stream processing of data at high volume and high scale, data provenance, and ingestion from IoT devices, edge applications and streaming sources. Now that Cloudera and Hortonworks have merged, Cloudera is rapidly delivering Cloudera DataFlow for use with Cloudera clusters to make real-time streaming analytics at scale a reality for our new customers.
How to provision Cloudera EDH
Add Kafka, NiFi and Kudu
Build out real-time ingestion in NiFi
Attendees need to bring their laptop. Internet connectivity is required.
This workshop is held in cooperation with Business & Decision and is aimed primarily at data engineers.