On Monday, I wrote about how to install and setup Cloudera’s Virtual Hadoop Cluster, which is named the Cloudera Quickstart VM. After the cluster is up and running, the next step is to build a data lake so that you can import data to analyze. You can find guidance on how to do that at Part II: Building a Basic Data Lake for the Cloudera Quickstart VM. If you haven’t finished installing the Quickstart VM, you can find a step-by-step guide here.
Both Part I and Part II have a contact form at the bottom: if you get stuck, feel free to reach out, and I’ll do my best to help.