Skip to main content

Setup a data warehouse

Data warehouses for dummies​

Historically, setting up a data warehouse was a complex task requiring a dedicated team of data engineers. However, we live in an era where setting up a cloud data warehouse is a breeze.

This guide will walk you through the steps to set up a data warehouse for Mitzu.

info

The great thing about this guide is that it can serve you in the later stages of your data journey.

Step 1. Choose a data warehouse solution​

For most companies, we suggest getting started with the simplest solutions:

  • Clickhouse
  • Snowflake
  • BigQuery
  • PostgreSQL

In this guide, we will use BigQuery as an example.

info

Google BigQuery is an excellent choice for most companies, starting with data warehouses. It is easy to set up and has a free tier.

Please get in touch with us if you want to use another data warehouse. Here you can find our Slack community.

Step 2. Setup BigQuery (5-10 mins)​

Starting with BigQuery is easy:

  1. Go to the BigQuery console.
  2. Create a new free account (it requires a credit card). But 10GB of data storage is free.
  3. Go to the BigQuery console: BQ Admin. Here, you should see your default project.
  4. Create a new dataset!
    image
    I suggest keeping it as a single region dataset.
  5. Once done, your BigQuery project should look like this:
    image
  6. Create a service account to access your data warehouse. Your service account should have the role BigQuery Admin. You can change this later if needed.
  7. Create a new JSON access key under the Manage keys menu item.
    image
    image
    . Mitzu will use this JSON key to access your data warehouse later in the next steps.
success

Congratulations! You have successfully set up your data warehouse.

Step 3. Collect data to BigQuery (10-15 mins)​

Moving data to your data warehouse is also very simple. You have multiple options available:

This guide will use Jitsu as this is the simplest way to start.

note

Jitsu is an excellent choice for most companies that need data ingestion to the data warehouse. It is easy to set up and has a very generous free tier.

By the end of this guide, you will have a Jitsu account created and data from your landing page, your application, and Stripe data collected to BigQuery. You should see the result below:

image

Let's get started!

Go to Jitsu and create a new free account.

image

3.1. Add BigQuery as a destination​

Create your first destination by clicking on + Add button at the destinations. Choose BigQuery as a destination.

image

3.2. Add your landing page visits as a data source​

Create your first data source by clicking on + Add button at the data sources.

Then, you need to name your data source and create a browser key. You can leave the rest as default.

image

You must embed a JavaScript snippet into your website to finalize your setup. This part is probably the easiest. Copy Jitsu's javascript snippet and paste it into your website's <head> section. You can find the snipped under the Setup instructions menu item.

image

Finally, connect your new data source to the BigQuery destination. You can do this by clicking on the Connections button in the middle of the overview page.

3.3. Add your application as a data source​

Adding the application data source is similar to adding the landing page data source. However, you will most likely use an SDK to collect the data. Jitsu currently supports multiple SDKs. You can find the list of supported SDKs under the Setup instructions menu item.

image
info

Remember to connect your data source to the BigQuery destination.

3.4. Add Stripe as a data source​

Now, we will add Stripe as a data source. This will be valuable in measuring revenue inside Mitzu. Mitzu will allow you to analyze revenue based on user segments.

warning

Mitzu works best with Stripe data for consumer products. It is also required to have the product userID as Stripe customers' metadata. Follow this guide to add metadata to your Stripe customers.

The best way to integrate Stripe with Mitzu is to follow the guide provided by Jitsu. This guide will appear when you add your Stripe connector.

image

Here is what it should look like:

image

Step 4. Verify your setup​

As a final step, let's verify everything is working as expected. You should see in BigQuery the following tables:

image

Step 4. Connect BigQuery to Mitzu (10 mins)​

Follow the guide at setup Mitzu.