Setup a data warehouse
Data warehouses for dummies​
Historically, setting up a data warehouse was a complex task requiring a dedicated team of data engineers. However, we live in an era where setting up a cloud data warehouse is a breeze.
This guide will walk you through the steps to set up a data warehouse for Mitzu.
The great thing about this guide is that it can serve you in the later stages of your data journey.
Step 1. Choose a data warehouse solution​
For most companies, we suggest getting started with the simplest solutions:
- Clickhouse
- Snowflake
- BigQuery
- PostgreSQL
In this guide, we will use BigQuery as an example.
Google BigQuery is an excellent choice for most companies, starting with data warehouses. It is easy to set up and has a free tier.
Please get in touch with us if you want to use another data warehouse. Here you can find our Slack community.
Step 2. Setup BigQuery (5-10 mins)​
Starting with BigQuery is easy:
- Go to the BigQuery console.
- Create a new free account (it requires a credit card). But 10GB of data storage is free.
- Go to the BigQuery console: BQ Admin. Here, you should see your default project.
- Create a new dataset! I suggest keeping it as a single region dataset.
- Once done, your BigQuery project should look like this:
- Create a service account to access your data warehouse. Your service account should have the role
BigQuery Admin
. You can change this later if needed. - Create a new JSON access key under the
Manage keys
menu item.. Mitzu will use this JSON key to access your data warehouse later in the next steps.
Congratulations! You have successfully set up your data warehouse.
Step 3. Collect data to BigQuery (10-15 mins)​
Moving data to your data warehouse is also very simple. You have multiple options available:
- Using a CDP - (Customer Data Platform) like Segment, RudderStack, Jitsu or Snowplow
- DIY - build your solution that collects data (Not recommended)
This guide will use Jitsu as this is the simplest way to start.
Jitsu is an excellent choice for most companies that need data ingestion to the data warehouse. It is easy to set up and has a very generous free tier.
By the end of this guide, you will have a Jitsu account created and data from your landing page, your application, and Stripe data collected to BigQuery. You should see the result below:
Let's get started!
Go to Jitsu and create a new free account.
3.1. Add BigQuery as a destination​
Create your first destination by clicking on + Add
button at the destinations.
Choose BigQuery
as a destination.
3.2. Add your landing page visits as a data source​
Create your first data source by clicking on + Add
button at the data sources.
Then, you need to name your data source and create a browser key. You can leave the rest as default.
You must embed a JavaScript snippet into your website to finalize your setup.
This part is probably the easiest. Copy Jitsu's javascript snippet and paste it into your website's <head>
section.
You can find the snipped under the Setup instructions
menu item.
Finally, connect your new data source to the BigQuery destination. You can do this by clicking on the Connections
button in the middle of the overview page.
3.3. Add your application as a data source​
Adding the application data source is similar to adding the landing page data source.
However, you will most likely use an SDK to collect the data. Jitsu currently supports multiple SDKs.
You can find the list of supported SDKs under the Setup instructions
menu item.
Remember to connect your data source to the BigQuery destination.
3.4. Add Stripe as a data source​
Now, we will add Stripe as a data source. This will be valuable in measuring revenue inside Mitzu. Mitzu will allow you to analyze revenue based on user segments.
Mitzu works best with Stripe data for consumer products. It is also required to have the product userID
as Stripe customers' metadata.
Follow this guide to add metadata to your Stripe customers.
The best way to integrate Stripe with Mitzu is to follow the guide provided by Jitsu. This guide will appear when you add your Stripe connector.
Here is what it should look like:
Step 4. Verify your setup​
As a final step, let's verify everything is working as expected. You should see in BigQuery the following tables:
Step 4. Connect BigQuery to Mitzu (10 mins)​
Follow the guide at setup Mitzu.