You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* fix outdated reference to intergration tests
* doc tweaks
* removing reference to google's obfuscated data set which no longer works
* fixing reference to integration_tests folder
Copy file name to clipboardExpand all lines: README.md
+8-20Lines changed: 8 additions & 20 deletions
Original file line number
Diff line number
Diff line change
@@ -6,11 +6,10 @@ Features include:
6
6
- Flattened models to access common events and event parameters such as `page_view`, `session_start`, and `purchase`
7
7
- Conversion of sharded event tables into a single partitioned table
8
8
- Incremental loading of GA4 data into your staging tables
9
-
-Session and user dimensional models with conversion counts
10
-
-Easy access to query parameters such as GCLID and UTM params
11
-
- Support for custom event parameters & custom user properties
9
+
-Page, session and user dimensional models with conversion counts
10
+
-Simple methods for accessing query parameters (like UTM params) or filtering query parameters (like click IDs)
11
+
- Support for custom event parameters & user properties
12
12
- Mapping from source/medium to default channel grouping
13
-
- Ability to exclude query parameters (like `fbclid`) from page paths
14
13
15
14
# Models
16
15
@@ -23,7 +22,7 @@ Features include:
23
22
| stg_ga4__user_properties | Finds the most recent occurance of specified user_properties for each user |
24
23
| stg_ga4__derived_user_properties | Finds the most recent occurance of specific event_params value and assigns them to a user_pseudo_id. Derived user properties are specified as variables (see documentation below) |
25
24
| stg_ga4__derived_session_properties | Finds the most recent occurance of specific event_params or user_properties value and assigns them to a session's session_key. Derived session properties are specified as variables (see documentation below) |
26
-
| stg_ga4__session_conversions_daily | Produces daily counts of conversions per session. The lsit of conversion events to include is configurable (see documentation below) |
25
+
| stg_ga4__session_conversions_daily | Produces daily counts of conversions per session. The list of conversion events to include is configurable (see documentation below) |
27
26
| stg_ga4__sessions_traffic_sources | Finds the first source, medium, campaign, content, paid search term (from UTM tracking), and default channel grouping for each session |
28
27
| dim_ga4__user_pseudo_ids | Dimension table for user devices as indicated by user_pseudo_ids. Contains attributes such as first and last page viewed.|
29
28
| dim_ga4__sessions | Dimension table for sessions which contains useful attributes such as geography, device information, and campaign data |
@@ -70,28 +69,17 @@ packages:
70
69
```
71
70
## Required Variables
72
71
73
-
This package assumes that you have an existing DBT project with a BigQuery profile and a BigQuery GCP instance available with GA4 event data loaded. Source data is located using the following variables which must be set in your `dbt_project.yml` file.
72
+
This package assumes that you have an existing DBT project with a BigQuery profile and a BigQuery GCP instance available with GA4 event data loaded. Source data is defined using the following variables which must be set in `dbt_project.yml`.
74
73
75
74
```
76
75
vars:
77
76
ga4:
78
77
project: "your_gcp_project"
79
78
dataset: "your_ga4_dataset"
80
79
start_date: "YYYYMMDD" # Earliest date to load
81
-
frequency: "daily" # daily|streaming|daily+streaming Match to the type of export configured in GA4; daily+streaming appends today's intraday data to daily data
80
+
frequency: "daily" # daily|streaming|daily+streaming. See 'Export Frequency' below.
82
81
```
83
82
84
-
If you don't have any GA4 data of your own, you can connect to Google's public data set with the following settings:
85
-
86
-
```
87
-
vars:
88
-
project: "bigquery-public-data"
89
-
dataset: "ga4_obfuscated_sample_ecommerce"
90
-
start_date: "20210120"
91
-
```
92
-
93
-
More info about the GA4 obfuscated dataset here: https://support.google.com/analytics/answer/10937659?hl=en#zippy=%2Cin-this-article
94
-
95
83
## Optional Variables
96
84
97
85
### Query Parameter Exclusions
@@ -275,6 +263,6 @@ The easiest option is using OAuth with your Google Account. Summarized instructi
This package uses `pytest` as a method of unit testing individual models. More details can be found in the [integration_tests/README.md](integration_tests) folder.
268
+
This package uses `pytest` as a method of unit testing individual models. More details can be found in the [unit_tests/README.md](unit_tests) folder.
Copy file name to clipboardExpand all lines: unit_tests/README.md
+1-1Lines changed: 1 addition & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -7,7 +7,7 @@ The dbt-ga4 package treats each model and macro as a 'unit' of code. If we fix t
7
7
8
8
You'll need to install pytest, pytest-dotenv and create a `.env` file with a `BIGQUERY_PROJECT` key containing the name of your BigQuery project. An 'oauth' connection method is assumed for local development.
9
9
10
-
Installing pytest & pytest-dotenv can be done using the requirements.txt file. Navigate to the `integration_tests` folder and run
10
+
Installing pytest & pytest-dotenv can be done using the requirements.txt file. Navigate to the `unit_tests` folder and run
0 commit comments