Dataddo Use cases, Tutorials, Web analytics

Automated Google Analytics data retrieval using Dataddo

One of the main struggles with regular reporting on Google Analytics data is the need to repeat the same tasks daily, weekly or monthly: defining the dataset, setting the date range, and manually exporting and copy-pasting the data. The whole process can get very time-consuming, particularly when managing multiple Google Analytics accounts. To cope with this rather annoying issue, we have developed a solution that enables Dataddo users to retrieve and synchronize data from Google Analytics automatically at defined intervals. The following paragraphs present a short tutorial on how to use this feature to save both time and money. If you want to know more about other capabilities of the Dataddo Google Analytics connector, please check this video.

Define query

Provided you already have a Dataddo account, go to “Sources”, click on “Add a new source” and select “Google Analytics”. In the Google Analytics connector interface, set the metrics, dimensions, date range and other options according to your preferences.

Google Analytics connector - daily visits

Set data sync

Clicking on the “Data sync” section in the Google Analytics connector allows you to define how often the data should be retrieved automatically. First, set the time zone to make sure the schedule matches your local time. Then select the period in which you want to retrieve the data (daily, weekly or monthly) and fill in the corresponding fields: the hour, and the day of the week or day of the month, at which the retrieval should fire. Also set “Data storage” to define how newly retrieved data is stored in each iteration. You can select either “Clean storage”, meaning that all historic data will be erased, or “Incremental storage”, which places the new data on top of the existing data.
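
The difference between the two storage modes can be sketched in a few lines of Python. This is only an illustration of the behaviour described above, not Dataddo's implementation: the function name `sync`, the mode strings and the sample rows are all assumptions made up for the example.

```python
from typing import Dict, List

Row = Dict[str, object]


def sync(storage: List[Row], new_rows: List[Row], mode: str) -> List[Row]:
    """Return the storage contents after one synchronization run.

    mode="clean"       -> historic rows are erased, only the new rows remain
    mode="incremental" -> new rows are added to the existing ones
    """
    if mode == "clean":
        return list(new_rows)
    if mode == "incremental":
        return storage + list(new_rows)
    raise ValueError(f"unknown storage mode: {mode!r}")


# Hypothetical rows retrieved on two consecutive daily runs.
monday = [{"ga:date": "20150601", "ga:sessions": 120}]
tuesday = [{"ga:date": "20150602", "ga:sessions": 135}]

clean = sync(sync([], monday, "clean"), tuesday, "clean")
incremental = sync(sync([], monday, "incremental"), tuesday, "incremental")

print(len(clean))        # 1: only the rows from the latest run
print(len(incremental))  # 2: rows from both runs
```

Clean storage fits reports that always query the full date range, while incremental storage fits runs that fetch only the most recent day or week.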

GA data sync

Check everything is up and running

When you click back on “Sources”, you can see the list of all available sources. Clicking on the clock icon shows the date and time when the process will next be fired. A log of all past data synchronization activity is also available in the overview.

GA sources

Dataddo Use cases, Tutorials, Web analytics

Correlation between weather and website performance

This article demonstrates one of the many great features of Dataddo: fusing together data from many different sources to discover valuable insights for your business. Writing it in the middle of an extraordinarily long heat wave gave me the impulse to use the long-observed correlation between the performance of certain websites and the weather as an example.

Methodology

Correlation is “a statistical technique that can show whether and how strongly pairs of variables are related”, so it is important to choose suitable datasets to represent both variables. In the following example, I have chosen daily visits reported in Google Analytics in June 2015 to represent website performance and daily temperatures in June 2015 to represent the weather. Of course, you can choose other datasets that fit your case better, such as daily transactions, article views or content interactions for website performance, and daily rainfall or humidity for the weather.

Obtaining the data

Since Dataddo features many different data connectors, obtaining the data is rather simple. The website performance dataset (daily visits / sessions from Google Analytics) is retrieved using the Dataddo Google Analytics connector, with the dimension set to “Date” and the metric set to “Sessions”. Daily temperatures (Prague, Czech Republic) are obtained from the Czech Hydrometeorological Institute as a CSV file and imported into Dataddo using the CSV connector.

Google Analytics connector - daily visits CSV connector

Merging the data

Dataddo allows you to define a structure, a collection of one or more data sources. Within each structure, you can define 1:1 or 1:n relations between the sources and thus fuse the data together. In the following example, the date (“ga:date” and “Date”) is used as the “bonding key”.
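
Outside of the designer, the same idea of joining two sources on a date key can be sketched in plain Python. The sample rows, the CSV column names and the date format of the CSV are assumptions made for illustration, and keeping only the days present in both sources is a choice of this sketch rather than documented Dataddo behaviour.

```python
import csv
import io
from datetime import datetime

# Hypothetical Google Analytics rows: ga:date uses the YYYYMMDD format.
ga_rows = [
    {"ga:date": "20150601", "ga:sessions": 1200},
    {"ga:date": "20150602", "ga:sessions": 1150},
    {"ga:date": "20150603", "ga:sessions": 980},
]

# Hypothetical CSV with daily temperatures; the column names are assumed.
weather_csv = """Date,AVG temperature
2015-06-01,17.5
2015-06-02,19.0
2015-06-04,24.5
"""


def normalize_ga_date(value: str) -> str:
    """Convert YYYYMMDD into ISO format (YYYY-MM-DD) so both keys match."""
    return datetime.strptime(value, "%Y%m%d").date().isoformat()


# Index the weather rows by date for a 1:1 lookup.
weather_by_date = {
    row["Date"]: float(row["AVG temperature"])
    for row in csv.DictReader(io.StringIO(weather_csv))
}

# Keep only the days present in both sources (an inner join on the date).
merged = []
for row in ga_rows:
    day = normalize_ga_date(row["ga:date"])
    if day in weather_by_date:
        merged.append((day, row["ga:sessions"], weather_by_date[day]))

for record in merged:
    print(record)
```

The key point is that both sources must express the bonding key in the same format before they can be matched.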

designer

Calculating correlation

Finally, the merged data in the structure can be explored using Data explorer. Within the Data explorer interface, many statistical computations, including correlation, can also be carried out. The calculated value of the Pearson correlation coefficient for “ga:sessions” and “AVG temperature” is -0.68, which represents a moderate negative linear correlation between the two variables. This suggests that weather (temperature, more precisely) has a certain impact on the performance of the examined website.
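
For readers who want to reproduce this kind of calculation outside Data explorer, here is a minimal sketch of the Pearson correlation coefficient in plain Python. The function name `pearson` is hypothetical and the sample values are made up for illustration; they are not the data used in this article.

```python
from math import sqrt
from typing import Sequence


def pearson(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equally long samples."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two samples of equal length (at least 2 values)")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of products of deviations, and sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sample")
    return cov / sqrt(var_x * var_y)


# Made-up values for illustration only, not the data from the article.
sessions = [1200, 1150, 980, 1010, 900]
avg_temperature = [17.5, 19.0, 24.5, 23.0, 27.0]

r = pearson(sessions, avg_temperature)
print(round(r, 2))
```

A value close to -1 or 1 indicates a strong linear relationship, while a value near 0 indicates little or no linear relationship.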

explorer

Tutorials, Uncategorized, Web analytics

How to filter referral spam in Google Analytics

In your Google Analytics reports, you might have come across traffic from sources such as semalt.com, darodar.com or social-buttons.com. Such traffic is commonly known as “referral spam”. Recently, the amount of referral spam in Google Analytics has increased significantly. It might seem that your site has a lot of visitors, but in reality none of these visits is real. Although referral spam does not pose a serious threat to your site (apart from bursts of load on your server), it can seriously distort your web analytics data, especially when your site does not have tens of thousands of visits per month.

Luckily, the solution is quite simple: you just need to define a filter on your view (make sure you keep at least one view where unfiltered data is available).

Simply go to the “Admin” section of Google Analytics and select the view to which you want to apply the filter. Then select the “Custom” filter type, choose “Exclude” and set the filter field to “Referral”. Set the filter pattern to the following regular expression:

.*(semalt\.com|social-buttons|darodar\.com|makemoneyonline|googlsucks|best-seo-solution|best-seo-offer|.*share-buttons|buttons-.*-website|guardlan|buy-cheap-online|success-seo).*

It filters out referral spam from the following domains:

  • semalt.com
  • social-buttons.com
  • darodar.com
  • buttons-for-website.com
  • makemoneyonline.com
  • googlsucks.com
  • best-seo-solutions.com
  • best-seo-offer.com
  • free-share-buttons.com
  • buttons-for-your-website.com
  • guardlan.com
  • buy-cheap-online.info
  • success-seo.com

Make sure you include “.*” at both the beginning and the end of your expression, since Google Analytics always compares the expression to the full referral path. There is a great tool called Rubular, which you can use for testing your regular expressions. If you come across other spam referrals, simply add them to your regular expression. You can find a full list of referrals used by spammers here.
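
If you prefer to test the expression locally, a short Python script can do the same check. This is a sketch under a few assumptions: the pattern is written with literal dots escaped, it uses “success-seo” as the last alternative so that it matches the listed domain success-seo.com, and `fullmatch` stands in for the full-path comparison described above. Python's regular expression engine may also differ in details from the one used by Google Analytics, and the “legitimate” referrers are made-up examples.

```python
import re

# Filter pattern with literal dots escaped. "success-seo" is an assumption
# chosen so that the pattern matches the listed domain success-seo.com.
SPAM_PATTERN = re.compile(
    r".*(semalt\.com|social-buttons|darodar\.com|makemoneyonline|googlsucks"
    r"|best-seo-solution|best-seo-offer|.*share-buttons|buttons-.*-website"
    r"|guardlan|buy-cheap-online|success-seo).*"
)

spam_domains = [
    "semalt.com",
    "social-buttons.com",
    "darodar.com",
    "buttons-for-website.com",
    "makemoneyonline.com",
    "googlsucks.com",
    "best-seo-solutions.com",
    "best-seo-offer.com",
    "free-share-buttons.com",
    "buttons-for-your-website.com",
    "guardlan.com",
    "buy-cheap-online.info",
    "success-seo.com",
]

# Hypothetical legitimate referrers that must not be filtered out.
legitimate = ["google.com", "example.org", "news.ycombinator.com"]

# fullmatch requires the whole string to match, mirroring a full-path comparison.
missed = [d for d in spam_domains if not SPAM_PATTERN.fullmatch(d)]
false_hits = [d for d in legitimate if SPAM_PATTERN.fullmatch(d)]

print("missed spam domains:", missed)
print("wrongly matched:", false_hits)
```

Both lists should come back empty. Whenever you add a new spam referral to the expression, add it to `spam_domains` as well and run the check again.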

Tutorials, Uncategorized, Web analytics

Easy data integration with Dataddo

When developing Dataddo, we put a lot of effort into simplifying the routines that our users have to deal with on a daily basis. One of the cool examples is how easily you can integrate data from various sources. With a user-friendly drag-and-drop interface, you can mash together various kinds of information, regardless of whether it is a traffic report extracted from Google Analytics, a product list in the form of a CSV file or social campaign data from Facebook Insights. See the video tutorial below for more details: