![pentaho data integration api pentaho data integration api](https://docs.treasuredata.com/download/attachments/328223/Pentaho3.png)
There you can set the permissions for Google accounts – mind that for the API access you need an account that does not use Two-Factor-Authentication (a technology I otherwise very much recommend – it won’t keep the NSA away though). How the data is transported from “week list” to “google analytics” I described in my last post on bulk-downloading files with Kettle – there I just used a Job on the receiving end but the principle is exactly the same.įor every API request the resulting figures are stored in a TSV file which can then be fed to the spreadsheet program of your choice for further processing.īefore you can access Google Analytics with its API you have to activate that service via the Google API Console. This set of rows is passed on to the next transformation “google analytics” which is executed for every row individually.
![pentaho data integration api pentaho data integration api](https://i.stack.imgur.com/l4CT7.png)
“week list” creates a set of rows for every time span keeping the calendar week number (cw), the startDate and the endDate of the time span. The jobīecause the API request for the number of unique visitors has to be invoked for every needed time span – the calender weeks (Monday to Sunday) and the time spans stretching two consecutive calendar weeks. I will focus on the steps and aspects relevant to the subject.
![pentaho data integration api pentaho data integration api](https://zappysys.com/blog/wp-content/uploads/2018/08/pentaho.png)
#Pentaho data integration api how to#
PDI provides a step for fetching data from Google Analytics and I am going to describe in this post how to use this feature on the basis of the job I clicked together for the article on “returning visitors”. At work as well as for personal projects I use for ETL processes Pentaho Data Integration (PDI) aka Kettle. In my last post I was describing how to calculate “returning visitors” in a customizable way depending on how you want to define “returning”.