know.bi blog

Google Drive in Pentaho Data Integration

Jun 11, 2018 10:00:00 AM / by Hans Van Akelyen posted in pentaho data integration, cloud, Google Cloud Platform

0 Comments

One of the new features in Pentaho Data Integration 8.1 is the ability to directly connect to Google Drive. PDI uses the Virtual File System (VFS) which allows you to connect to a variety of file systems in a transparent way.

Read More

What's new in Pentaho 8.1

Jun 5, 2018 10:00:00 AM / by Hans Van Akelyen posted in pentaho, Big Data, cloud, data integration, hitachivantara

0 Comments

On May, 16th 2018, Hitachi Vantara released Pentaho 8.1 Although this is a minor follow-up release to 8.0 as far as version numbers go, but nevertheless a lot of new exciting features and improvements have been added.

Read More

Build a dashboard with AWS Athena and Quicksight in less than an hour

Feb 28, 2018 10:00:00 AM / by Hans Van Akelyen posted in amazon, analytics, cloud, aws

0 Comments

Quickly move from data to insight

Read More

5 Key Components For Your Cloud Analytics Project

Feb 9, 2018 10:00:00 AM / by Bart Maertens posted in aws, amazon, business intelligence, analytics, analytical database, column store, cloud etl, cloud, data engineering

0 Comments

Why move your BI to the cloud? 

As discussed in a previous post, there are many reasons to move your BI to the cloud.
Security, being able to work from anywhere and delivering faster, with more resource flexibility and at a lower cost are just a few.  

Read More

5 reasons to move your bi to the cloud

Jan 9, 2018 9:30:00 AM / by Hans Van Akelyen posted in pentaho, amazon, cloud

0 Comments

Cloud computing is the way to the future, and the way to bring your company to the next level. With the abillity to have enterprise grade services and technologies at a significantly lower price, your company can focus on creating more value while your IT department has to spend less time on maintaining infrastructure. 

These are our top five reasons to move your BI infrastructure to the cloud:

Read More

Getting Started with AWS DMS

Dec 27, 2017 9:30:00 AM / by Willem Dullaart posted in amazon, aws, database migration service, dms, cloud

0 Comments

What is Amazon DMS

Every day, more and more companies are moving towards cloud computing, with Amazon Web Services (AWS) undoubtedly being the biggest player. Having all the possible AWS services available at your fingertips is great, but you still need to migrate your existing infrastructure and data into the (AWS) cloud. At re:Invent 2015, Amazon announced “AWS Database Migration Service”, aiming to make the process of moving data into databases on AWS a lot easier.

AWS DMS supports most open-source and commercial databases such as PostgreSQL, MySQL, MariaDB, Oracle, Microsoft SQL Server, and of course their own Aurora, Redshift, DynamoDB and S3 services. Both homogeneous (e.g. Postgres to Postgres) and heterogeneous migrations (e.g. Oracle to MySQL) are supported. Either the source or target database is required to be in the AWS cloud. DMS regularly gets updated with new features and supported engines.

DMS Overview

At the highest level, you have three components to take care of when starting a migration using DMS:

Read More

3 Reasons to take a look at WebSpoon for web or cloud ETL

Nov 29, 2017 8:30:00 AM / by Bart Maertens posted in pentaho, webspoon, pentaho data integration, cloud, cloud etl, web etl

3 Comments

3 reasons to move your ETL to the web, cloud

ETL development heavily relies on the desktop with files, database and network connections that require the developer to be the resources that are located in the company network.
Apart from these access restrictions, most of the established ETL platforms have a history of over a decade and were originally developed in an era where web based applications were basic at best.
Times have changed, however, and web applications have come a long way. We'll look at a number of reasons to move your ETL to the web and/or cloud. 

1. Data can't leave the organization

There are plenty of cases where data is considered to be too sensitive to leave the organization's premises or (virtual) private cloud. 
With a centralized ETL infrastructure, ETL developers and data engineers can work from anywhere in the world. All of the data is managed over secure connections without the need for a single byte of data to leave the organization's systems. 

2. Data is too big to copy or changes frequently

ETL developers and data engineers often need to work in geographically separate locations, while the data remains in one location. 
Developing ETL or working with frequently changing data over VPN connections and remote deskop protocols is painful, if possible at all.
Life can be a lot easier if the ETL and data management work can be done over a standard HTTP(S) protocol from anywhere in the world. 

3. Simplified installation, configuration and project management

Last but not least, ETL configuration management and overall DevOps for a large number of desktop installations can be a burden. 
Instead of maintining an installation on every ETL developer's or data engineer's machine, a centralized approach can significantly simplify the process. 

With a centralized installation, developers are guaranteed to work on the same standardized software version, configuration and set of plugins. 
Additionally, ETL working practices and conventions are a lot easier to enforce from a centralized environment. 

 

 

Try it out for yourself

If you're using or considering Pentaho (now part of Hitachi Vantara), all of this is within grasp: with the WebSpoon project, your existing ETL can simply be moved to the web and cloud. No changes to your existing code base are required, and you can gradually (or partially) make the switch to web or cloud based ETL. 

We've set up a demo environment for WebSpoon, feel free to give it a try.  

WebSpoon is available as open source and is not (yet) part of the Pentaho Enterprise Edition. Let us know if you'd like to find out how we can help you bridge the gap. 

  Talk to an expert!

Disclaimer: the use cases and images in this post were taken from WebSpoon author Hiromu Hota's presentation

 

 

 

Read More

Globchain recap

Nov 27, 2017 8:30:00 AM / by Lokke Bruyndonckx posted in GlobChain, BlockChain, pentaho, erp, cloud, co-creation, logistics

0 Comments

Read More

Subscribe to Email Updates

Recent Posts