Looking for values over intranet or the Internet

This example is similar to the previous one, except that you have to look up the museum opening hours on a website instead of through a web service. In this case, you will use the HTTP Client step. This step is useful for retrieving information from websites that, unlike the source in the previous recipe, do not provide their data through web services. This method is also known as web scraping.
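To make the idea of web scraping concrete outside of Kettle, here is a minimal Python sketch of what such a lookup does: request a page over HTTP and pull one value out of the returned HTML. It is not the recipe's transformation. The page layout (a `span` element with the id `hours`), the `museum` query parameter, and the function names `extract_hours` and `fetch_hours` are all assumptions made for illustration; the real page may be structured quite differently.

```python
import re
from urllib.parse import urlencode
from urllib.request import urlopen

# Hypothetical page layout: the hours appear inside an element such as
# <span id="hours">9:00 - 17:00</span>. Adjust the pattern to the real page.
HOURS_PATTERN = re.compile(
    r'<span id="hours">\s*(.*?)\s*</span>', re.IGNORECASE | re.DOTALL
)


def extract_hours(html):
    """Return the opening hours found in the HTML text, or None if absent."""
    match = HOURS_PATTERN.search(html)
    return match.group(1) if match else None


def fetch_hours(base_url, museum_id):
    """Request the page for one museum and scrape the hours from the response."""
    # "museum" is an assumed query parameter name, not taken from the recipe.
    url = base_url + "?" + urlencode({"museum": museum_id})
    with urlopen(url, timeout=10) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        html = response.read().decode(charset, errors="replace")
    return extract_hours(html)
```

Keeping the parsing in `extract_hours`, separate from the HTTP request, means the pattern can be checked against a saved copy of the page before pointing `fetch_hours` at a live server.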

Getting ready

You must have a database with the museum structure shown in Appendix A, Data Structures, and a web page that provides the museum opening hours. The recipe uses an ASP page named hours.asp, but you can use the language of your preference. This recipe will require a server that supports ASP (or the ...
