As the input is ready for extraction, we will begin with HTML parsing using the DOM method.
If you don't know what DOM is, you can have a quick start with the DOM tutorial at:
Let's move on to the details of how it works in Jsoup.
This section will parse the content of the page at, http://jsoup.org.
index.html file in the project is provided if you want to have a file as input, instead of connecting to the URL.
The following screenshot shows the page that is going to be parsed:
By viewing the source code for this HTML page, we know the site structure. ...