Extracting One Table
Among Several in a Web Page
This tutorial will take you through the process of extracting data from a particular table in a page.
While this window is showing instructions, the user interface of OutWit Hub remains operational.
You can still interact normally with the application and you can move this tutorial window around on the screen to better see the parts of the interface that you want.
MLB Top Teams
If you scroll down this wikipedia page, you will find several HTML tables.
We want to extract the table of World Series Records.
When selecting the 'tables' view in the left side panel, the program displays all the HTML table content in the view's datasheet.
You just need to select a row within the table we are interested in, right-click on it and choose "Select Block".
The program will select the whole block of data corresponding to the desired table.
Then, to delete the rows you do not want, right-click on the selection and choose "Delete Unselected".
In the remaining data, the top line is a title we do not want to keep, neither do we need the footnotes, at the bottom. We can simply delete these with the delete key.
The data is now ready to be exported.
The export preview panel displays the extracted data as it will be exported in the format that you select in the top left menu.
Try saving an export file on your hard disk in the format you prefer.
Now try on your own pages
You can now grab virtually any tables from Web pages. With these functions and the many others you will find in the help center, you can feed excel spreadsheets, databases or websites with readily usable data.
We will publish other tutorials to lead you through the main features of OutWit Hub. Stay tuned.