ScrapeStorm’s “Smart Mode” function automatically extracts list data and identifies the page. We can use ScrapeStorm to extract the information we need on employment websites, such as jobs, salary, company address and so on. Through the analysis of these data, we can get the current employment trends and hot industries in the UK.
ScrapeStorm for Windows, MacOS and Linux Download:
Step 1.Creating a task.
This step can refer to the previous lesson tutorial:
Product list link:
Delete unwanted data and increase the required data.
Step 2. Scraping into the the product listing page.
Select the title link column and click “Scrape Into”.
Click “Pre-login”, sign in on the popup page, then close the page. Wait a while, the extraction page will be refreshed.
On detail page click “Add Field” button and then select the element in web page to extract its related text.
Select “Modify Data” from the drop down box, and click”Extract Number”.
Step 3. Starting to extract.
Click “Start”, check “Block Ads” in the pop-up box to prevent the extraction of ads and change the request time to 5s. Then you can find that ScrapeStorm has extracted data.
Click “Export” to download your data.
After the extraction is completed, you can export the data to a local file (including excel, html, csv, etc.) and a database.
P.S. The data of the list page and the detail page will be merged during the extraction.
The following image is a screenshot of the file exported to excel2007:
If you are still confused about the process, please watch the tutorial video as below：