In this blog post, you will learn how to scrape products, prices and reviews from Belk with ParseHub, our free web scraper!

Belk, Inc. was founded in 1888 in North Carolina, first under the name New York Racket, then Belk Brothers, and finally Belk. The company has over 300 locations across 16 states and over 17,000 employees, and it generated over $3.3 billion in revenue in 2022. The Belk department stores and website primarily sell women’s, men’s and kids’ clothing, shoes, handbags, jewelry, beauty, and bed and bath products, and much more!

Let’s begin scraping!

Step 1: Scraping Products

  1. Firstly, run the ParseHub application and log in.
  2. Click the New Project button.
  3. Enter the URL you would like to scrape; we will be scraping men’s shoes:
  4. When the page loads, click the first product’s name. The rest of the product names should turn yellow.
  5. Click the next product’s name to train the algorithm; you may need to repeat this for any products that were not selected.
  6. Rename this selection on the left to “product”.

Step 2: Scraping Relative Data

We will use ParseHub’s Relative Select tool to scrape additional details from each product:

  1. Begin by clicking the PLUS(+) button next to your product selection.
  2. Choose “Relative Select”.
  3. Click the first product’s name, then its price.
  4. Do the same for the next product to train the algorithm; you may need to repeat this for any other products whose price is not extracted.
  5. Rename this selection on the left to “price”.
  6. You can repeat these steps for other data, such as the reviews.
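
Once the project has been run and exported, the “product” and “price” selections show up as fields in the export. Below is a minimal Python sketch of tidying that data, assuming a JSON export shaped as a “product” list whose items carry a “name” and a “price” field. The sample values, the field layout and the helper names (`parse_price`, `load_products`) are illustrative assumptions, so adjust them to match your own export.

```python
import json
import re

# Hypothetical sample shaped like a JSON export: a "product" list whose
# items carry a "name" and, when it was extracted, a "price".
# The product names and prices here are made up for illustration.
SAMPLE_EXPORT = """
{
  "product": [
    {"name": "Example Sneaker", "price": "$59.99"},
    {"name": "Example Loafer", "price": "$1,120.00"},
    {"name": "Example Boot"}
  ]
}
"""


def parse_price(text):
    """Return the first number found in a price string as a float, or None."""
    if not text:
        return None
    match = re.search(r"\d[\d,]*(?:\.\d+)?", text)
    if match is None:
        return None
    # Drop thousands separators before converting.
    return float(match.group(0).replace(",", ""))


def load_products(raw_json):
    """Turn the export into a list of (name, price) tuples."""
    data = json.loads(raw_json)
    return [
        (item.get("name"), parse_price(item.get("price")))
        for item in data.get("product", [])
    ]


if __name__ == "__main__":
    for name, price in load_products(SAMPLE_EXPORT):
        print(name, price)
```

Products with no price come back as `None`, which makes it easy to spot the items that still need training with Relative Select.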

Step 3: Pagination

  1. Scroll all the way down until you see the page navigation button.
  2. Click the PLUS(+) button next to your page selection (not the product selection).
  3. Choose “Select” and click the next button chevron.
  4. Rename the selection to “pagination” and expand it.
  5. Then, delete the two extractions inside it.
  6. Click the PLUS(+) button next to your pagination selection and choose “Click”.
  7. On the popup, choose “Yes” and then enter the number of additional pages to scrape!

Step 4: Bypassing Blocks

During our test scrape, our results came back empty, which means that we ran into a scrape block. To bypass this, we need to use ParseHub’s IP Rotation. Note that this is a paid feature.

  1. Begin by clicking the Gear icon at the top left of ParseHub.
  2. Click “Settings” and then tick the Rotate IP Addresses checkbox.
  3. Click “Ok”, and you are ready to scrape!

Step 5: Begin Scraping

Great job! To begin scraping all the pages you specified in the pagination step, click the green Get Data button on the left pane of ParseHub. You can Test, Run or Schedule your scrape: Test is great for finding faults in your project, Run will run your scrape on ParseHub’s servers, and Schedule will help you keep your data up to date!
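
If you would rather pull the results of a finished run into a script than download them by hand, ParseHub also offers a REST API. The Python sketch below is a rough illustration only: the endpoint path and the `api_key` and `format` parameters reflect our reading of the public API documentation, the gzip handling is a defensive assumption, and the token and key values are placeholders, so please check the API reference before relying on it.

```python
import gzip
import json
import urllib.parse
import urllib.request

# Assumed base URL of the ParseHub REST API (v2); verify against the API docs.
API_BASE = "https://www.parsehub.com/api/v2"


def last_ready_run_url(project_token, api_key):
    """Build the URL for the data of a project's most recent finished run."""
    query = urllib.parse.urlencode({"api_key": api_key, "format": "json"})
    return f"{API_BASE}/projects/{project_token}/last_ready_run/data?{query}"


def fetch_last_ready_run(project_token, api_key):
    """Download and decode the last ready run as a Python dict."""
    url = last_ready_run_url(project_token, api_key)
    with urllib.request.urlopen(url) as response:
        body = response.read()
    # Decompress only if the body starts with the gzip magic bytes.
    if body[:2] == b"\x1f\x8b":
        body = gzip.decompress(body)
    return json.loads(body.decode("utf-8"))


if __name__ == "__main__":
    # Placeholder credentials: replace with your own project token and API key.
    data = fetch_last_ready_run("YOUR_PROJECT_TOKEN", "YOUR_API_KEY")
    print(len(data.get("product", [])), "products scraped")
```

The “product” key in the usage example matches the selection name chosen in Step 1; if you named your selection differently, use that name instead.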

Here is what our sample export looked like:

Running into scraping issues? Contact our live chat support!

Happy Scraping! 💻