Follow up on the latest improvements and updates.

RSS

August 22, 2023

new

Dummy websites to practice web scraping

We have launched training.mrscraper.com, a collection of dummy websites to practice web scraping.
We will be writing a set of guides using these websites, but feel free to create your own standard and AI scrapers to practice against these sites.

August 19, 2023

improved

Webhook events for specific scrapers

You can now create webhooks for specific scrapers
. 🎉
To send events to specific scrapers, simply go to create a new webhook and toggle the "Apply to specific scrapers
" option.
If not toggled, the selected events will be sent for all your scrapers.

April 19, 2023

new

Disable page resources

✨ Added a new option inside the advanced tab to prevent resources to load.
💡 For example, disabling images and styles when screenshot is not needed can speed-up the scraper a lot.

April 19, 2023

new

improved

[Parsers] Keep regex match

Added a new type of parser to keep one or all matches of a regex expression.

April 2, 2023

new

Scheduler and account timezone

✨ It's now possible to define a timezone for a scheduled scraper!
(additionally, it's now possible to select a default timezone in your profile settings to localize dates and timestamps).

February 26, 2023

improved

Improved scraping results page

The new results page has the following sections:

Extracted data
: View, copy and download the data from the defined extractors.

Scraped source code
: View, copy and download the HTML of the scraped website. Useful to debug unsuccessful scrapings along with the HTML scraper free tool.

Screenshot (screen or full-page): If the scraper fails or the screenshot option is enabled in the scraper, a partial or full-page screenshot will be displayed here.

February 26, 2023

new

Scraping screenshots

It's now possible to save a screenshot of the scraped page.
Screenshots are available for all paid plans, and full-page screenshots for the Ultimate and Business plans.
If a scraping fails, a screenshoot is going to be attached, even if the screenshot option is disabled. Screenshots for failed scrapings are also triggered in the free tier.

February 26, 2023

fixed

Fixed an issue with encoding extracted data

The results page now prints the extracted data with unescaped UNICODE and other chars such as Japanese or Chinese.

February 24, 2023

new

Share a scraper

It's now possible to share a scraper configuration with other users (or the support team if you need help debugging your extractors).
You will find the share action at the edit/view scraper pages, by opening the dropdown menu.
By clicking the share
 button, you will open the configuration menu where you can enable/disable the sharing status and copy the shareable link.

February 21, 2023

fixed

Scrape URLs containing non ASCII chars

Fixed an error where the URL validation logic was not accepting scrapings to URLs containing non ASCII chars (Japanese, Chinese, German, etc).

→