Changelog
Follow up on the latest improvements and updates.
RSS
We have launched training.mrscraper.com, a collection of dummy websites to practice web scraping.
We will be writing a set of guides using these websites, but feel free to create your own standard and AI scrapers to practice against these sites.
![Xnapper-2023-08-22-07](https://canny.io/images/65b3bb268f2e0e54a810876b3a169a59.png)
You can now create
webhooks for specific scrapers
. 🎉To send events to specific scrapers, simply go to create a new webhook and toggle the "
Apply to specific scrapers
" option.If not toggled, the selected events will be sent for all your scrapers.
![specificscraper](https://canny.io/images/c45f9c3d63146d63093db1b25c6f5d61.png)
✨ Added a new option inside the advanced tab to prevent resources to load.
💡 For example, disabling images and styles when screenshot is not needed can speed-up the scraper a lot.
![advances_disable](https://canny.io/images/a3702bffbe68f929769d21068b53c0bd.png)
new
improved
[Parsers] Keep regex match
Added a new type of parser to keep one or all matches of a regex expression.
![parser config](https://canny.io/images/7e1a4b661577d18ba62a4ebce5ecd6a5.png)
![regex-parser](https://canny.io/images/419dd1134c33204d7394266b3363c0da.png)
✨ It's now possible to define a timezone for a scheduled scraper!
(additionally, it's now possible to select a default timezone in your profile settings to localize dates and timestamps).
![Xnapper-2023-04-02-17](https://canny.io/images/c84b1f74e7c318cf72bd1223ac1706a2.png)
improved
Improved scraping results page
The new results page has the following sections:
- Extracted data: View, copy and download the data from the defined extractors.
![Xnapper-2023-02-26-20](https://canny.io/images/c59a124ecfd429a3d191a1db5b6dfca5.png)
- Scraped source code: View, copy and download the HTML of the scraped website. Useful to debug unsuccessful scrapings along with the HTML scraper free tool.
![Xnapper-2023-02-26-20](https://canny.io/images/2b3c728e9f3fdf8278cb3141858b645a.png)
- Screenshot (screen or full-page): If the scraper fails or the screenshot option is enabled in the scraper, a partial or full-page screenshot will be displayed here.
![Xnapper-2023-02-26-20](https://canny.io/images/3adadf6b22c84b4596bfb6a911b02bcd.png)
It's now possible to save a screenshot of the scraped page.
Screenshots are available for all paid plans, and full-page screenshots for the Ultimate and Business plans.
If a scraping fails, a screenshoot is going to be attached, even if the screenshot option is disabled. Screenshots for failed scrapings are also triggered in the free tier.
![Xnapper-2023-02-26-21](https://canny.io/images/5773712589fe03f0bb56c7d186354436.png)
The results page now prints the extracted data with unescaped UNICODE and other chars such as Japanese or Chinese.
![Xnapper-2023-02-26-21](https://canny.io/images/9ea546c8eb9df2d31e6247083f85032a.png)
It's now possible to share a scraper configuration with other users (or the support team if you need help debugging your extractors).
You will find the share action at the edit/view scraper pages, by opening the dropdown menu.
![Xnapper-2023-02-24-20](https://canny.io/images/119b32c768536966603f030eab99452e.png)
By clicking the
share
button, you will open the configuration menu where you can enable/disable the sharing status and copy the shareable link.![Xnapper-2023-02-24-20](https://canny.io/images/80fee5bc977cf31276b24d958b8355c3.png)
Fixed an error where the URL validation logic was not accepting scrapings to URLs containing non ASCII chars (Japanese, Chinese, German, etc).
Load More
→