A fictional bookstore that desperately wants to be scraped. It's a safe place for beginners learning web scraping and for developers validating their scraping technologies. Available at: books.toscrape.com
| Details | |
|---|---|
| Number of items | 1000 |
| Pagination | ✔ |
| Items per page | max 20 |
| Requires JavaScript | ✘ |
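
Because the catalogue is paginated and needs no JavaScript, a plain HTTP client is enough to walk it. The sketch below derives the page count from the figures in the table (1000 items, at most 20 per page) and builds one URL per page. The `catalogue/page-N.html` URL pattern and the helper names `page_count`, `page_urls`, and `fetch` are assumptions made for illustration, so check the pattern against the live site before relying on it.

```python
import math
from urllib.request import urlopen

# Assumed URL pattern for catalogue pages; verify against the live site.
BASE_URL = "https://books.toscrape.com/catalogue/page-{}.html"


def page_count(total_items: int, per_page: int) -> int:
    """Number of pages needed to list total_items at per_page items each."""
    return math.ceil(total_items / per_page)


def page_urls(total_items: int = 1000, per_page: int = 20) -> list[str]:
    """Build one URL per catalogue page, numbered from 1."""
    last_page = page_count(total_items, per_page)
    return [BASE_URL.format(n) for n in range(1, last_page + 1)]


def fetch(url: str, timeout: float = 10.0) -> str:
    """Download one page as text (hypothetical helper, not called below)."""
    with urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")


if __name__ == "__main__":
    urls = page_urls()
    print(len(urls), urls[0], urls[-1])
```

A more robust crawler would follow the "next" link on each page rather than computing URLs up front, since that keeps working if the item count changes.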
A website that lists quotes from famous people. It has several endpoints that present the quotes in different ways, each posing a different scraping challenge, as described below.
| Endpoints | |
|---|---|
| Default | Microdata and pagination |
| Scroll | Infinite-scroll pagination |
| JavaScript | JavaScript-generated content |
| Delayed | Same as JavaScript, but with a delay (`?delay=10000`) |
| Tableful | A messy, table-based layout |
| Login | Login with a CSRF token (any username/password works) |
| ViewState | An AJAX-based filter form with ViewStates |
| Random | A single random quote |
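
As one illustration of the challenges listed above, the Login endpoint requires sending back the CSRF token that the login form embeds. The sketch below pulls every hidden input out of a login page and merges it with the credentials into a POST body. The sample markup and the field names `csrf_token`, `username`, and `password` are assumptions about what such a form might look like, not taken from the site, and `HiddenInputParser` and `build_login_payload` are hypothetical names.

```python
from html.parser import HTMLParser
from urllib.parse import urlencode


class HiddenInputParser(HTMLParser):
    """Collect name/value pairs of all <input type="hidden"> elements."""

    def __init__(self):
        super().__init__()
        self.hidden = {}

    def handle_starttag(self, tag, attrs):
        if tag != "input":
            return
        attr = dict(attrs)
        if attr.get("type") == "hidden" and "name" in attr:
            self.hidden[attr["name"]] = attr.get("value") or ""


def build_login_payload(login_html: str, username: str, password: str) -> bytes:
    """Return a URL-encoded POST body: hidden fields plus credentials."""
    parser = HiddenInputParser()
    parser.feed(login_html)
    fields = dict(parser.hidden)
    # Field names below are assumed; inspect the real form to confirm them.
    fields.update({"username": username, "password": password})
    return urlencode(fields).encode("ascii")


# Assumed shape of a login form, used only to exercise the parser offline.
SAMPLE_FORM = """
<form action="/login" method="post">
  <input type="hidden" name="csrf_token" value="abc123"/>
  <input type="text" name="username"/>
  <input type="password" name="password"/>
</form>
"""

payload = build_login_payload(SAMPLE_FORM, "user", "pass")
print(payload)
```

In a real run, the GET that fetches the form and the POST that submits it would typically need to share a cookie jar (for example via `http.cookiejar` with `urllib.request.build_opener`), because CSRF tokens are usually tied to the session.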