Crawling paginated JSON files

BenPottier · January 15, 2020, 2:47pm

This is actually possible with the original setup. The key points are:

The settings that worked for me in a local test (mimicking your JSON layout):

crawler.link_extraction_group=1
crawler.link_extraction_regular_expression="next"\s*:\s*"(.*?)"
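
To see what these two settings select, here is a minimal Python sketch that applies the same regular expression and reads capture group 1. It only illustrates the pattern, not the crawler's own extraction code. The sample JSON body, the example.com URL, and the helper name extract_next are assumptions made up for this illustration; your real layout may differ.

```python
import re

# Same pattern as crawler.link_extraction_regular_expression above.
NEXT_LINK = re.compile(r'"next"\s*:\s*"(.*?)"')

# Hypothetical page body; the field names other than "next" are made up.
page = '{"items": [{"id": 1}, {"id": 2}], "next": "https://example.com/api/items?page=2"}'


def extract_next(body):
    """Return the text captured by group 1, or None if the pattern does not match."""
    match = NEXT_LINK.search(body)
    # group(1) corresponds to crawler.link_extraction_group=1
    return match.group(1) if match else None


next_url = extract_next(page)

# A last page whose "next" value is not a quoted string yields no link.
last_page = extract_next('{"items": [], "next": null}')

print(next_url)
print(last_page)
```

Group 1 is the part inside the parentheses, so only the URL is returned and not the surrounding "next": key. Note that the pattern works on the raw text, so if your JSON escapes slashes (for example https:\/\/), the captured value will still contain those backslashes.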