Hello, I am writing a news parser with Scrapy. I need it to start from the start URL, open every news article to retrieve its data, then move on to the next page and do the same there. My spider parses only the first page and does not go any further.
from scrapy.linkextractors import LinkExtractor
from scrapy.spiders import CrawlSpider, Rule


class GuardianSpider(CrawlSpider):
    name = 'guardian'
    allowed_domains = ['theguardian.com']
    start_urls = ['https://www.theguardian.com/world/europe-news']

    rules = (
        # Article links: each one is passed to parser_items
        Rule(LinkExtractor(restrict_xpaths=("//div[@class='u-cf index-page']",),
                           allow=(r'https://www.theguardian.com/\w+/\d+/\w+/\d+/\w+',)),
             callback='parser_items'),
        # Pagination links: meant to be followed to the next index page
        Rule(LinkExtractor(restrict_xpaths=("//div[@class='u-cf index-page']",),
                           allow=(r'https://www.theguardian.com/\w+/\w+?page=\d+',)),
             follow=True),
    )
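One thing that may be worth checking is the `allow` pattern of the second rule. LinkExtractor treats `allow` entries as regular expressions, so the bare `?` makes the preceding `\w+` lazy instead of matching a literal question mark, and `\w` does not match the hyphen in `europe-news`. The sketch below tests this with plain `re`. The pagination URL is an assumption based on the start URL, and the "fixed" pattern is only one possible variant, not a confirmed solution.

```python
import re

# Assumed pagination URL (hypothetical, derived from the start URL in the spider).
url = "https://www.theguardian.com/world/europe-news?page=2"

# Pattern as written in the second rule: "?" acts as a lazy quantifier here.
original = re.compile(r"https://www.theguardian.com/\w+/\w+?page=\d+")

# Possible variant: literal "\?", escaped dots, and hyphens allowed in the section name.
fixed = re.compile(r"https://www\.theguardian\.com/\w+/[\w-]+\?page=\d+")

original_matches = original.search(url) is not None
fixed_matches = fixed.search(url) is not None

print(original_matches, fixed_matches)  # prints: False True
```

If the original pattern fails like this on the real pagination links, the second rule never extracts them, which would explain why only the first page gets parsed.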