Having trouble selecting a button using xpath in Scrapy Python?

Question

Having trouble selecting a button using xpath in Scrapy Python?

I am currently attempting to scrape quotes from a website called using my spider code that looks something like this:

class quote(scrapy.Spider): name = 'quotes' # defining Name start_urls = [''] # Targeted urls

def parse(self, response):
    total_count = len(response.xpath('//dl/dt').getall())  # counter for loop        
    for i in range(1, total_count + 1):  # loop for retriving data continuosly

        xp_quote = f'//dl/dt[{i}]/a/text()'
        xp_writer = f'//dl/dd[{i}]/b/a/text()'
        page_quote_writer = response.xpath(xp_writer).get()
        page_quote = response.xpath(xp_quote).get()

        yield {  # dictionay return
            'Writer': (page_quote_writer if page_quote_writer != None else 'Unable to fetch'),
            'Quote': (page_quote if page_quote != None else 'Unable to fetch')
        }

    next_page = response.css('#content tbody td a::attr(href)').getall()
    print(next_page)

I have run into an issue with the "next_page" section where I cannot retrieve any data. I have double-checked with xpath as well, but the problem persists. I attempted to use Chrome's Inspect Element feature to copy the xpath and later the css selector, but it did not solve the issue. Interestingly, the same xpath and css selector work perfectly fine in Chrome's Inspect Elements Find section.

Any help would be greatly appreciated.

css python-3.x xpath scrapy

Answer 1

Answer №1

Give this xpath a shot:

//td[@id='content']//tbody//td//a[b]/@href

Answer 2

Give this xpath a shot:

//td[@id='content']//tbody//td//a[b]/@href

Having trouble selecting a button using xpath in Scrapy Python?

Answer №1

Similar questions

Responsive design in Android does not function as intended

What is the best method for positioning span content above all other elements without having it wrap around them?

The webpage failed to display the element despite its presence

Overlapping dynamic content causing CSS nested scrollbars to clash at full width of 100%

What's the most effective method for implementing a stylesheet through Javascript in a style switcher?

Guide on stacking a transformed 3D div on top of a masked layer with higher Z-index without success

Which specific transitionend (or animationend) event should I use for this application?

Specify the width of one element to always be the same as the width of another

Unable to handle a POST request initiated by an HTML Form

Error encountered in Python 2.7 with Selenium 2: 'NoneType' object does not possess the attribute 'group'

Is there a way to seamlessly incorporate a PHP tag into a Bootstrap dropdown for perfect alignment?

Overriding a CSS property with !important proves ineffective

Is a fixed div located inside a fluid grid container?

scraping disaster strikes - twisted critical error in scrapy

minimize javascript syntax (stackoverflow 2022)

svg viewbox cannot be adjusted in size

Incorporate a link to an image following a click event toggle using Jquery

trouble with the layout of the table

Obtain an additional Element within the JSON Path

For a while now, I've been attempting to transform my paragraph into a block within a section that showcases elements in an inline-flex layout