Currently, my main goal is to extract and save a single image from a website post logging in. After examining the image, I discovered that it has a full xpath of
/html/body/form/main/div/section/div[1]/div/div[2]/div/img
. My plan is to utilize beautiful soup or an image crawler to save the image into a variable and then use tesseract to extract text from the image. So far, I've encountered difficulties with urllib, urllib.requests, and selenium's method of reading images by xpath. I initially attempted to use selenium to save the image but yielded no successful outcomes. At this point, I am seeking assistance with the coding aspect to determine if it's feasible to store the image in a variable and whether tesseract can access the image through that variable. Both the image samples and their inspection images are provided below (the highlighted image showcases the inspected text). Please note that the form displayed is only a representation and does not actually exist in reality - at least to my knowledge. Any guidance on this matter would be greatly appreciated. Thank you.
Image 1:
https://i.stack.imgur.com/kpJ55.png
Image 2: