After spending several hours researching and experimenting, I find myself a bit confused about the topic at hand.
My issue: I am attempting to retrieve the complete HTML content (including dynamically generated JavaScript content) of a specific web page. Here's what I've already attempted:
- I initially tried using Jsoup, but had to switch gears due to its inability to handle JavaScript content.
- I experimented with HtmlUtil, but encountered numerous errors while loading the targeted webpage (such as Css error, runtimeError, EcmaError, etc.)
- I resorted to using the basic Chrome function to save the entire webpage content, then utilized the Jsoup library to extract the specific information I needed. This workaround proved to be the only way to achieve my desired results.
My current question is: How can I replicate the functionality of the "save as" feature in a browser, or more broadly, how can I extract the full HTML content first and then utilize Jsoup to parse the static HTML content effectively?
Thank you in advance for your guidance and assistance!