There is a htmlunit parser that parses the site where the content is loaded by Ajax, example code:
java.util.logging.Logger.getLogger("com.gargoylesoftware.htmlunit").setLevel(Level.OFF); WebClient webClient = new WebClient(BrowserVersion.FIREFOX_3); webClient.setThrowExceptionOnScriptError(false); webClient.setAjaxController(new NicelyResynchronizingAjaxController()); HtmlPage page = (HtmlPage) webClient.getPage("https://site.name"); div with the overview loaded via the AJAX; when you call page.asXml() , the necessary div is inside the page ; when you call page.getElementById("overview") a NullPointerException error occurs. How to make the required div be accessible via getElementById() ?