-
Notifications
You must be signed in to change notification settings - Fork 1
Using DOM
Alex edited this page Nov 21, 2013
·
1 revision
Usually when we want to crawl a site we must work with the real source code of the site.
But maybe, for some reason, there is a site where if we start the addon the site disappear or maybe we just prefer to work with the DOM because we can obtain the html with the JS executed.
To use this feature, just press DOM button
- Could resolve some problems that appear using the source code. (The problems coming from the source code are usually because a not well formed HTML). Problems like not seeing the code or obtaining an illogical suggested regex.
- JS executed. Can be interesting for some cases.
- If we have want to make a crawler, we usually are going to work with the source code and not with the DOM.
The DOM and the source code can have some differences, not only JS executed, but things like the position of an attribute in a tag.
E.g:
DOM:<a href="#" title="The link">
SC:<a title="The link" href="#">
So if we work with DOM, we can obtain a regex that maybe is not going to work with the source code.