Add support for crawling subdomains#27
Add support for crawling subdomains#27alexspeller wants to merge 1 commit intochriskite:nextfrom alexspeller:4419464056d3de337162
Conversation
|
This feature is very useful. |
|
MaGonglei: It is very simple to gather external links using Anemone, and comparably simple to actually check these links to verify they are valid, etc. The 'on_every_page' block is very helpful in this regard. If you'd like some code that does exactly what you are asking, I could send an example your way. |
|
Hi,wokkaflokka,thanks for your reply. Because if I use the "on_every_page" block to search the external links (e.g. "page.doc.xpath '//a[@href]') ,it seemed cost too much CPU and Memorys. If I'm wrong,give me the example. Thanks. |
Merge changes to support subdomain crawling from runa@91559bd