SEO Macroscope: The Monster
Pre-releaseSEO Macroscope is firming up now, there's still a lot to be done, but it's close to being useful enough for day-to-day tasks.
In this release, I've reworked the way that "hyperlinks in" are managed, moving them out into the document collection manager, instead of handling them within each document. This makes it a little simpler to rebuild and scan the cross-links within the crawled collection. There are still some URL types that do not have inbound links properly managed, I'll be working on those in the next release.
I've been building out the collection of Excel reports too, combining related data into consolidated report types.
One experimental feature that I have implemented is an Levenshtein Edit Distance library to try and detect duplicate, or near-duplicate content in the collection. The intent here is to find pages that have very similar content; as these may be flagged as duplicates by the search engines.