Skip to content

SEO Macroscope: The Monster

Pre-release
Pre-release
Compare
Choose a tag to compare
@nazuke nazuke released this 14 Mar 14:49
· 513 commits to master since this release

SEO Macroscope is firming up now, there's still a lot to be done, but it's close to being useful enough for day-to-day tasks.

In this release, I've reworked the way that "hyperlinks in" are managed, moving them out into the document collection manager, instead of handling them within each document. This makes it a little simpler to rebuild and scan the cross-links within the crawled collection. There are still some URL types that do not have inbound links properly managed, I'll be working on those in the next release.

I've been building out the collection of Excel reports too, combining related data into consolidated report types.

One experimental feature that I have implemented is an Levenshtein Edit Distance library to try and detect duplicate, or near-duplicate content in the collection. The intent here is to find pages that have very similar content; as these may be flagged as duplicates by the search engines.