You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
**This project is still very much under development, expect bugs and unimplemented features.**
5
+
**This project is under continuous development, expect bugs and unimplemented features.**
6
6
7
7
## SYNOPSIS
8
8
9
-
**SEO Macroscope** for Microsoft Windows is a website broken link checking and scanning tool, with some technical SEO functionality.
9
+
**SEO Macroscope** for Microsoft Windows is a free and open source website broken link checking and scanning tool, with some technical SEO functionality.
10
10
11
11
More information about how SEO Macroscope may be used for everyday tasks may be found in the blog at [https://nazuke.github.io/SEOMacroscope/blog/](https://nazuke.github.io/SEOMacroscope/blog/).
12
12
@@ -39,15 +39,17 @@ However, it is functional enough to be used for many link checking tasks right n
39
39
* Reporting metadata about each found URL.
40
40
* Generating an HrefLang Matrix, for multilingual websites.
41
41
* Isolating scans to URL subsets.
42
-
* Extraction and report of telephone number and email links.
42
+
* Extraction and reporting of telephone number and email links.
43
43
44
44
### SEO functionality includes:
45
45
46
46
* Set and report acceptable word, character, and pixel lengths for titles.
47
47
* Set and report acceptable word, character lengths for descriptions.
48
48
* Report on titles, descriptions, keywords, and headings.
49
49
* Optionally generate keyword occurrences across pages and the entire site.
50
-
* Scan and extract metadata for PDFs.
50
+
* Scan and extract metadata from PDFs.
51
+
* Custom filters to identify content that is missing/present.
52
+
* Data extractors to scrape arbitrary content from pages.
51
53
52
54
### Reports may be exported to Excel format. Currently, these include:
53
55
@@ -57,6 +59,8 @@ However, it is functional enough to be used for many link checking tasks right n
57
59
* Broken links.
58
60
* URI Analysis of the pages crawled.
59
61
* Reporting of duplicate content, including duplicate titles, checksums, ETags, and a Fastenshtein method.
Copy file name to clipboardexpand all lines: docs/_posts/2017-05-04-seo-macroscope-release-v1.1.0.0.md
+2
Original file line number
Diff line number
Diff line change
@@ -8,8 +8,10 @@ excerpt: "This release of SEO Macroscope includes support for automatic language
8
8
---
9
9
10
10
This release of SEO Macroscope includes support for automatic language detection of title, description, and content in your web pages and PDFs, where possible.
11
+
{: .lead }
11
12
12
13
Chiefly, this is another tool to help identify problems with multilingual websites. For example, to detect content that is in the wrong language for a particular site.
14
+
{: .lead }
13
15
14
16
Source code and an installer can be found on GitHub at:
title: "New v1.5 release of SEO Macroscope: Hard Vacuum"
4
+
date: "2017-07-11 21:00:00 -09:00"
5
+
published: true
6
+
description: "This release of SEO Macroscope includes support for custom filters and data extractors (web scraping)."
7
+
excerpt: "This release of SEO Macroscope includes support for custom filters and data extractors (web scraping)."
8
+
---
9
+
10
+
This release of SEO Macroscope includes support for custom filters and data extractors (web scraping). Content may be extracted from web pages, and some text-based documents, using CSS selectors, regular expressions, and XPath queries.
11
+
{: .lead }
12
+
13
+
The custom filters, for example, may be used to verify that all of your HTML pages have a particular tracking code installed, such as a Google Analytics tracking code.
14
+
15
+
The data extractors (web scraping) may be used to extract arbitrary content from your HTML pages, and some other document types. For example, you may want to extract all HREF attributes that match a certain pattern, or build a list of specific element contents across your site.
16
+
17
+
Chiefly, the data extractors are for identifying data in the site being crawled that may be specific to your interests, and is not already being extracted by the application.
18
+
19
+
Source code and an installer can be found on GitHub at:
Please check the [downloads page]({{ "/downloads/" | relative_url }}) for more recent versions.
24
+
25
+
This version is 64 bit only. If all goes well, I shall be continuing to release in 64bit. This should eliminate many of the out-of-memory issues that occurred with the 32 bit versions previously.
26
+
27
+
I've also fixed many minor bugs and other issues.
28
+
29
+
## New features in this release include:
30
+
31
+
* Custom filters.
32
+
33
+
* Data extractors (web scrapers) using CSS selectors, regular expressions, and XPath queries.
34
+
35
+
* Google XML and text format sitemap generators.
36
+
37
+
* More Excel and CSV format report generators.
38
+
39
+
* Include/Exclude patterns now use regular expressions.
40
+
41
+
Please report issues at [https://github.com/nazuke/SEOMacroscope/issues](https://github.com/nazuke/SEOMacroscope/issues).
* Windows 32 bit installer: [SEOMacroscope-Installer-1.1.0.0.exe](https://github.com/nazuke/SEOMacroscope/releases/download/v1.1.0.0/SEOMacroscope-Installer-1.1.0.0.exe){: .link-installer }
* Windows 64 bit installer: [SEOMacroscope-Installer-1.5.0.0.exe](https://github.com/nazuke/SEOMacroscope/releases/download/v1.5.0.0/SEOMacroscope-Installer-1.5.0.0.exe){: .link-installer }
* Windows 32 bit installer: [SEOMacroscope-Installer-1.1.0.0.exe](https://github.com/nazuke/SEOMacroscope/releases/download/v1.1.0.0/SEOMacroscope-Installer-1.1.0.0.exe){: .link-installer }
* Windows 32 bit installer: [SEOMacroscope-Installer-1.0.0.0.exe](https://github.com/nazuke/SEOMacroscope/releases/download/v1.0.0.0/SEOMacroscope-Installer-1.0.0.0.exe){: .link-installer }
0 commit comments