Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Save DocumentCloud Cloudflare Stats #1860

Open
morisy opened this issue May 6, 2024 · 4 comments
Open

Save DocumentCloud Cloudflare Stats #1860

morisy opened this issue May 6, 2024 · 4 comments
Assignees

Comments

@morisy
Copy link
Member

morisy commented May 6, 2024

Cloudflare keeps a variety of interesting statistics, however it only maintains them for 30 days after which the information is purged. It's also not the most usable of interfaces

Image

We'd like to capture some data to help assist evaluating content to promote as well as impact of the site.

On the backend, I'd like to start storing each previous day's unique visitors.

Additionally, it would be good to start generating an email that includes the 25 most popular paths (configured so they have the full URL, not just the backend of it like in the currently display):

Image

over the previous 24 hours, along with the statistics.

The email could:

Subject Line: DocumentCloud Top Docs - May 6, 2024

  1. 2023-01699-F Responsive Documents 3 (Uploader Name, Organization) -- 435,604 Pageviews https://www.documentcloud.org/documents/24649101-2023-01699-f-responsive-documents-3

  2. Another-Document-Name (Uploader Name, Organization) -- 335,604 Pageviews https://www.documentcloud.org/documents/24649101-2023-01699-f-responsive-documents-3

etc etc. Depending on how much time pulling the document title, uploader name, etc takes, even just the URL and page views would be very helpful.

Others might have other wishlist items that would be good to have, so would be good to get them to weigh in on data that would be good to have, format, etc.

@mitchelljkotler mitchelljkotler self-assigned this May 13, 2024
@eyeseast
Copy link

@duckduckgrayduck
Copy link
Contributor

This seems to be the GraphQL groups that should be queried to access the Web Analytics programmatically:
https://community.cloudflare.com/t/downloading-web-analytics-data/473295
I think I've got a working graphql query to get some of the information we want, but it looks like graphql is paywalled:
https://community.cloudflare.com/t/graphql-not-autorized-for-that-account/469748
I opened a ticket with Cloudflare to see if they would grant us free or reduced access to the resource. Otherwise, we may choose to not move forward with this

@duckduckgrayduck
Copy link
Contributor

I followed up with Cloudflare again on Friday and haven't heard back after Christopher's initial response

@duckduckgrayduck
Copy link
Contributor

My ticket has been switched over from Zendesk to Salesforce as they changed ticketing systems. Still no reply.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants