Skip to content

Collecting popular quotes from Goodreads (inc. relevant attributes) to figure out the logic behind popular highlights.

License

Notifications You must be signed in to change notification settings

theresia/reading-insights

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

reading-insights

Why

Wanted to see if I can figure out the logic behind what makes a sentence gets quoted / highlighted a lot.

How

Collect quotes from Goodreads (inc. relevant attributes).

What

Three crawlers:

  1. goodreads.quotes.popular

Starts from https://www.goodreads.com/quotes?page=1

Collects the first-level Popular quotes, follows paginations.

  1. goodreads.quotes.by_category

Starts from https://www.goodreads.com/quotes?page=1

Traverses the categories, collects Popular quotes for that category, follows pagination.

  1. goodreads.quotes.book

Not implemented yet. The idea is to collect all the quotes for a particular book. TODO: accept the book URL as spider arg.

About

Collecting popular quotes from Goodreads (inc. relevant attributes) to figure out the logic behind popular highlights.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages