Skip to content

RoloEdits/scrapetoon

Repository files navigation

Scrapetoon

Guided by Fate

A tool for scraping information from Webtoons


scrapetoon is a commandline tool that offers a way to get a Webtoon's stats as well as the ability to download its episodes. Binaries are provided here.

Remember to always verify the data gathered before using!

Usage

scrapetoon as two subcommands: stats and download:

scrapetoon stats
scrapetoon download

Both require filling in --url | -u, --output | -o, and an --episodes | -e.

  • --url | -u takes an input like: https://www.webtoons.com/en/fantasy/tower-of-god/list?title_no=95.
  • --output | -o is given a path of which to save:
    • stats needs a filename: tower-of-god.csv
    • download needs a directory of which to save all the images of the episodes: Tower of God/
  • --episodes | -e can take either a single integer (e.g., 150) or as a range (e.g., 1..100). Ranges can also be open ended:
    • ..200 (up to and including episode 200)
    • 350.. (starting from episode 350)
    • .. (all available episodes)

One thing to watch out for is the range of episodes you wish to get. The #NUM to the right of the episode might not match up fully to the actual episodes number. To be sure, you can click on the episode you wish to get and look at url in the address bar for this part: episode_no=653. This is the number you should be working around.

Example

scrapetoon stats --url "https://www.webtoons.com/en/action/omniscient-reader/list?title_no=2154" --episodes 1..100 --output orv.csv
scrapetoon download --url "https://www.webtoons.com/en/action/omniscient-reader/list?title_no=2154" --episodes 1..100 --output "panels/"

Stats

The data gathered from here is organized like so:

id creator title genre views subscribers rating episode likes comments replies

The episode, likes, comments, and replies are all relative to one episode, with a new episode on each row.

Example plots using the data:

Download

Downloaded episodes are downloaded as one big image and saved as a png. The title of the image corresponds to the episode number.