Skip to content

Seiyuu image scraper in python. Uses scrapy web scraping framework.

Notifications You must be signed in to change notification settings

iequivocality/seidownpy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SeidownPy v0.5

An application that will download images from any link or any of the supported websites. Built-on top of Scrapy, a Python based web scrapping framework.

Support

Future Support

  • Trivial face-detection

Requirements

  • scrapy
  • image

Both can installed via pip

Installation Mac OS X

  1. Open Terminal
  2. Run the following command: python
  3. Install virtualenv through pip (pip install virtualenv). This is in order to prevent affecting the built-in Python executable.

Customization

Ameblo Per entry scrapy crawl ameblo -a name=name -a entry=entrynumber

./ameblo.sh name entrynumber

Bulk

  • scrapy crawl ameblo -a first=first page -a last=last_page -a name=blog -o output file

  • ameblo (required) - images will be fetched from http://ameblo.jp

  • name (required) - name of the blog where images will be fetched from

  • first (optional) - first page where images are fetched

  • last (optional) - last page where images are fetched

  • o (optional) - output file where logs are kept

./ameblo.sh name first last

Tumblr

  • scrapy crawl tumblr -a first=first page -a last=last_page -a name=blog -o output file

  • ameblo (required) - images will be fetched from http://ameblo.jp

  • name (required) - name of the blog where images will be fetched from

  • first (optional) - first page where images are fetched

  • last (optional) - last page where images are fetched

  • o (optional) - output file where logs are kept

./tumblr.sh name first last

Lineblog

  • scrapy crawl lineblog -a first=first page -a last=last_page -a name=blog -o output file

  • ameblo (required) - images will be fetched from http://ameblo.jp

  • name (required) - name of the blog where images will be fetched from

  • first (optional) - first page where images are fetched

  • last (optional) - last page where images are fetched

  • o (optional) - output file where logs are kept

./lineblog.sh name first last

Other Links

  • scrapy crawl single -a link=url

Shortcuts Scripts are added under the script folder which are basically shortcuts to the syntax, without the need for typing arguments yourself.

Reference

Version History

To be added.

About

Seiyuu image scraper in python. Uses scrapy web scraping framework.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages