You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.rst
+44-33Lines changed: 44 additions & 33 deletions
Original file line number
Diff line number
Diff line change
@@ -1,7 +1,11 @@
1
-
BASC Thread Archiver
2
-
====================
1
+
BASC Archiver
2
+
=============
3
3
4
-
This script uses the `4chan API <https://github.com/4chan/4chan-API>`_ to:
4
+
The **BASC Archiver** is a Python library used to archive imageboard threads. It uses the `4chan API <https://github.com/4chan/4chan-API>`_ with the py4chan wrapper. Developers are free to use the BASC-Archiver library for some interesting third-party applications, as it is licensed under the LGPLv3.
5
+
6
+
It comes with a CLI interface for archiving threads, the **thread-archiver**. (A GUI interface, the BASC-Archiver, is under development.)
7
+
8
+
The **thread-archiver** is designed to archive all content from a 4chan thread:
5
9
6
10
* Download all images and/or thumbnails in given threads.
7
11
* Download a JSON dump of thread comments using the 4chan API.
@@ -11,11 +15,28 @@ This script uses the `4chan API <https://github.com/4chan/4chan-API>`_ to:
11
15
* Keep downloading until 404 (with a user-set delay)
12
16
* Can be restarted at any time
13
17
14
-
This script is designed to replace the typical "Right-click Save As, Web Page Complete" action, since that does not save full-sized images or JSON.
18
+
The **thread-archiver** replaces the typical "Right-click Save As, Web Page Complete" action, which does not save full-sized images or JSON. It works as a guerilla, static HTML alternative to Fuuka.
15
19
16
-
This can also be used as a guerilla, static HTML alternative to Fuuka.
20
+
Usage
21
+
=====
17
22
18
-
Part of the JSON-based-chanarchiver by Antonizoon Overtwater, built 2013/04/04.
--path=<string> Path to folder where archives will be saved [default: ./archive]
31
+
--runonce Downloads the thread as it is presently, then exits
32
+
--delay=<int> Delay between thread checks [default: 20]
33
+
--nothumbs Don't download thumbnails
34
+
--thumbsonly Download thumbnails, no images
35
+
--ssl Download using HTTPS
36
+
--silent Suppresses mundane printouts, prints what's important
37
+
--verbose Printout more information than normal
38
+
-h --help Show help
39
+
-v --version Show version
19
40
20
41
Example
21
42
=======
@@ -27,11 +48,11 @@ Example
27
48
Installation
28
49
============
29
50
51
+
The BASC-Archiver works on both Python 2.x and 3.x, and can be installed on Windows, Linux, or Mac OS X.
52
+
30
53
Windows
31
54
-------
32
55
33
-
> **Note:** This script is now fixed for Windows and Python 3.x. PyQt GUI coming soon.
34
-
35
56
1. Install `ActivePython <http://www.activestate.com/activepython/downloads>`_, Either version 2.x and 3.x will work. Make sure to enable the PyPM option, or else `pip will not be installed! <http://stackoverflow.com/questions/4750806/how-to-install-pip-on-windows/4750846#4750846>`_
36
57
2. After installation, go to the Start Menu and under the **ActiveState ActivePython** programs folder, click **Python Package Manager (PyPM)**.
37
58
3. A command prompt will appear. Type in the command below and press enter:
6. Sometimes, 4chan will make changes to it's API. We aim to update the script as soon as any change occurs, so if something's not working right, first use the command below to update. If you're still having problems, raise an issue on our `Github. <https://github.com/bibanon/BA-4chan-thread-archiver>`_
76
+
6. Sometimes, 4chan will make changes to it's API. We aim to update the script as soon as any change occurs, so if something's not working right, use the command below to upgrade to the latest version. If you're still having problems, raise an issue on our `Github. <https://github.com/bibanon/BA-4chan-thread-archiver>`_
4. Sometimes, 4chan will make changes to it's API. We aim to update the script as soon as any change occurs, so if something's not working right, first use the command below to update. If you're still having problems, raise an issue on our `Github. <https://github.com/bibanon/BA-4chan-thread-archiver>`_
99
+
4. Sometimes, 4chan will make changes to it's API. We aim to update the script as soon as any change occurs, so if something's not working right, use the command below to upgrade to the latest version. If you're still having problems, raise an issue on our `Github. <https://github.com/bibanon/BA-4chan-thread-archiver>`_
79
100
80
101
::
81
102
@@ -84,41 +105,31 @@ Linux/Mac
84
105
Where to Post Archived threads
85
106
===============================
86
107
87
-
After archiving your threads, you can just upload the entire `4chan` to any static HTML host (no PHP needed). We strongly recommend that you share them with the world on some kind of Static HTML host, such as the following:
108
+
Your archived threads can be viewed from any web browser. Just enter the thread's folder, and open the HTML file.
109
+
110
+
Alternatively, you can upload the entire **archive** folder to any static HTML host, no PHP required. We strongly recommend that you share them with the world on some kind of Static HTML host, such as the following:
88
111
89
112
* Github Pages
90
113
* Gitorious Pages
91
114
* Google Pages
115
+
* 000webhost
92
116
* And more!
93
117
94
118
Please make sure that your content follows the chosen host's Terms of Service policies (keep your pr0n to yourself, etc.).
95
119
96
-
In the future we will make an extension that records links to currently downloaded threads in an index.html file...
97
-
98
-
Modifications to original
99
-
==========================
100
-
101
-
Originally forked from Socketub's `4chan-thread-archiver. <https://github.com/socketubs/4chan-thread-archiver>`_
102
-
103
-
However, all the original has long since been replaced, and the scripts are totally different. Here is a list of additions:
104
-
105
-
* Based on `py4chan <https://github.com/e000/py-4chan>`_
106
-
* Downloads HTML dump of thread
107
-
* New --thumbsonly option to download thumbnails and no images
108
-
* Code modularization
109
-
* More comments in code
110
-
* Support for new 4cdn.org server
120
+
Wishlist
121
+
========
111
122
112
-
More info and a full journal can be found in ``documents/log.md``.
123
+
* **Migrate to BA-py4chan**, the improved fork of Edgeworth's original py-4chan wrapper.
124
+
* **.chan.arc** - Standard archival format definition for imageboards.
125
+
* Create a **pyFuuka**, for archiving from Fuuka's API.
126
+
* **index.html list** - In the future we will make an extension that records links to currently downloaded threads in an index.html file...
113
127
114
128
License
115
129
=======
116
130
117
-
The 4chan Archiver Class is based on Bibliotheca Anonoma `BA-4chan-thread-archive<https://github.com/bibanon/BA-4chan-thread-archiver>`_ tool, and was originally forked from Socketub's `4chan-thread-archiver.<https://github.com/socketubs/4chan-thread-archiver>`_ The original license of Socketub's archiver is the GNU Affero General Public License v3 or later.
131
+
The 4chan Archiver Class is jointly written and maintained by by `antonizoon.`_ <https://github.com/antonizoon> and `Daniel Oaks.`_ <https://github.com/DanielOaks>.
118
132
119
-
Wishlist
120
-
=========
133
+
It is based on, and supersedes the Bibliotheca Anonoma's `BA-4chan-thread-archiver <https://github.com/bibanon/BA-4chan-thread-archiver>`_ tool, written by `antonizoon.`_ <https://github.com/antonizoon>.
121
134
122
-
* Prompt user for metadata information.
123
-
* Define the ``.chan.zip`` format for 4chan thread archive transfer
124
-
* Create a PyQt GUI
135
+
The BA-4chan-thread-archiver was originally forked from Socketub's `4chan-thread-archiver. <https://github.com/socketubs/4chan-thread-archiver>`_, originally licensed under the GNU Affero General Public License v3 or later.
0 commit comments