Skip to content

Files

Latest commit

 

History

History

arquivamento

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
Jul 21, 2024

Arquivamento

Recursos gerais

Bellingcat caixa de ferramenta

Witness Archive Guide

Awesome Web Archiving

Web Archiving software comparison

Web

Web Archive

Extensões

Use NO MÍNIMO um navegador separado, ou seja, diferente daquele que você navega.

O Chrome costuma ter boas extensões, é possível usar variantes como o Brave ou o Chromium.

CLI

wget

https://www.gnu.org/software/wget/ Explore os parâmetros. Exemplo:

wget -i links.txt

youtube-dl

https://github.com/ytdl-org/youtube-dl

Não se engane com o nome, funciona também outras redes

snscrape

https://github.com/JustAnotherArchivist/snscrape

you-get

https://you-get.org/

twint

https://github.com/kevctae/twint

Ferramentas

Archive Box

https://github.com/ArchiveBox/ArchiveBox

Auto Archiver

https://github.com/bellingcat/auto-archiver

Archivematica

https://www.archivematica.org/en/

Perma.cc

https://perma.cc

Sugarcube

https://sugarcubetools.net/

Google Sheets + Python

!pip install snscrape
!pip install you-get

from google.colab import auth
from google.auth import default
from google.colab import drive

import gspread
import pandas as pd
import snscrape.modules.twitter as sntwitter
import os

auth.authenticate_user()
creds, _ = default()
gc = gspread.authorize(creds)

worksheet = gc.open('NOME_DA_SUA_PLANILHA').sheet1
drive.mount('/content/drive/')

df = pd.DataFrame(worksheet.get_all_values()[1:],columns=worksheet.get_all_values()[0])