5baa61e4c9b93f3f0682250b6cf8331b7ee68fd8

Cracking passwords with deep learning

Backstory

Back in 2012, LinkedIn suffered a hack that leaked the entire database of user emails and passwords. The passwords were not stored in plain text; they were hashed with SHA-1 and left unsalted. In other words, each password was put through a one-way cryptographic function, so it stays unknown unless someone guesses it exactly and reproduces the same hash. Because no salt was used, identical passwords produce identical hashes. For example, instead of:

[email protected], cat
[email protected], password
[email protected], mrsbutterworth
[email protected], cat

The password dump looked like:

[email protected], 9d989e8d27dc9e0ec3389fc855f142c3d40f0c50
[email protected], 5baa61e4c9b93f3f0682250b6cf8331b7ee68fd8
[email protected], 71785ac6c2fb928c0652cdc26e1aba389565242b
[email protected], 9d989e8d27dc9e0ec3389fc855f142c3d40f0c50
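The mapping from plain text to the dump format can be reproduced with Python's standard `hashlib`. This is a minimal sketch, not the code used by LinkedIn or by this project; the account names are hypothetical placeholders, and only the example passwords come from the listing above.

```python
import hashlib


def sha1_hex(password: str) -> str:
    """Return the unsalted SHA-1 digest of a password as lowercase hex."""
    return hashlib.sha1(password.encode("utf-8")).hexdigest()


# Hypothetical account names; only the passwords mirror the example above.
accounts = [("user_a", "cat"), ("user_b", "password"), ("user_c", "cat")]

# Build the "dump": every password is replaced by its digest.
dump = [(name, sha1_hex(pw)) for name, pw in accounts]

for name, digest in dump:
    print(name, digest)

# Without a salt, equal passwords always yield equal digests, so an
# attacker who cracks one hash has cracked every account sharing it.
shared = dump[0][1] == dump[2][1]
print("user_a and user_c share a hash:", shared)
```

The digest of `password` is the string this repository is named after.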

There are two common attacks for cracking the passwords: brute-force all combinations, or use a dictionary of previously identified passwords. The first method relies on the raw processing power of your hardware (CPU or GPU); see hashcat for a state-of-the-art example. The second method is effective but limited in scope, since it can only find passwords that are already on the predetermined list.
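Both attacks reduce to the same check: hash a candidate and see whether the digest appears in the dump. The sketch below illustrates that idea on a toy scale. The function names, the three-word list, and the length limit are assumptions made for illustration, and a real tool such as hashcat works very differently under the hood.

```python
import hashlib
import itertools
import string


def sha1_hex(password):
    """Unsalted SHA-1 digest as lowercase hex."""
    return hashlib.sha1(password.encode("utf-8")).hexdigest()


def dictionary_attack(target_hashes, wordlist):
    """Return {hash: password} for each wordlist entry found in the dump."""
    cracked = {}
    for word in wordlist:
        digest = sha1_hex(word)
        if digest in target_hashes:
            cracked[digest] = word
    return cracked


def brute_force(target_hashes, alphabet=string.ascii_lowercase, max_len=3):
    """Try every string over `alphabet` up to `max_len` characters."""
    cracked = {}
    for length in range(1, max_len + 1):
        for chars in itertools.product(alphabet, repeat=length):
            candidate = "".join(chars)
            digest = sha1_hex(candidate)
            if digest in target_hashes:
                cracked[digest] = candidate
    return cracked


# Toy dump built from the example passwords in the backstory.
targets = {sha1_hex(p) for p in ("cat", "password", "mrsbutterworth")}

# Hypothetical wordlist: finds only what it already contains.
by_dictionary = dictionary_attack(targets, ["123456", "password", "qwerty"])

# Exhaustive search: finds short passwords, but cost grows as 26**length.
by_brute_force = brute_force(targets, max_len=3)

print(sorted(by_dictionary.values()))
print(sorted(by_brute_force.values()))
```

In this toy run neither attack reaches `mrsbutterworth`: it is not on the list, and at 14 characters it is far beyond what exhaustive search can cover.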

This project proposes a third method.

Experiment

We use a simple RNN-LSTM (Recurrent Neural Network with Long Short-Term Memory) built with TensorFlow and TFLearn. The RNN reads a set of starter passwords and generates new candidate passwords from the linguistic patterns it observes. These candidates are then validated against the LinkedIn dump, and the RNN is retrained.
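The generate, validate, retrain cycle can be sketched without any deep learning dependency. In the example below a character-level Markov chain stands in for the RNN-LSTM, purely so the loop is runnable with the standard library; it is not the project's model. The function names, the toy starter list, and the choice to add validated guesses back into the training set are all assumptions.

```python
import hashlib
import random
from collections import defaultdict

END = "\n"  # marks both the start and the end of a password


def sha1_hex(password):
    return hashlib.sha1(password.encode("utf-8")).hexdigest()


def train(passwords):
    """Count character transitions; a toy stand-in for fitting the RNN."""
    counts = defaultdict(lambda: defaultdict(int))
    for pw in passwords:
        prev = END
        for ch in pw + END:
            counts[prev][ch] += 1
            prev = ch
    return counts


def sample(model, rng, max_len=16):
    """Draw one candidate password, one character at a time."""
    out, prev = [], END
    while len(out) < max_len:
        choices = model[prev]
        ch = rng.choices(list(choices), weights=list(choices.values()))[0]
        if ch == END:
            break
        out.append(ch)
        prev = ch
    return "".join(out)


def crack_loop(starter, target_hashes, rounds=3, samples_per_round=200, seed=0):
    """Alternate between sampling candidates and retraining on the hits."""
    rng = random.Random(seed)
    training_set = set(starter)
    cracked = {}
    for _ in range(rounds):
        model = train(sorted(training_set))  # sorted for reproducibility
        for _ in range(samples_per_round):
            candidate = sample(model, rng)
            digest = sha1_hex(candidate)
            if digest in target_hashes and digest not in cracked:
                cracked[digest] = candidate
                training_set.add(candidate)  # assumption: hits feed retraining
    return cracked


starter = ["cat", "hat", "car", "bar"]  # toy starter passwords
targets = {sha1_hex(p) for p in ("bat", "har", "caterpillar")}
cracked = crack_loop(starter, targets)
print(sorted(cracked.values()))
```

The stand-in can only recombine transitions it has seen, so it recovers `bat` and `har` but never `caterpillar`. How far beyond the starter list a model can generalize is the question the experiment explores.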

To improve sampling speed, we sample from the RNN using hundreds of independent parallel streams. This avoids the expensive overhead of copying to the GPU for each character sampled. The full implementation can be found in src/model.py.
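The benefit of parallel streams can be seen by counting model invocations. The sketch below is not the code in src/model.py: `ToyModel`, its three-letter alphabet, and the fixed password length are hypothetical simplifications, and each call to `step` stands in for one round trip to the GPU.

```python
import random

ALPHABET = "abc"


class ToyModel:
    """Hypothetical stand-in for the RNN. Given the last character of every
    stream, it returns one next-character weight list per stream."""

    def __init__(self):
        self.calls = 0

    def step(self, last_chars):
        self.calls += 1  # one call models one host-to-GPU round trip
        # Toy rule: make repeating the previous character unlikely.
        return [[0.1 if c == prev else 1.0 for c in ALPHABET]
                for prev in last_chars]


def sample_streams(model, n_streams, length, rng):
    """Advance all streams together: one model call per character position."""
    streams = [""] * n_streams
    last = [""] * n_streams
    for _ in range(length):
        weights = model.step(last)
        last = [rng.choices(ALPHABET, weights=w)[0] for w in weights]
        streams = [s + c for s, c in zip(streams, last)]
    return streams


rng = random.Random(0)

# 200 streams sampled side by side.
batched = ToyModel()
passwords = sample_streams(batched, n_streams=200, length=8, rng=rng)

# The same number of passwords sampled one stream at a time.
serial = ToyModel()
for _ in range(200):
    sample_streams(serial, n_streams=1, length=8, rng=rng)

print("batched calls:", batched.calls)
print("serial calls:", serial.calls)
```

With batching, the number of calls depends only on the password length, not on how many streams are sampled, which is why the per-character transfer cost is spread across hundreds of candidates.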



