getpdftitle

getpdftitle extracts the title from pdf files. The extracted title is redirected to stdout.
When an explicit file name is not specified, the title from all pdf documents in the current directory is extracted and output as a list. For pdf files that do not contain title in metadata, the pdf is converted to txt file and the first non-empty line is extracted.

Usage

getpdftitle [-h] [-n] [-s] [filename [filename ...]]

positional arguments: filename Extracts title from file.pdf. Extracts from all pdf files in the
current directory if a filename is not specified

optional arguments:
-h, --help show this help message and exit
-n, --name Include filename in output
-s, --stat Show statistics of files parsed

Requirements

sudo pip install pdfrw
sudo pip install argparse

Author

Cibin Joseph ([email protected]).

License

MIT License See LICENSE for full text.

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
getpdftitle.py		getpdftitle.py
sample1.pdf		sample1.pdf
sample2.pdf		sample2.pdf
test_getpdftitle.py		test_getpdftitle.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

getpdftitle.py

getpdftitle.py

sample1.pdf

sample1.pdf

sample2.pdf

sample2.pdf

test_getpdftitle.py

test_getpdftitle.py

Repository files navigation

getpdftitle

Usage

Requirements

Author

License

About

Releases

Packages

Languages

License

cibinjoseph/getpdftitle

Folders and files

Latest commit

History

Repository files navigation

getpdftitle

Usage

Requirements

Author

License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages