Skip to content

wz is a faster alternative to GNU wc with UTF8 support and human readable output

License

Notifications You must be signed in to change notification settings

Altair-Bueno/wz

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

95 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Wortzahl! Count words fast

wz is a faster alternative to GNU wc with UTF8 support and human readable output, written in Rust

wz

Installation

Cargo

cargo install wz
cargo install --git https://github.com/Altair-Bueno/wz.git

Features

It's fast!

wz is heavily optimized for performance. Even if you are counting multiple files, thanks to rayon. Just look at the benchmarks

Human readable and machine readable formats

Output the results on a nice table, or pipe them to another program that reads JSON. Stop messing with sed and awk!

$ wz *(.) --output json | jq .total.lines
1470

Multiple encoding support

Characters and line lengths are count using UTF8 or UTF161 encoding, meaning that files with non ASCII characters are count correctly

Multiple line breaks support

Got a file from an old Macintosh? Change the line break to carriage returns ('\r')

wz macintosh.txt -n cr

Performance

See BENCH.md

Usage

Run wz --help to see the full list of options

Footnotes

  1. UTF16 support coming later