Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make tokenize CLI tool have nicer command line arguments. #6188

Open
wants to merge 4 commits into
base: master
Choose a base branch
from

Commits on Mar 26, 2024

  1. Make tokenizer.cpp CLI tool nicer.

    Before this commit, tokenize was a simple CLI tool like this:
    
      tokenize MODEL_FILENAME PROMPT [--ids]
    
    This simple tool loads the model, takes the prompt, and shows the tokens
    llama.cpp is interpreting.
    
    This changeset makes the tokenize more sophisticated, and more useful
    for debugging and troubleshooting:
    
      tokenize [-m, --model MODEL_FILENAME]
               [--ids]
               [--stdin]
               [--prompt]
               [-f, --file]
               [--no-bos]
               [--log-disable]
    
    It also behaves nicer on Windows now, interpreting and rendering Unicode
    from command line arguments and pipes no matter what code page the user
    has set on their terminal.
    Noeda committed Mar 26, 2024
    Configuration menu
    Copy the full SHA
    cd7b5f7 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    a837649 View commit details
    Browse the repository at this point in the history

Commits on Mar 28, 2024

  1. Simplify tokenize.cpp; by getting rid of handling positional style ar…

    …guments.
    
    It must now be invoked with long --model, --prompt etc. arguments only.
    Shortens the code.
    Noeda committed Mar 28, 2024
    Configuration menu
    Copy the full SHA
    71a0867 View commit details
    Browse the repository at this point in the history

Commits on May 9, 2024

  1. Configuration menu
    Copy the full SHA
    877f059 View commit details
    Browse the repository at this point in the history