repotokens is a Python package for analyzing entire source repositories to count tokens and estimate costs for various LLMs. It can be used both as a command-line tool and as an importable Python library.
- Analyzes entire repositories to count tokens on common source file types.
- Opinionated about excluding file types and utility files that are unlikely to be useful for LLM reasoning over the repository.
- Estimates costs for different LLMs based on token count.
- Respects .gitignore files and common ignore patterns.
- Can be used as both a Python library and a command-line tool.
You can install repotokens using pip:
pip install repotokens
You can use repotokens in your Python scripts by importing it as follows:
from repotokens import analyze_directory, calculate_costs, MODELS
To analyze a directory and get token counts:
results = analyze_directory("/path/to/your/repo")
print(f"Total tokens: {results['total_tokens']}")
print(f"Processed files: {results['processed_files']}")
# Print token counts for each file
for file, tokens in results['file_tokens'].items():
    print(f"{file}: {tokens} tokens")
To calculate costs for a specific model:
model = "gpt-4o"
total_tokens = results['total_tokens']
costs = calculate_costs(total_tokens, model)
print(f"Costs for {model}:")
print(f"Input cost: ${costs['input_cost']:.2f}")
print(f"Output cost: ${costs['output_cost']:.2f}")
You can access information about all available models:
for model, prices in MODELS.items():
    print(f"{model}:")
    print(f" Input price: ${prices['input_price']} per 1M tokens")
    print(f" Output price: ${prices['output_price']} per 1M tokens")
repotokens can also be used as a command-line tool:
repotokens                                           # run on the default path (typically the current directory)
repotokens /path/to/your/repo                        # analyze a specific repository
repotokens /path/to/your/repo --model gpt-4o-mini    # estimate costs for a specific model
repotokens --version                                 # print the installed version
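If you want to run the analysis from a script or a CI job without importing the library, the CLI can be driven with Python's standard subprocess module. This sketch only uses the invocations shown above:
import subprocess

# Run the repotokens CLI against a repository and print its report
result = subprocess.run(
    ["repotokens", "/path/to/your/repo", "--model", "gpt-4o-mini"],
    capture_output=True,
    text=True,
    check=True,
)
print(result.stdout)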