Skip to content

🚧 [IN DEVELOPMENT] - Automatic ETL & Data Products for 1CijferHO (Local). ⚑ Runs with uv

Notifications You must be signed in to change notification settings

asewnandan/1cijferho

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Braille fonts

1CijferHO Tool

πŸš€ Transform DUO data into research insights

Windows macOS Linux GitHub Last Commit Contributors GitHub License

🎬 Demo Video (Coming Soon!)

πŸ“‹ Overview

Note

No Python or technical knowledge required! This tool is designed for everyone, regardless of programming experience.

Transform DUO's complex educational data into research-ready insights in minutes, not hours.

🚨 The Problem

DUO data comes as:

  • Fixed-width ASCII files with no clear field separation
  • Unstructured .txt metadata files
  • Hours of manual processing required
  • High risk of errors and data loss

✨ The Solution

Automated pipeline that delivers:

  • ⚑ Fast - Process GB files in minutes using multiprocessing
  • 🎯 Accurate - Automated validation prevents errors
  • πŸ”’ Secure - Auto-anonymization of sensitive data (BSN, etc.)
  • πŸ“Š Ready - Clean CSV/Parquet output for immediate analysis

✨ Features

  • Smart Extraction - Auto-finds field positions in messy .txt files
  • Intelligent Matching - Links data files with metadata automatically
  • Turbo Conversion - Multiprocessing for blazing speed
  • Quality Validation - Comprehensive error checking
  • File Optimization - 60-80% size reduction via Parquet compression
  • Privacy Protection - Cryptographic anonymization
  • Zero Setup - uv handles all dependencies automatically

πŸ”§ First Time Setup

Warning

Do not skip these steps if this is your first time using this application. It will not work without them.

Tip

Save the repository in a Projects/CEDA folder on your main drive for quick access.

1. Get the Repository

Option A: Clone with Git (or Github Desktop)

git clone https://github.com/cedanl/1cijferho.git
cd 1cijferho

Option B: Download ZIP

Download Repository

After downloading extract the ZIP file and navigate into the folder.

2. Install uv Badge

MacOS & Linux (Terminal)

curl -LsSf https://astral.sh/uv/install.sh | sh

Windows (Powershell or Windows Terminal)

powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"

Close and reopen your terminal after installation.

Verify installation

uv self update

See the installation documentation for details and alternative installation methods.


πŸš€ Running the Application

Ready to see the magic happen? Your 1CijferHO Tool is just one command away! ✨

First, get to the right spot:

Open a terminal in your 1cijferho folder - it's super easy!

  • Windows: Shift + Right-click in folder β†’ Open in Windows Terminal
  • Mac: Right-click folder β†’ New Terminal at Folder
  • VS Code: Just click Terminal β†’ New Terminal

Or simply navigate there:

cd path/to/1cijferho

Then, launch with a single command:

uv run streamlit run src/main.py

That's it! The app will automatically spring to life in your browser. If you've completed all the steps in the First Time Setup correctly, this is the only command you'll need going forward. πŸŽ‰

Pro Tip: Create a shortcut: .bat file (Windows) or .sh script (macOS/Linux) Pro Tip: Check out our architecture.md for technical details!

Happy analyzing! βœ¨πŸ“ŠπŸ“


πŸ› οΈ Built With

uv Badge Streamlit Badge Python Badge

🀲 Support

If you find this project helpful, please consider:

  • ⭐ Starring the repo
  • πŸ› Reporting bugs
  • πŸ’‘ Suggesting features
  • πŸ’» Contributing code

If you encounter any issues or need further assistance, please feel free to open an issue or contact [email protected] | [email protected]

πŸ™ Acknowledgements

Special thanks to:

  • Ash Sewnandan & Tomer Iwan for setting the foundation with a clean, user-friendly interface and robust architecture.
  • CEDA & Npuls for making this project possible by providing valuable resources and support.

πŸ«‚ Contributors

Thank you to all the people who have already contributed to 1cijferho.

🚦 License

GitHub License

About

🚧 [IN DEVELOPMENT] - Automatic ETL & Data Products for 1CijferHO (Local). ⚑ Runs with uv

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •  

Languages