![Banner](https://private-user-images.githubusercontent.com/53175384/408289990-be25f12a-0746-415a-b910-8a60af4167c7.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzg5MTE1MzMsIm5iZiI6MTczODkxMTIzMywicGF0aCI6Ii81MzE3NTM4NC80MDgyODk5OTAtYmUyNWYxMmEtMDc0Ni00MTVhLWI5MTAtOGE2MGFmNDE2N2M3LmpwZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMDclMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjA3VDA2NTM1M1omWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPWI3MjY4YzgzNjIyZDlhYTk3YzU3NTIyZTgzOTI0OTllM2U0NDg0MjEzZmM2MmU4MDRjNWVmZDIzNGFmYjAxYzcmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.sWYTE_Lb5MkZRsrtYpCkvJgfUuNjCbET_PGkoFCevxE)
I build, write, and showcase work on zero-shot vision, multimodality, optimization, and more (mostly transformers).
🤗 My Hugging Face profile has a lot of cool stuff, and I also write blog posts there on everything cutting-edge.
🌱 smol-vision: notebooks, scripts, and more on optimizing various zero-shot vision and multimodal models
🤗 smolagents: a lightweight library for agentic ML
Below are some write-ups on things I worked on at Hugging Face, along with illustrations of ML concepts ⬇️
- SmolVLM - small vision LM
- Even smaller SmolVLM - tiniest vision LM ever
- Introducing smolagents
- Vision LMs in smolagents
- Vision Language Models Explained
- Autoencoders Visualized
- Introduction to Quantization
- ConvNets Visualized
- PaliGemma - Google's Cutting-Edge Open Vision Language Model and PaliGemma 2