omar-A-hassan/README.md

Omar Hassan (@omar-a-hassan)

Projects where ( Biology ) ∩ ( AI & ML ), a dash of NLP & LLMs

Selected Projects

  1. Arabic NLP Benchmarks & Evaluation

Benchmark-Whisper

Extends the Picovoice ASR benchmark to support benchmarking OpenAI's Whisper on transcribing Egyptian Arabic (ARZ) speech into ARZ and English text.

Benchmark-Nllb200

Extends lm-evaluation-harness to support machine translation between ARZ and English, using a new ARZ corpus dataset from Hugging Face and Meta's NLLB-200.

  1. LLM Hallucination Detection & Mitigation

ACE-Hallucinations

Competition solution for Russian hallucination detection (Codeforces) using the Agentic Context Engine (ACE). Gemma-3-270M (Generator) was trained with Gemini Flash 2.5 (Reflector/Curator) to learn 83 anti-hallucination strategies from 380 SberQuAD-derived examples, which were themselves transformed using Gemini Flash 2.5. The solution achieved a 0% hallucination rate through conservative refusal behavior and was deployed for the competition via llama.cpp + GGUF for GPU inference.

  1. Geospatial AI at Esri North Africa

Presto-GeoAI

As part of my GIS internship at Esri, I fine-tuned Presto (a GeoAI foundation model) to classify three crop types across Africa. The model was also entered in a GeoAI competition on Zindi, where it reached position 37 on the leaderboard. Its outputs integrate into ArcGIS and can be viewed as a feature layer.

  1. Explainable AI and Adversarial Attacks

XAI, Adversarial attacks and model hardening on CNNs tutorial

A four-step workflow: fine-tune ResNet-34/MobileNetV2 on Caltech-101, generate FGSM adversarial examples using torchattacks, interpret the models with Grad-CAM and saliency maps, and evaluate defenses (adversarial training, input transforms).
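To illustrate the FGSM step, here is a minimal sketch in plain Python. It is not the tutorial's code and does not use torchattacks: it swaps the CNN for a tiny logistic-regression model so the input gradient can be written by hand. The weights, input, and `eps` value are made up for illustration. The core idea is the same: nudge every input feature by `eps` in the direction that increases the loss.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict(weights, bias, x):
    """Probability of class 1 under a logistic-regression model."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)

def input_gradient(weights, bias, x, y):
    """Gradient of binary cross-entropy w.r.t. the input: (p - y) * w."""
    p = predict(weights, bias, x)
    return [(p - y) * w for w in weights]

def sign(v):
    return (v > 0) - (v < 0)

def fgsm(weights, bias, x, y, eps):
    """One-step FGSM: x_adv = x + eps * sign(dLoss/dx)."""
    grad = input_gradient(weights, bias, x, y)
    return [xi + eps * sign(g) for xi, g in zip(x, grad)]

# Toy model and input (assumed values, for illustration only).
weights, bias = [2.0, -1.0, 0.5], 0.0
x, y = [0.5, -0.5, 1.0], 1

x_adv = fgsm(weights, bias, x, y, eps=0.6)
p_clean = predict(weights, bias, x)
p_adv = predict(weights, bias, x_adv)
print(f"clean p(y=1)={p_clean:.3f}, adversarial p(y=1)={p_adv:.3f}")
```

With image inputs you would normally also clamp `x_adv` back to the valid pixel range. That step is omitted here because the toy features are unbounded.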

  1. Robotics & Voice Control

SHATO — Voice-Controlled Robotic Assistant

A production-grade microservices system that turns speech into schema-validated robot commands: Whisper STT → fine-tuned Gemma3-270M intent extraction → Pydantic validation (with self-correction) → Parler TTS. Built with FastAPI and Docker, with zero malformed commands reaching hardware.
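The validate-then-self-correct idea can be sketched roughly as below. This is an assumption-heavy illustration, not SHATO's actual code: it uses only the standard library as a stand-in for Pydantic, and the command names, the `RobotCommand` shape, and the `generate(text, feedback)` callable are all hypothetical.

```python
import json
from dataclasses import dataclass

ALLOWED_COMMANDS = {"move_to", "rotate"}  # hypothetical command set

@dataclass
class RobotCommand:
    command: str
    params: dict

def validate(raw: str) -> RobotCommand:
    """Parse raw model output; raise ValueError if it violates the schema."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError as exc:
        raise ValueError(f"not valid JSON: {exc.msg}") from exc
    if not isinstance(data, dict):
        raise ValueError("top-level value must be an object")
    command = data.get("command")
    if not isinstance(command, str) or command not in ALLOWED_COMMANDS:
        raise ValueError(f"unknown command: {command!r}")
    if not isinstance(data.get("params"), dict):
        raise ValueError("params must be an object")
    return RobotCommand(command, data["params"])

def extract_with_retries(generate, text, max_attempts=3):
    """Call generate(text, feedback) until its output validates, or give up."""
    feedback = None
    for _ in range(max_attempts):
        raw = generate(text, feedback)
        try:
            return validate(raw)
        except ValueError as exc:
            feedback = str(exc)  # handed back to the model on the next attempt
    return None  # nothing valid was produced, so nothing goes to hardware

# Stub standing in for the intent-extraction model.
def fake_model(text, feedback):
    if feedback is None:
        return '{"command": "fly"}'  # first attempt is malformed
    return '{"command": "rotate", "params": {"angle": 90}}'

result = extract_with_retries(fake_model, "turn right")
print(result)
```

The point of the design is that the hardware layer only ever sees a value that passed `validate`. A model that never produces valid output yields `None` instead of a malformed command.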

You're probably wondering where the Bio AI projects are. Here's one ;)

  1. Parkinson's Disease Voice Detection

Parkinsons

Production-grade ML pipeline for detecting Parkinson's disease from voice recordings using features from the Wav2Vec2 foundation model. It implements two approaches: a fine-tuned neural classifier (Wav2Vec2-base-960h → pooling → classification head with dropout) and a Random Forest on frozen embeddings. Both are trained on a balanced 81-sample dataset (HC/PD) with a stratified split. The project includes a comprehensive test suite, CI/CD (GitHub Actions), and a Makefile workflow, and is engineered for small medical datasets with configurable YAML-driven training, W&B logging, and device-agnostic execution (MPS/CUDA/CPU).
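On a dataset this small, a stratified split matters because a purely random split can leave the test set lopsided. Below is a minimal standard-library sketch of the idea. It is not the repository's implementation (which may well use scikit-learn), and the toy labels are invented for illustration rather than taken from the real dataset.

```python
import random
from collections import defaultdict

def stratified_split(labels, test_fraction=0.2, seed=0):
    """Return (train_idx, test_idx), splitting each class at the same ratio."""
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    rng = random.Random(seed)  # fixed seed keeps the split reproducible
    train_idx, test_idx = [], []
    for label in sorted(by_class):
        indices = by_class[label]
        rng.shuffle(indices)
        # Keep at least one test sample per class, even for tiny classes.
        n_test = max(1, round(len(indices) * test_fraction))
        test_idx.extend(indices[:n_test])
        train_idx.extend(indices[n_test:])
    return sorted(train_idx), sorted(test_idx)

# Toy labels (assumed, not the real HC/PD data).
labels = ["HC"] * 10 + ["PD"] * 10
train_idx, test_idx = stratified_split(labels, test_fraction=0.2)
print(len(train_idx), len(test_idx))
```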

Pinned

  1. Project-Shato (Public)

    Voice-controlled robotic assistant with natural language processing, command validation, and speech synthesis. Built with a microservices architecture.

    Python 1

  2. ACE-Hallucinations (Public)

    No hallucinations in question answering across various topics in Russian

    Python

  3. Presto-for-Crop-classification (Public)

    An attempt at the Zindi challenge "Amini GeoFM Decoding the Field Challenge"

    Python 1

  4. Adversarial-attacks-and-XAI-on-CNNS (Public)

    Fine-tuned ResNet-34 and MobileNetV2 on the Caltech-101 dataset. Tested FGSM attacks and used XAI techniques to understand both models' behaviours, then implemented two defensive measures against the a…

    Jupyter Notebook 1

  5. Benchmark-Whisper (Public)

    A comprehensive benchmark of OpenAI's Whisper STT model on translating Egyptian Arabic (ARZ) speech to English

    Jupyter Notebook 1

  6. lm-evaluation-harness (Public)

    Forked from EleutherAI/lm-evaluation-harness

    A framework for few-shot evaluation of language models.

    Python 1