A RESULT DATA SCRAPPER AND ANALYZER with CAPTCHA SOLVER using tesseract-OCR, automation in SELENIUM and PANDAS dataframe.
A Scrapper that scraps results from various GTU result sites.
Implemented using Python.
It uses Text Captcha Human Verification to get the result.
Here i cracked the verification process with OCR(Optical Character Recognition) module.
By using selenium and pytesseract ocr module we can scrap any Text Captcha Human Verification.
Used Pandas for managing data in tabular format.
Here i provided source codes for scrapping data and further used for analytics.
https://youtu.be/2nPUuaq4RRI (gturesults.in)
https://youtu.be/jXfwSVq0uA8 (students.gtu.ac.in)