Boundary Banter is a project focused on automating the generation of cricket news from live text commentary using advanced Natural Language Processing (NLP) techniques. This project aims to transform live sports commentary into concise news summaries, enhancing user experience and enabling real-time information dissemination.
Sporting events, especially cricket, attract a global audience. Many enthusiasts prefer reading news articles over watching live streams due to time constraints. This project addresses the need for rapid and automated news production post-match by converting live commentary into news articles using NLP.
- Enhanced User Experience: Provides timely and concise summaries of cricket matches.
- Automation in Journalism: Reduces the manual effort required for drafting reports.
- Personalized Content Delivery: Allows customization of news based on user preferences.
- Real-time Information Dissemination: Facilitates rapid sharing of match updates and analyses.
- Research Advancements: Contributes to advancements in NLP and machine learning research.
- Commercial Applications: Techniques can be applied across various domains like finance, healthcare, and e-commerce.
- Ethical Considerations: Ensures fairness, accuracy, and transparency in automated news generation.
Data for this project is collected from the ESPNCricinfo website. The dataset, named CricSum, is available for further research and development. The dataset contains live text commentaries of cricket matches, which are used for training and evaluating the model.
The project utilizes the pre-trained BART model for abstractive text summarization. The model is trained in a template2template manner to generate coherent and fluent summaries. The performance of the model is evaluated using ROUGE scores:
- ROUGE 1: High
- ROUGE L: High
- ROUGE 2: Requires improvement
To install and run this project, follow these steps:
- Clone the repository:
git clone https://github.com/younglord088/BoundaryBanter.git
- Navigate to the project directory:
cd BoundaryBanter
- Install the required dependencies:
pip install -r requirements.txt
- To process live commentary and generate summaries, run:
python generate_summary.py --input live_commentary.txt --output summary.txt
- For detailed usage and options, refer to the documentation.
We welcome contributions from the community. To contribute, please follow these steps:
- Fork the repository.
- Create a new branch (
git checkout -b feature-branch
). - Commit your changes (
git commit -m 'Add some feature'
). - Push to the branch (
git push origin feature-branch
). - Open a Pull Request.
For more details, please refer to our contributing guidelines.
This project is licensed under the MIT License. See the LICENSE file for details.
For questions, feedback, or collaboration, please contact:
- Project Lead: Your Name
- GitHub: Your GitHub Profile
I appreciate your interest and contributions to Boundary Banter!