Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

LinkedIn Job Extractor - Job data extraction tool from the web using Scrapy #375 #391

Merged
merged 3 commits into from
Oct 16, 2024

Conversation

Niraj1608
Copy link
Contributor

Related Issues or bug

This PR addresses issues related to the scraping of job postings from LinkedIn, including challenges with rate limits and IP blocking.

Fixes: #375

Proposed Changes

Implemented the Octoparse proxy middleware to manage requests and avoid IP bans.
Optimized the CSS selectors for improved accuracy in extracting job details.
Enhanced error handling and logging for better monitoring of scraping performance.
Added support for dynamic content scraping.

Additional Info

The Octoparse proxy middleware helps in rotating IPs and reduces the chances of getting blocked by LinkedIn during scraping.
This implementation allows for more efficient and reliable data extraction from LinkedIn job postings.

Screenshots

Screenshot 2024-10-16 194309

Copy link

👋 Thank you for opening this pull request! We appreciate your contribution to improving this project. Your PR is under review, and we'll get back to you shortly.
Don't forget to mention the issue you solved!.

To help move the process along, please tag @UppuluriKalyani, @Neilblaze, and @SaiNivedh26 for a faster review!

@Niraj1608
Copy link
Contributor Author

@UppuluriKalyani @Neilblaze kindly review my pr :)

@UppuluriKalyani UppuluriKalyani merged commit 491467f into UppuluriKalyani:main Oct 16, 2024
3 checks passed
Copy link

🎉🎉 Thank you for your contribution! Your PR #391 has been merged! 🎉🎉

@Niraj1608
Copy link
Contributor Author

@UppuluriKalyani thank you :)

@Niraj1608 Niraj1608 deleted the Jobscraper branch October 17, 2024 09:40
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

LinkedIn Job Extractor - Job data extraction tool from the web using Scrapy
2 participants