Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Scrapoxy integration #213

Open
fabienvauchelles opened this issue Aug 3, 2018 · 3 comments
Open

Scrapoxy integration #213

fabienvauchelles opened this issue Aug 3, 2018 · 3 comments

Comments

@fabienvauchelles
Copy link

Hello @NikolaiT ,

Is it a good idea to plug GoogleScraper to Scrapoxy.io ?

Best regards,
Fabien

@NikolaiT
Copy link
Owner

Hi @fabienvauchelles

Your scrapoxy tool looks really cool. I will check this tool out and crawl a bit through the documentation and probably going to integrate it in GoogleScraper.

Thanks and you will hear from me.

@fabienvauchelles
Copy link
Author

nice !

@fabienvauchelles
Copy link
Author

Scrapoxy 4 is out!

Scrapoxy is a open source proxy aggregator, allowing you to manage all proxies in one place 🎯, rather than
spreading it across multiple scrapers 🕸️.

Smartly designed for efficient traffic routing 🔀, Scrapoxy minimizes #bans and boosts success rates 🚀.

The tech stack is built on the latest NodeJS, Typescript, utilizing the NestJS and Angular frameworks.

Here are the key features:

  • ☁️ Cloud Providers with easy installation: Scrapoxy supports many cloud providers like AWS, Azure, or GCP.
  • 🌐 Proxy Services: Scrapoxy supports many proxy services like Rayobyte, IPRoyal or Zyte.
  • 💻 Hardware materials: Scrapoxy supports many 4G proxy farms hardware types, like Proxidize or XProxy.io.
  • 📜 Free Proxy Lists: Scrapoxy supports lists of HTTP/HTTPS proxies and SOCKS4/SOCKS5 proxies.
  • ⏰ Timeout free: Scrapoxy only routes traffic to online proxies to avoid inactive connection.
  • 🔄 Auto-Rotate proxies: Scrapoxy automatically changes IP addresses at regular intervals.
  • 🏃 Auto-Scale proxies: Scrapoxy monitors incoming traffic and automatically scales the number of proxies according to your needs.
  • 🍪 Sticky sessions on Browser: Scrapoxy keeps the same IP address for a scraping session, even for browsers.
  • 🚨 Ban management: Scrapoxy injects the name of the proxy into the HTTP responses.
  • 📡 Traffic interception: Scrapoxy intercepts HTTP requests/responses to modify headers, keeping consistency in your scraping stack. It can add session cookies or specific headers like user-agent.
  • 📊 Traffic monitoring: Scrapoxy measures incoming and outgoing traffic to provide an overview of your scraping session.
  • 🌍 Coverage monitoring: Scrapoxy displays the geographic coverage of your proxies to better understand the global distribution of your proxies.
  • 🚀 Easy-to-use and production-ready: Scrapoxy is suitable for both beginners and experts (Kubernetes / Helm).
  • 🔓 Open Source: And of course, Scrapoxy is open source, under the MIT license.

Checkout https://scrapoxy.io/ !

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants