- http
- glom
- BeautifulSoup
Once you have an active Web Scraping API subscription, set your password in the dashboard under Web Scraping API > Authentication method. To try a request straight from the dashboard, open the Web Scraping API > API playground tab and click Send Request. An example curl request is generated on the right.
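For readers who prefer to send the request from Python rather than the playground, the following is a minimal sketch of building an authenticated request with the standard library. The endpoint URL, the credentials, the use of HTTP Basic authentication, and the `{"url": ...}` payload shape are all placeholders and assumptions, not values taken from this document; copy the real ones from the curl example the playground generates.

```python
import base64
import json
import urllib.request

# Hypothetical placeholders: replace with the endpoint and credentials
# shown in your own dashboard. None of these values come from this README.
API_URL = "https://scraper-api.example.com/v2/scrape"
USERNAME = "your_username"
PASSWORD = "your_password"


def build_request(target_url: str) -> urllib.request.Request:
    """Build (but do not send) a POST request that uses HTTP Basic auth."""
    # Basic auth is an assumption; it encodes "username:password" in base64.
    token = base64.b64encode(f"{USERNAME}:{PASSWORD}".encode()).decode()
    # The payload shape is assumed; check the generated curl example.
    body = json.dumps({"url": target_url}).encode()
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Basic {token}",
            "Content-Type": "application/json",
        },
    )


request = build_request("https://www.trustpilot.com/review/example.com")
# To actually send it: urllib.request.urlopen(request).read()
```

The request is only constructed here, so the sketch can be inspected without network access or a live subscription.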
Parser type | Example location | Download |
---|---|---|
HTML to JSON | Trustpilot_parser.py | `curl https://raw.githubusercontent.com/Decodo/trustpilot_python_scraper/main/Trustpilot_parser.py > Trustpilot_parser.py` |
JSON to JSON | trustpilot_json_parser_glom.py | `curl https://raw.githubusercontent.com/Decodo/trustpilot_python_scraper/main/trustpilot_json_parser_glom.py > trustpilot_json_parser_glom.py` |
The HTML to JSON script (Trustpilot_parser.py) extracts review text, review dates, and star ratings directly from the HTML of a Trustpilot page and saves them to a JSON file.
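To illustrate the HTML to JSON idea, here is a minimal sketch that pulls text, date, and rating out of review markup. It is not the repository's script: it uses the standard library's `html.parser` in place of BeautifulSoup so that it runs without extra installs, and the sample markup, class names, and `data-rating` attribute are invented for illustration. Real Trustpilot markup will differ.

```python
import json
from html.parser import HTMLParser

# Invented sample markup; the tags and class names are assumptions.
SAMPLE_HTML = """
<article class="review">
  <div class="stars" data-rating="5"></div>
  <time datetime="2024-01-15">Jan 15, 2024</time>
  <p class="review-text">Great service.</p>
</article>
<article class="review">
  <div class="stars" data-rating="2"></div>
  <time datetime="2024-02-03">Feb 3, 2024</time>
  <p class="review-text">Slow delivery, but support helped.</p>
</article>
"""


class ReviewParser(HTMLParser):
    """Collect one dict per <article class="review"> element."""

    def __init__(self):
        super().__init__()
        self.reviews = []
        self._current = None   # review being built, or None outside a review
        self._in_text = False  # True while inside the review text paragraph

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "article" and attrs.get("class") == "review":
            self._current = {"text": "", "date": None, "rating": None}
        elif self._current is None:
            return
        elif tag == "time":
            self._current["date"] = attrs.get("datetime")
        elif tag == "div" and attrs.get("class") == "stars":
            self._current["rating"] = int(attrs["data-rating"])
        elif tag == "p" and attrs.get("class") == "review-text":
            self._in_text = True

    def handle_data(self, data):
        if self._in_text:
            self._current["text"] += data

    def handle_endtag(self, tag):
        if tag == "p":
            self._in_text = False
        elif tag == "article" and self._current is not None:
            self._current["text"] = self._current["text"].strip()
            self.reviews.append(self._current)
            self._current = None


parser = ReviewParser()
parser.feed(SAMPLE_HTML)
reviews_json = json.dumps(parser.reviews, indent=2)
print(reviews_json)
```

Writing `reviews_json` to a file with `open(path, "w")` would complete the "save to a JSON file" step; the path is left to the caller.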
The JSON to JSON script (trustpilot_json_parser_glom.py) extracts dozens of data points from a JSON payload that is loaded when you visit Trustpilot. Data points include, but are not limited to, business information (name, URL, website, location, etc.), reviews (text, rating, name, positive or negative), and ratings.
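The JSON to JSON reshaping can be sketched as follows. This is not the repository's script: a small dotted-path helper stands in for glom so the example runs on the standard library alone, and every field name in the sample payload (`businessUnit`, `displayName`, `trustScore`, `consumer`) is hypothetical. The rule that a rating of 4 or more counts as positive is also an assumption made for illustration.

```python
import json
from functools import reduce

# Hypothetical payload; the real JSON has different and many more fields.
RAW = json.loads("""
{
  "businessUnit": {
    "displayName": "Example Shop",
    "websiteUrl": "https://example.com",
    "trustScore": 4.3
  },
  "reviews": [
    {"text": "Great service.", "rating": 5, "consumer": {"displayName": "Ana"}},
    {"text": "Slow delivery.", "rating": 2, "consumer": {"displayName": "Ben"}}
  ]
}
""")


def get_path(data, path):
    """Follow a dotted path such as 'businessUnit.displayName' through nested dicts."""
    return reduce(lambda node, key: node[key], path.split("."), data)


def reshape(raw):
    """Flatten the nested payload into a smaller, predictable structure."""
    return {
        "business": {
            "name": get_path(raw, "businessUnit.displayName"),
            "website": get_path(raw, "businessUnit.websiteUrl"),
            "score": get_path(raw, "businessUnit.trustScore"),
        },
        "reviews": [
            {
                "text": review["text"],
                "rating": review["rating"],
                "name": get_path(review, "consumer.displayName"),
                # Assumed threshold: 4 stars or more counts as positive.
                "positive": review["rating"] >= 4,
            }
            for review in raw["reviews"]
        ],
    }


result = reshape(RAW)
print(json.dumps(result, indent=2))
```

With glom installed, the `get_path` calls could be replaced by `glom(raw, "businessUnit.displayName")` and similar dotted-path lookups.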