Scraper breaking on Alameda County Water District #176

zstumgoren · 2024-02-28T18:16:24Z

Alameda Water County Water District scrape is failing:

https://www.acwd.org/AgendaCenter

Removing the agency from our GDoc scraping list until we can debug.

Stacktrace from Prefect:

ERROR ON SCRAPER TASK for https://www.acwd.org/AgendaCenter. Here's the stack trace:
Traceback (most recent call last):
  File "/etl/utils/scrape.py", line 59, in scrape_agency
    assets_meta = site.scrape(
                  ^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/civic_scraper/platforms/civic_plus/site.py", line 68, in scrape
    file_metadata = self.parser_kls(raw_html).parse()
                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/civic_scraper/platforms/civic_plus/parser.py", line 20, in parse
    metadata = self._extract_asset_data(divs)
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/civic_scraper/platforms/civic_plus/parser.py", line 42, in _extract_asset_data
    cmte_name = self._committee_name(div)
                ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/civic_scraper/platforms/civic_plus/parser.py", line 71, in _committee_name
    div.h2.span.extract()
    ^^^^^^^^^^^
AttributeError: 'NoneType' object has no attribute 'span'
Traceback (most recent call last):
  File "/etl/utils/scrape.py", line 59, in scrape_agency
    assets_meta = site.scrape(
                  ^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/civic_scraper/platforms/civic_plus/site.py", line 68, in scrape
    file_metadata = self.parser_kls(raw_html).parse()
                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/civic_scraper/platforms/civic_plus/parser.py", line 20, in parse
    metadata = self._extract_asset_data(divs)
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/civic_scraper/platforms/civic_plus/parser.py", line 42, in _extract_asset_data
    cmte_name = self._committee_name(div)
                ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/civic_scraper/platforms/civic_plus/parser.py", line 71, in _committee_name
    div.h2.span.extract()
    ^^^^^^^^^^^
AttributeError: 'NoneType' object has no attribute 'span'

zstumgoren added the bug Something isn't working label Feb 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scraper breaking on Alameda County Water District #176

Scraper breaking on Alameda County Water District #176

zstumgoren commented Feb 28, 2024 •

edited

Loading

Scraper breaking on Alameda County Water District #176

Scraper breaking on Alameda County Water District #176

Comments

zstumgoren commented Feb 28, 2024 • edited Loading

zstumgoren commented Feb 28, 2024 •

edited

Loading