The ETL flow is interrupted if the download of a catalog fails #45

Open
abenassi opened this issue Jul 24, 2019 · 0 comments
Labels
bug Something isn't working

abenassi commented Jul 24, 2019

Traceback (most recent call last):
  File "/Users/abenassi/github/series-tiempo-ar-scraping/series_tiempo_ar_scraping/download.py", line 33, in download
    verify=verify)
  File "/Users/abenassi/anaconda/envs/series-tiempo-ar-scraping-new/lib/python3.7/site-packages/requests/api.py", line 75, in get
    return request('get', url, params=params, **kwargs)
  File "/Users/abenassi/anaconda/envs/series-tiempo-ar-scraping-new/lib/python3.7/site-packages/requests/api.py", line 60, in request
    return session.request(method=method, url=url, **kwargs)
  File "/Users/abenassi/anaconda/envs/series-tiempo-ar-scraping-new/lib/python3.7/site-packages/requests/sessions.py", line 533, in request
    resp = self.send(prep, **send_kwargs)
  File "/Users/abenassi/anaconda/envs/series-tiempo-ar-scraping-new/lib/python3.7/site-packages/requests/sessions.py", line 668, in send
    history = [resp for resp in gen] if allow_redirects else []
  File "/Users/abenassi/anaconda/envs/series-tiempo-ar-scraping-new/lib/python3.7/site-packages/requests/sessions.py", line 668, in <listcomp>
    history = [resp for resp in gen] if allow_redirects else []
  File "/Users/abenassi/anaconda/envs/series-tiempo-ar-scraping-new/lib/python3.7/site-packages/requests/sessions.py", line 247, in resolve_redirects
    **adapter_kwargs
  File "/Users/abenassi/anaconda/envs/series-tiempo-ar-scraping-new/lib/python3.7/site-packages/requests/sessions.py", line 646, in send
    r = adapter.send(request, **kwargs)
  File "/Users/abenassi/anaconda/envs/series-tiempo-ar-scraping-new/lib/python3.7/site-packages/requests/adapters.py", line 514, in send
    raise SSLError(e, request=request)
requests.exceptions.SSLError: HTTPSConnectionPool(host='www.economia.gob.ar', port=443): Max retries exceeded with url: /download/infoeco/catalogo_sspm_prod.xlsx (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1056)')))

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/Users/abenassi/anaconda/envs/series-tiempo-ar-scraping-new/bin/etl", line 11, in <module>
    load_entry_point('series-tiempo-ar-scraping', 'console_scripts', 'etl')()
  File "/Users/abenassi/anaconda/envs/series-tiempo-ar-scraping-new/lib/python3.7/site-packages/click/core.py", line 764, in __call__
    return self.main(*args, **kwargs)
  File "/Users/abenassi/anaconda/envs/series-tiempo-ar-scraping-new/lib/python3.7/site-packages/click/core.py", line 717, in main
    rv = self.invoke(ctx)
  File "/Users/abenassi/anaconda/envs/series-tiempo-ar-scraping-new/lib/python3.7/site-packages/click/core.py", line 956, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/Users/abenassi/anaconda/envs/series-tiempo-ar-scraping-new/lib/python3.7/site-packages/click/core.py", line 555, in invoke
    return callback(*args, **kwargs)
  File "/Users/abenassi/github/series-tiempo-ar-scraping/series_tiempo_ar_scraping/main.py", line 52, in cli
    main(config, log_level)
  File "/Users/abenassi/github/series-tiempo-ar-scraping/series_tiempo_ar_scraping/main.py", line 65, in main
    config=config
  File "/Users/abenassi/github/series-tiempo-ar-scraping/series_tiempo_ar_scraping/base.py", line 717, in __init__
    super().__init__(identifier, parent, context)
  File "/Users/abenassi/github/series-tiempo-ar-scraping/series_tiempo_ar_scraping/base.py", line 72, in __init__
    self.init_childs()
  File "/Users/abenassi/github/series-tiempo-ar-scraping/series_tiempo_ar_scraping/base.py", line 732, in init_childs
    for catalog in self.catalogs_from_config.keys()
  File "/Users/abenassi/github/series-tiempo-ar-scraping/series_tiempo_ar_scraping/base.py", line 732, in <listcomp>
    for catalog in self.catalogs_from_config.keys()
  File "/Users/abenassi/github/series-tiempo-ar-scraping/series_tiempo_ar_scraping/base.py", line 268, in __init__
    super().__init__(identifier, parent, context)
  File "/Users/abenassi/github/series-tiempo-ar-scraping/series_tiempo_ar_scraping/base.py", line 70, in __init__
    self.init_metadata()
  File "/Users/abenassi/github/series-tiempo-ar-scraping/series_tiempo_ar_scraping/base.py", line 272, in init_metadata
    self.fetch_metadata_file()
  File "/Users/abenassi/github/series-tiempo-ar-scraping/series_tiempo_ar_scraping/base.py", line 291, in fetch_metadata_file
    config,
  File "/Users/abenassi/github/series-tiempo-ar-scraping/series_tiempo_ar_scraping/base.py", line 648, in download_with_config
    download.download_to_file(url, file_path, **config)
  File "/Users/abenassi/github/series-tiempo-ar-scraping/series_tiempo_ar_scraping/download.py", line 58, in download_to_file
    content = download(url, **kwargs)
  File "/Users/abenassi/github/series-tiempo-ar-scraping/series_tiempo_ar_scraping/download.py", line 44, in download
    raise DownloadException() from download_exception
series_tiempo_ar_scraping.download.DownloadException
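
The traceback shows the `DownloadException` for one catalog escaping the list comprehension in `init_childs`, which aborts the whole ETL run. One possible way to keep the flow going is to build the catalogs one at a time and skip the ones whose download fails. This is only a minimal sketch, not the project's actual fix: `init_catalogs_safely`, `build_catalog` and the catalog identifiers are hypothetical names, and the local `DownloadException` class stands in for `series_tiempo_ar_scraping.download.DownloadException`.

```python
import logging

logger = logging.getLogger(__name__)


class DownloadException(Exception):
    """Stand-in for series_tiempo_ar_scraping.download.DownloadException."""


def init_catalogs_safely(catalog_ids, build_catalog):
    """Build every catalog, skipping those whose download fails.

    `build_catalog` is a hypothetical callable that creates one catalog
    and may raise DownloadException. Returns (catalogs, failed_ids).
    """
    catalogs, failed_ids = [], []
    for catalog_id in catalog_ids:
        try:
            catalogs.append(build_catalog(catalog_id))
        except DownloadException:
            # Log the full traceback, then continue with the next catalog
            # instead of letting the exception stop the whole run.
            logger.exception("Could not download catalog %r; skipping it", catalog_id)
            failed_ids.append(catalog_id)
    return catalogs, failed_ids


# Usage with a fake builder that fails for one (hypothetical) catalog id.
def fake_build(catalog_id):
    if catalog_id == "sspm":
        raise DownloadException()
    return {"id": catalog_id}


catalogs, failed_ids = init_catalogs_safely(["sspm", "other"], fake_build)
```

Returning the failed identifiers alongside the catalogs lets the caller report which downloads were skipped at the end of the run rather than losing that information in the log.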
@abenassi abenassi created this issue from a note in series-tiempo-ar-scraping (To do) Jul 24, 2019
@abenassi abenassi added the bug Something isn't working label Jul 24, 2019
@abenassi abenassi moved this from To do to Done in series-tiempo-ar-scraping Sep 23, 2019