add schools data set from INEP #190

Open
rafapereirabr opened this issue Oct 8, 2020 · 8 comments

@rafapereirabr
Member

Data available at http://portal.inep.gov.br/web/guest/dados/catalogo-de-escolas

@rafapereirabr
Member Author

rafapereirabr commented Oct 8, 2020

This is a rather simple data set to include in the geobr package. The challenge here is in the translation of the columns. So far, this is what I'm proposing, but please feel free to add your suggestions. Perhaps this is something @schmert could give us a hand with. Carl has helped us many times so that geobr does not get lost in translation.

dplyr::select(df,
              abbrev_state = 'UF',
              name_muni = 'Município',
              code_school = 'Código INEP',
              name_school = 'Escola',
              education_level = 'Etapas e Modalidade de Ensino Oferecidas',
              education_others = 'Outras Ofertas Educacionais',
              admin_category = 'Categoria Administrativa',
              address = 'Endereço',
              phone_number = 'Telefone',
              government_level = 'Dependência Administrativa',
              private_school_type = 'Categoria Escola Privada',
              conveniada_governo = 'Conveniada Poder Público',
              regulated_education_counsel = 'Regulamentação pelo Conselho de Educação',
              service_restriction = 'Restrição de Atendimento',
              size = 'Porte da Escola',
              urban = 'Localização',
              location = 'Localidade Diferenciada',
              y = 'Latitude',
              x = 'Longitude'
)

@schmert

schmert commented Oct 9, 2020 via email

@rafapereirabr
Member Author

Thanks for your help, Carl. I'll comment on each question below:

  • education_others = 'Outras Ofertas Educacionais'
    This variable covers the following types of education service:
    • Atendimento Educacional Especializado
    • Atividade Complementar

I'm not entirely sure what these categories mean tbh. In any case, I believe the column name should be education_level_others to make it clear that this is a complement to the column education_level.

  • conveniada_governo = 'Conveniada Poder Público'

Yes, this means a private institution that receives government funds. I like your suggestion to translate this column as private_government_partnership.

  • regulated_education_counsel = 'Regulamentação pelo Conselho de Educação'

This brings info on whether the school is formally overseen by a board of the city education council. So yes, it involves a formal document. The response categories are yes, no and in progress. Thanks for the heads up on the typo.

  • urban = 'Localização'

The response categories are Urbano and Rural, so we can convert it into a dummy. This way we could use location_type to name the next column, Localidade Diferenciada.
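
For anyone following along, the renames and the dummy conversion discussed in this comment could be sketched roughly as below. This is only an illustration in base R on a toy data frame: the values are invented placeholders, the new names are the ones proposed in this thread (not necessarily what ends up in geobr), and the `schools` and `new_names` objects are hypothetical.

```r
# Toy data using column names from the first proposal.
# All values are invented placeholders, not real INEP records.
schools <- data.frame(
  education_others   = c("A", NA, "B"),
  conveniada_governo = c("yes", "no", "yes"),
  urban              = c("Urbano", "Rural", "Urbano"),
  location           = c("a", "b", "c"),
  stringsAsFactors   = FALSE
)

# Old name -> new name, following the suggestions in this comment.
new_names <- c(
  education_others   = "education_level_others",
  conveniada_governo = "private_government_partnership",
  location           = "location_type"
)

# Rename only the columns listed above; fail loudly if one is missing.
idx <- match(names(new_names), names(schools))
stopifnot(!anyNA(idx))
names(schools)[idx] <- unname(new_names)

# Recode the two-category text column into a logical dummy.
# Any value other than "Urbano" or "Rural" becomes NA instead of
# being silently coded as FALSE.
schools$urban <- ifelse(
  schools$urban == "Urbano", TRUE,
  ifelse(schools$urban == "Rural", FALSE, NA)
)
```

Keeping unexpected categories as NA makes it easier to spot values in the raw file that the recoding did not anticipate.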

@schmert

schmert commented Oct 10, 2020 via email

@rafapereirabr
Member Author

Thanks again, Carl. The data set has been processed and is available on our server. My next push will add the new read_schools() function to the dev version of geobr and close this issue. I've also finally included you as a contributor to geobr in the package DESCRIPTION.

# update the dev version with latest features
utils::remove.packages('geobr')
devtools::install_github("ipeaGIT/geobr", subdir = "r-package")

library(geobr)
sc <- read_schools()

@JoaoCarabetta
Collaborator

Add it to Python

@lgabs

lgabs commented Feb 8, 2021

Adding data for schools was a really nice idea! @JoaoCarabetta, I'm new to this package, but from what I've seen here, read_schools is so far only available in the R package. Is that right?

@JoaoCarabetta
Collaborator

Hi @lgabs, we haven't implemented it in the Python version yet, but we are working on it. If you want, you can open a PR to add it.
