Add pylint #884

iprafols · 2022-05-03T14:22:33Z

This PR adds pylint as an automatic check (only for delta_extraction)
In order to pass the test, some basic linting is performed. Before the PR is ready for review, some bits and pieces must be reorganized in order to avoid code duplication

…-pylint

…x and DesiTile to DesiData

…data

…ectedFlux

iprafols · 2022-05-04T08:57:14Z

Duplicated code is now removed. Overall I think the code is much simpler now. Before the merge is done, I need to update the data model with those changes, but this does not affect the code and it can be reviewed

Waelthus

Sounds good to me! Some comments attached.

README.md

py/picca/delta_extraction/data_catalogues/desi_data.py

Waelthus · 2022-05-04T15:32:28Z

py/picca/delta_extraction/data_catalogues/desi_healpix.py

-from picca.delta_extraction.astronomical_objects.forest import Forest
-from picca.delta_extraction.data_catalogues.desi_data import DesiData, defaults, accepted_options
+from picca.delta_extraction.data_catalogues.desi_data import DesiData
+from picca.delta_extraction.data_catalogues.desi_data import (# pylint: disable=unused-import


the split is just for line length? And the pragma is for pylint not to complain there because the import is multi-line?

The split is because one is used in the module and the others are simply loaded (they are necessary imports but are not used inside the module). The pragma is for pylint to not complain about the fact the two imported variables are not used inside the module

that's because they are only used for child classes? Or why would we elsewise import them here if they aren't used?

They are (or could be) used by child classes. In any case, they are also used by Config in order to enforce correct parameters in the config file and give early errors

py/picca/delta_extraction/data_catalogues/desi_healpix.py

py/picca/delta_extraction/expected_fluxes/dr16_expected_flux.py

Waelthus · 2022-05-04T15:47:30Z

py/picca/delta_extraction/expected_fluxes/dr16_expected_flux.py

+            weights = self.get_continuum_weights(forest, mean_expected_flux)
+            # this is needed as the weights from get_continuum_weights are
+            # divided by the continuum model squared, in this case mean_expected_flux
+            # TODO: check that we indeed need this or if the weights without it


Ok, so this does give the same result as before, right? It might be reasonable to have some kind of Fbar weights as an alternative, I guess... But this should be optional, not replacing the current setup

Yes, this is to reproduce the same results as before, but it'd be nice to always use the same weights and not slightly different versions here and there

While that would be nice, depending on the type of downstream analysis that can give unexpected effects. For Pk1d for instance, using the standard weights would be relatively simple to do when estimating the power spectrum from real space data, but when doing straight-forward FFTs each pixel of a spectrum will have the same weight in the output P(k), i.e. we cannot account for the different weighting of the pixels in a straightforward way.
This does not mean we couldn't in principle do the cont fitting with some weighting scheme, but effects this has need to be studied.
Agreed that within a run, i.e. after specifying which kind of weights one wants, things should be treated self-consistently.

Maybe we should open an issue to discuss how to proceed with this

I think it's fine as long as we keep the current major modes (i.e. standard or constant weights, not sure if anyone uses ivar-only weights) in working shape which allows us to do analyses similar to previous works.

We could have a discussion if other weightings might be useful, then implement those, and then if they have been tested and lead to improvement I'd be fine with dropping previous modes. In principle, already the change in binning from log to lin can be seen as a λ-dependent change in weighting, at least for e.g. constant weights.

Waelthus · 2022-05-04T15:53:53Z

pylintrc

@@ -11,7 +11,7 @@ ignore=CVS

 # Add files or directories matching the regex patterns to the blacklist. The


should we maybe relax a little on e..g. line lengths? here and for yapf? sometimes it's just simpler to allow slightly longer lines and not have as much of trouble with the wrapped lines which can become annoying (e.g. for imports, comments, deeply nested loops)

There is some margin that we can relax, but then lines that are too long are a pain to read as you have to keep moving the code left and right

If I recall properly, though, the values we are using are already a bit more relaxed than the default values for pylint

Ah, didn't read the whole pylintrc. We're using 100 chars, which is ok (the default is ~80 which is a little narrow often enough and going way larger leads to trouble when having 2 files next to each other e.g. in a diff). And indeed if I parse the regexp correctly we're allowing comments and web-addresses to go beyond the line limit (and could add other stuff to there if we figure out there's issues).

Is yapf using the same setup that the linter uses? Or will that produce the default settings or has a seperate config? Iirc there was some way to unify all of this, but I don't remember right now how it worked...

I guess yapf is using google coding standards with their defaults (as I have not created a config file for that)

ok, let's just keep it like that for the moment.

Waelthus · 2022-05-04T15:55:25Z

Please note that this will partially clash with #887, so might be good to have a look which merge order might be less troublesome

iprafols · 2022-05-05T07:28:02Z

I'll review #887 before merging this one. I'll check locally which is the easiest merge order

iprafols added 22 commits September 23, 2020 16:23

Added pylint tests

f9ac906

Merge branch 'master' into add-pylint

26904e7

Merge branch 'master' into add-pylint

e420143

Merge branch 'delta_extraction' into add-pylint

1278f2a

modified pylint workflow to work on delta_extraction only

578266a

fix to pylint workflow

90640e0

restored pylint workflow

00d7934

trying new configuration

f960527

trying new config

52a4098

test to figure out path in github actions

a9d255a

restored pylint test

8d4b6ee

removed ending /

b1f17a5

added picca install

cdc4f39

added install of missing dependencies

085288e

removed extra 'run'

0c97082

changed path to module

6d00753

changed path to module

195abf1

Merge branch 'add-pylint' of https://github.com/igmhub/picca into add…

84a32d4

…-pylint

Merge branch 'master' into add-pylint

8cdd88f

fixed delta extraction path

c566dbf

Merge branch 'master' into add-pylint

0d3bff6

basic linting

6961620

iprafols marked this pull request as draft May 3, 2022 14:22

iprafols added 7 commits May 3, 2022 16:26

Merge branch 'master' into add-pylint

36e3596

more basic linting on things from master

404e885

duplicated code responsible for formatting data moved from DesiHealpi…

6cc55de

…x and DesiTile to DesiData

updated README

72798d8

simplified function to minimize code repetitionclear

10e917a

fixed bug

fd5d17c

use function get_continuum_weights to avoid code repetition

4d42096

iprafols added 9 commits May 3, 2022 18:05

added TODOs

73b558a

decreased code repetition

29b1a00

fixed bug in testing

2566a25

healpix computation moved to quasar loading

4208230

basic linting

26991fe

weights computed using get_continuum_weights

1f9f5c6

moved variable inside loop as it was not used outside

d4f2e21

added a new function get_filename to encapsulate differences in read_…

5be5886

…data

temporarily increasing min-similarity-lines to 9 (see todo in Dr16Exp…

02b9a61

…ectedFlux

iprafols marked this pull request as ready for review May 4, 2022 08:55

iprafols requested a review from Waelthus May 4, 2022 08:55

iprafols added 4 commits May 4, 2022 11:26

updated pylint test to also be run on PR

d502550

updated data model

68647d9

updated docstrings, very minor changes

8bee5f2

removed trailing withespaces

7b9bee6

Waelthus approved these changes May 4, 2022

View reviewed changes

cleanup

727500a

Waelthus merged commit 6aa9cc7 into master May 5, 2022

iprafols deleted the add-pylint branch May 5, 2022 10:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add pylint #884

Add pylint #884

iprafols commented May 3, 2022

iprafols commented May 4, 2022

Waelthus left a comment

Waelthus May 4, 2022

iprafols May 4, 2022

Waelthus May 4, 2022

iprafols May 5, 2022

Waelthus May 4, 2022

iprafols May 4, 2022

Waelthus May 5, 2022

iprafols May 5, 2022

Waelthus May 5, 2022

Waelthus May 4, 2022

iprafols May 4, 2022

iprafols May 4, 2022

Waelthus May 5, 2022

iprafols May 5, 2022

Waelthus May 5, 2022

Waelthus commented May 4, 2022

iprafols commented May 5, 2022

		@@ -11,7 +11,7 @@ ignore=CVS

		# Add files or directories matching the regex patterns to the blacklist. The

Add pylint #884

Add pylint #884

Conversation

iprafols commented May 3, 2022

iprafols commented May 4, 2022

Waelthus left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Waelthus commented May 4, 2022

iprafols commented May 5, 2022