Add config for returning dummy object in the NRTMv3 response #924

Justin-APNIC · 2024-03-15T00:36:40Z

Add new configs in IRRD for returning the dummy object in NRTMv3 response

Example config

                nrtm_response_dummy_object_class:
                  - person
                  - role
                nrtm_response_dummy_attributes:
                  person: Dummy name for %s
                  role: Dummy role for %s
                  address: Dummy address for %s
                  phone: '+31205354444'
                  e-mail: [email protected]
                  upd-to: [email protected]
                  descr: Dummy description for %s
                nrtm_response_dummy_remarks: |
                  ****************************
                  * THIS OBJECT IS NOT VALID
                  * Please note that all personal data has been removed from this object.
                  * To view the original object, please query the APNIC Database at:
                  * http://www.apnic.net/whois
                  ****************************

Example response

role:           Dummy role for AT1599-AP
address:        Dummy address for AT1599-AP
country:        ZZ
phone:          +31205354444
e-mail:         [email protected]
admin-c:        TPLA28-AP
tech-c:         TPLA28-AP
nic-hdl:        AT1599-AP
mnt-by:         APNIC-ABUSE
last-modified:  2024-03-14T23:45:51Z
source:         APNIC
remarks:        ****************************
remarks:        * THIS OBJECT IS NOT VALID
remarks:        * Please note that all personal data has been removed from this object.
remarks:        * To view the original object, please query the APNIC Database at:
remarks:        * http://www.apnic.net/whois
remarks:        ****************************

mxsasha · 2024-03-15T09:06:40Z

Nice! This will take a bit of time to review, but my first thought is that the logic to generate dummified object text should be separated to make it easy to apply to RPSL full exports and NRTMv4 as well. Probably in the same place as remove_auth_hashes(). Makes tests a bit simpler too.

(Initially I thought about putting it on RPSLObject, but we intentionally don't have that in this situation, just the text from the database.)

Justin-APNIC · 2024-03-21T06:32:54Z

Nice! This will take a bit of time to review, but my first thought is that the logic to generate dummified object text should be separated to make it easy to apply to RPSL full exports and NRTMv4 as well. Probably in the same place as remove_auth_hashes(). Makes tests a bit simpler too.

Yes, the function dummy_rpsl_object() does exist in the text.py where the remove_auth_hashes() function is. The codes in nrtm_generator are just for retrieving the params that the dummy function needs from the configuration file.

Justin-APNIC · 2024-03-28T04:43:41Z

@mxsasha
I have a question regarding the primary key in the RPSL object. It appears that we are converting the primary key to uppercase all the time. Could we add any configurations or functions in the rpsl/parser.py to preserve the original primary key case? I've reviewed the code, and it seems that IRRD parses attributes in different object classes differently. Currently, I don't have a straightforward solution, so I'd appreciate your input on this matter.

The reason I want to keep the original value is that I want to return the original pk instead of the uppercase one in the dummy NRTM response.
For example, I want to have IRT-Youminet-CN in the dummy address instead of IRT-YOUMINET-CN

irt:            IRT-Youminet-CN
address:        Dummy address for IRT-YOUMINET-CN

mxsasha · 2024-04-08T15:26:19Z

Nice! This will take a bit of time to review, but my first thought is that the logic to generate dummified object text should be separated to make it easy to apply to RPSL full exports and NRTMv4 as well. Probably in the same place as remove_auth_hashes(). Makes tests a bit simpler too.

Yes, the function dummy_rpsl_object() does exist in the text.py where the remove_auth_hashes() function is. The codes in nrtm_generator are just for retrieving the params that the dummy function needs from the configuration file.

What I meant is: to properly support dummy mirroring, dummifying needs to happen in the NRTM v3 generator, the source export (if unfiltered is not set, currently called "remove_auth_hashes" I think), and the NRTMv4 server. With as little duplication as possible. So the call from the NRTMv3 generator should be something like: text = dummify_object_text(source, object_class, text).

@mxsasha I have a question regarding the primary key in the RPSL object. It appears that we are converting the primary key to uppercase all the time. Could we add any configurations or functions in the rpsl/parser.py to preserve the original primary key case? I've reviewed the code, and it seems that IRRD parses attributes in different object classes differently. Currently, I don't have a straightforward solution, so I'd appreciate your input on this matter.

The only place where we retain the original case for the PK is in the object text, so we'd have to re-parse the whole object. PKs are case insensitive, so the indexed key is normalised. But I don't follow why you need it, your current implementation in text.py seems the most straightforward, without needing the PK separately.

Justin-APNIC · 2024-04-17T04:32:25Z

Nice! This will take a bit of time to review, but my first thought is that the logic to generate dummified object text should be separated to make it easy to apply to RPSL full exports and NRTMv4 as well. Probably in the same place as remove_auth_hashes(). Makes tests a bit simpler too.

Yes, the function dummy_rpsl_object() does exist in the text.py where the remove_auth_hashes() function is. The codes in nrtm_generator are just for retrieving the params that the dummy function needs from the configuration file.

What I meant is: to properly support dummy mirroring, dummifying needs to happen in the NRTM v3 generator, the source export (if unfiltered is not set, currently called "remove_auth_hashes" I think), and the NRTMv4 server. With as little duplication as possible. So the call from the NRTMv3 generator should be something like: text = dummify_object_text(source, object_class, text).

Ok

@mxsasha I have a question regarding the primary key in the RPSL object. It appears that we are converting the primary key to uppercase all the time. Could we add any configurations or functions in the rpsl/parser.py to preserve the original primary key case? I've reviewed the code, and it seems that IRRD parses attributes in different object classes differently. Currently, I don't have a straightforward solution, so I'd appreciate your input on this matter.

The only place where we retain the original case for the PK is in the object text, so we'd have to re-parse the whole object. PKs are case insensitive, so the indexed key is normalised. But I don't follow why you need it, your current implementation in text.py seems the most straightforward, without needing the PK separately.

My implementation in text.py needs a pk to be provided. As I mentioned above, If my config for the dummy attribute is like
address: Dummy address for %s
The %s will be replaced by the pk. However, the current codes will convert it to the uppercase if I call pk() function on the whole object or retrieve the pk from db directly, so the final dummy attribute will be
address: Dummy address for IRT-YOUMINET-CN instead of address: Dummy address for IRT-Youminet-CN.

I can't find an easy way to find the original pk from the whole object.

mxsasha · 2024-05-28T16:44:54Z

My implementation in text.py needs a pk to be provided. As I mentioned above, If my config for the dummy attribute is like address: Dummy address for %s The %s will be replaced by the pk. However, the current codes will convert it to the uppercase if I call pk() function on the whole object or retrieve the pk from db directly, so the final dummy attribute will be address: Dummy address for IRT-YOUMINET-CN instead of address: Dummy address for IRT-Youminet-CN.

I can't find an easy way to find the original pk from the whole object.

There is none. The data simply isn't extracted, as PKs are case insensitive in every other usage, and therefore normalised to upper case during parsing of the initial object. Is it such an obstacle for you? Does the PK need to be repeated in the dummy text? After all, anyone who looks at the object will see the true PK already anyways.

mxsasha

Left some notes inline. We also need to extend this to the source export runner, and the NRTM4 server. In those cases, we don't have client IPs, so our only option is to always enable it if configured.

I also want to tweak the docs and settings names a bit, later.

irrd/conf/__init__.py

irrd/mirroring/nrtm_generator.py

irrd/utils/text.py

mxsasha · 2024-05-28T17:22:53Z

irrd/utils/text.py

+            if dummy_attributes:
+                if get_setting(f"sources.{source}.nrtm_response_dummy_remarks"):
+                    dummy_remarks = textwrap.indent(
+                        get_setting(f"sources.{source}.nrtm_response_dummy_remarks"), "remarks:".ljust(16)


Where does that 16 come from? From somewhere in irrd.rpsl right? Feels like we should make that a constant then.

yes, it comes from RPSL_ATTRIBUTE_TEXT_WIDTH which has been used in different places. Updated to use the constant, thanks

irrd/utils/text.py

Justin-APNIC · 2024-05-29T00:28:31Z

My implementation in text.py needs a pk to be provided. As I mentioned above, If my config for the dummy attribute is like address: Dummy address for %s The %s will be replaced by the pk. However, the current codes will convert it to the uppercase if I call pk() function on the whole object or retrieve the pk from db directly, so the final dummy attribute will be address: Dummy address for IRT-YOUMINET-CN instead of address: Dummy address for IRT-Youminet-CN.
I can't find an easy way to find the original pk from the whole object.

There is none. The data simply isn't extracted, as PKs are case insensitive in every other usage, and therefore normalised to upper case during parsing of the initial object. Is it such an obstacle for you? Does the PK need to be repeated in the dummy text? After all, anyone who looks at the object will see the true PK already anyways.

Nvm, thanks.

Justin-APNIC · 2024-05-29T01:40:25Z

Left some notes inline. We also need to extend this to the source export runner, and the NRTM4 server. In those cases, we don't have client IPs, so our only option is to always enable it if configured.

Ok, we need this because we have some internal clients for accessing the IRRD NRTM3 server, so it needs the original data.

I also want to tweak the docs and settings names a bit, later.

No problem, thanks for looking into it.

mxsasha force-pushed the add-config-for-making-dummy-nrtm-response branch from b689ff3 to fad6fe5 Compare March 15, 2024 09:02

mxsasha force-pushed the add-config-for-making-dummy-nrtm-response branch from 22d78e8 to 880bdc8 Compare April 8, 2024 15:26

Justin-APNIC added 13 commits May 8, 2024 14:29

Add config for returning dummy object in the NRTMv3 response

0b08544

lint

b3a805a

Fix failed tests

7789925

Lint

abc2f26

Add more tests

2311439

Lint

fa44bc0

Use the pk stored in the db

2ecfada

Lint

dfd5b81

Codes refactoring

083ff8e

Lint and fix test coverage error

50f8f45

Add config for keeping original data in NRTM stream for specific source

de1b869

Fix spelling error and failed test

aefdf74

Add new word to spelling word list

3f46413

Justin-APNIC force-pushed the add-config-for-making-dummy-nrtm-response branch from c52cafe to 3f46413 Compare May 8, 2024 04:29

mxsasha reviewed May 28, 2024

View reviewed changes

Justin-APNIC and others added 2 commits May 29, 2024 09:19

Merge branch 'main' into add-config-for-making-dummy-nrtm-response

4fe6564

Codes refactoring

7e64dbd

Justin-APNIC added 3 commits May 29, 2024 12:01

Delete local test reports

27f7dd0

Overwrite the auth attribute if there is a dummy one

1629e4b

lint

498431b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add config for returning dummy object in the NRTMv3 response #924

Add config for returning dummy object in the NRTMv3 response #924

Justin-APNIC commented Mar 15, 2024

mxsasha commented Mar 15, 2024

Justin-APNIC commented Mar 21, 2024

Justin-APNIC commented Mar 28, 2024

mxsasha commented Apr 8, 2024

Justin-APNIC commented Apr 17, 2024

mxsasha commented May 28, 2024

mxsasha left a comment

mxsasha May 28, 2024

Justin-APNIC May 29, 2024

Justin-APNIC commented May 29, 2024

Justin-APNIC commented May 29, 2024 •

edited

Loading

Add config for returning dummy object in the NRTMv3 response #924

Are you sure you want to change the base?

Add config for returning dummy object in the NRTMv3 response #924

Conversation

Justin-APNIC commented Mar 15, 2024

mxsasha commented Mar 15, 2024

Justin-APNIC commented Mar 21, 2024

Justin-APNIC commented Mar 28, 2024

mxsasha commented Apr 8, 2024

Justin-APNIC commented Apr 17, 2024

mxsasha commented May 28, 2024

mxsasha left a comment

Choose a reason for hiding this comment

mxsasha May 28, 2024

Choose a reason for hiding this comment

Justin-APNIC May 29, 2024

Choose a reason for hiding this comment

Justin-APNIC commented May 29, 2024

Justin-APNIC commented May 29, 2024 • edited Loading

Justin-APNIC commented May 29, 2024 •

edited

Loading