Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

IPT contacts sync issues #116

Open
3 tasks done
vjrj opened this issue Apr 25, 2022 · 7 comments · Fixed by #236
Open
3 tasks done

IPT contacts sync issues #116

vjrj opened this issue Apr 25, 2022 · 7 comments · Fixed by #236
Assignees
Milestone

Comments

@vjrj
Copy link
Contributor

vjrj commented Apr 25, 2022

Currently I see some issues with the contacts sync of IPT resources:

image

  • so existing contacts are not updated.

Are these issues intentional or PRs are welcome :-) ?

@djtfmartin
Copy link
Member

PRs welcome :)

@adam-collins
Copy link
Contributor

Can this be closed?

@vjrj
Copy link
Contributor Author

vjrj commented Mar 26, 2024

I missed this. It's still unresolved

@vjrj vjrj self-assigned this Apr 9, 2024
vjrj added a commit that referenced this issue Apr 15, 2024
@vjrj vjrj mentioned this issue Apr 15, 2024
@vjrj
Copy link
Contributor Author

vjrj commented Oct 3, 2024

A comment just to show the effect of this issue:
https://collections-test.ala.org.au/contact/show/1029
and why #236 should be merged IMHO.

@adam-collins
Copy link
Contributor

adam-collins commented Oct 4, 2024

Before forwarding to the data team for review, is my following summary of the changes to the IPT service correct?

  1. eml.dataset.contact, when present, is used as the primary contact. This is your data managers suggestion.
  2. Add the contacts found in eml.dataset.associatedParty to the list of contacts. This was not previously done.
  3. Add additional logic to manage the creation of duplicate contacts in the global contacts table. One implication of this is that when a contact (unique by email) has a new first or last name, the first and last name will be updated.
  • First looks for a contact with the same email (electronicMailAddress)
  • When there is no email, looks for a contact with the same first and last name (individualName.givenName, individualName.surName)
  • When there is no email or first name, looks for a contact with only the same last name.
  • Otherwise, create a new contact.
  1. Add additional logic to prevent the creation of duplicate contacts for a data provider to prevent multiple contacts having the same first and last name.

@vjrj
Copy link
Contributor Author

vjrj commented Oct 4, 2024

Although this was detected using the IPT service, the contacts process code of an eml is something common and used in other parts of the collectory.

About primary contacts: IMHO It's something that we add to the db (for some reason) and I maintain, but it's not used in the UI. So I wouldn't worry too much.

More than the case where the first contact without email is incorrect associated with all the drs of other contacts without email: https://collections-test.ala.org.au/contact/show/1029 this PR is tries to follow the EML specification and put more attention on the contacts part, something that from my point of view was not totally developed in our side.

As a result, if you can compare how this contacts part is processed in our datasets and compare with how GBIF does the same process of the same dataset, you'll find that currently our contacts:

  • are not updated (missing names if we processed a contact initially only with email)
  • or not well processed (missing contacts without email)
  • or duplicated (because we don't compare well)
  • or other missing contacts (associatedParty)
  • or not well associated (a dataset with a contact without email is associated incorrectly to the first contact without email in the db).

and #236 solves all these issues and our contacts part is in this case similar to GBIF one.

@adam-collins
Copy link
Contributor

Merging the PR closed the issue, reopening.

@adam-collins adam-collins reopened this Oct 24, 2024
@adam-collins adam-collins added this to the 5.2.0 milestone Oct 24, 2024
vjrj added a commit to living-atlases/collectory that referenced this issue Oct 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants