Feature #9992

Enhance import matching behaviors: use cascading logic to improve matching during imports

Added by Dan Gillean about 4 years ago. Updated about 3 years ago.

Status: Verified
Start date: 06/10/2016
Priority: Medium
Due date:
Assignee: Dan Gillean
% Done: 0%
Category: Import/Export
Estimated time: 16.00 hours
Target version: Release 2.4.0
Google Code Legacy ID:
Tested version:
Sponsored: Yes
Requires documentation:

Description

This issue will build upon some of the CSV and XML import matching behaviors previously sponsored for inclusion in 2.4.

When checking for existing matches (for updating, etc.) during import, AtoM will perform the following checks:

CSV

  • Try to match first on legacyId (CSV)
  • Also check for a match on keymap.source_name
  • If no match is found, check for a match on identifier, title, and repository name
  • Otherwise, consider the record unmatched (skip it or create a new record, depending on other settings)
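
The CSV checks above can be sketched roughly as follows. This is a minimal illustration in Python, not AtoM's actual code: the function name, the dict-based `keymap` (keyed by source name and legacyId), the list of description dicts, and the `repository` row key are all hypothetical stand-ins for the real database lookups. It also assumes that the fallback check requires all three fields to be present.

```python
from typing import Optional


def match_csv_row(row: dict, source_name: str, keymap: dict,
                  descriptions: list) -> Optional[int]:
    """Return the id of the matching description, or None if unmatched.

    Hypothetical sketch: `keymap` maps (source_name, legacy_id) to a
    description id, and `descriptions` is a list of dicts.
    """
    # Checks 1 and 2: look for a keymap record with this source name
    # and legacyId.
    legacy_id = row.get("legacyId")
    if legacy_id:
        target_id = keymap.get((source_name, legacy_id))
        if target_id is not None:
            return target_id

    # Check 3: fall back to identifier, title, and repository name.
    # Assumption: all three values must be present and must all match.
    wanted = (row.get("identifier"), row.get("title"), row.get("repository"))
    if all(wanted):
        for desc in descriptions:
            found = (desc["identifier"], desc["title"], desc["repository"])
            if found == wanted:
                return desc["id"]

    # Check 4: unmatched; the caller decides whether to skip the row
    # or create a new record.
    return None
```

Returning None for the unmatched case keeps the skip-or-create decision with the caller, in line with the "depending on other settings" wording above.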

Note:

Matching on legacyId relies on 'sourceName' as the key to find the correct keymap record containing the legacyId in question. 'sourceName' can come from two places:

1) by default it is set from the filename of the import file
2) 'sourceName' can be set on the CLI using the '--source-name=...' option

There is no option in the user interface to provide a custom 'sourceName'. However, as per the work done in #10193, users can enter edit mode to see the source name that was used during the import in the admin area.
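
As a rough illustration of the two sources of 'sourceName' described in this note, the sketch below picks an explicit value when one is given and otherwise falls back to the import file's name. The function name and parameters are hypothetical, and using the base filename including its extension is an assumption; the ticket only says the default comes from the filename.

```python
import os
from typing import Optional


def resolve_source_name(import_path: str,
                        cli_source_name: Optional[str] = None) -> str:
    """Pick the source name used to look up keymap records (hypothetical)."""
    # An explicit value, such as one passed via a CLI option, wins.
    if cli_source_name:
        return cli_source_name
    # Default: derive the name from the import file.
    # Assumption: the base filename, extension included.
    return os.path.basename(import_path)
```

Because later updates are matched against the stored source name, re-importing a renamed file without an explicit source name would, under this reading, not find the earlier keymap records.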

EAD

  • Attempt to match first based on identifier, title, and repo name
  • If no match is found, try to find a keymap record using:
      • EAD url field from the XML file
      • uploaded import filename

Note that this keymap matching logic applies only to Information Object loading.
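
The EAD order of checks differs from the CSV one: the field-based match comes first and the keymap lookup is the fallback. A minimal sketch, assuming a hypothetical `keymap` dict keyed by source name and a list of description dicts (neither reflects AtoM's actual schema or code), and assuming the EAD URL is tried before the filename, as in the order listed above:

```python
from typing import Optional


def match_ead_description(candidate: dict, ead_url: Optional[str],
                          filename: Optional[str], keymap: dict,
                          descriptions: list) -> Optional[int]:
    """Return the id of the matching description, or None (hypothetical)."""
    # Check 1: identifier, title, and repository name must all match.
    wanted = (candidate.get("identifier"), candidate.get("title"),
              candidate.get("repository"))
    if all(wanted):
        for desc in descriptions:
            found = (desc["identifier"], desc["title"], desc["repository"])
            if found == wanted:
                return desc["id"]

    # Check 2: look for a keymap record, trying the EAD URL from the XML
    # file first and then the uploaded filename (assumed order).
    for source_name in (ead_url, filename):
        if source_name and source_name in keymap:
            return keymap[source_name]

    return None
```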


Related issues

Related to Access to Memory (AtoM) - Feature #10016: Enhance import matching behaviors: add and enhance matchi... Verified 06/10/2016
Related to Access to Memory (AtoM) - Feature #10017: Enhance import matching behaviors: add and enhance match... Verified 10/26/2016 10/26/2016
Related to Access to Memory (AtoM) - Feature #10144: Enhance import matching behaviors: Add ability to limit ... Verified 06/10/2016
Related to Access to Memory (AtoM) - Feature #10626: Added CSV information object import parentId lookup fallback Verified 11/30/2016

History

#2 Updated by Dan Gillean about 4 years ago

  • Estimated time changed from 12.00 to 16.00

#3 Updated by Dan Gillean about 4 years ago

  • Related to Feature #10016: Enhance import matching behaviors: add and enhance matching logic for imports of authority records added

#4 Updated by Dan Gillean about 4 years ago

  • Related to Feature #10017: Enhance import matching behaviors: add and enhance matching logic for imports of repository records added

#5 Updated by Dan Gillean almost 4 years ago

  • Related to Feature #10144: Enhance import matching behaviors: Add ability to limit matching to a specific repository or top-level description added

#6 Updated by Dan Gillean almost 4 years ago

  • Target version changed from Release 2.4.0 to Release 2.5.0

#7 Updated by Dan Gillean over 3 years ago

  • Target version changed from Release 2.5.0 to Release 2.4.0

#8 Updated by Dan Gillean over 3 years ago

  • Related to Feature #10626: Added CSV information object import parentId lookup fallback added

#9 Updated by Dan Gillean over 3 years ago

  • Description updated (diff)
  • Status changed from New to QA/Review

Updating the description with further information based on the internal ticket from the sponsored development project.

#10 Updated by Sara Allain about 3 years ago

  • Assignee set to Kelly Stewart

#11 Updated by Dan Gillean about 3 years ago

  • Assignee changed from Kelly Stewart to Dan Gillean

Documentation for the CSV import was added as part of https://github.com/artefactual/atom-docs/commit/94ad54befffd1d803bd78793a99ced61474fa764

Still need to test and document the XML import portion of this work.

#12 Updated by Dan Gillean about 3 years ago

  • Status changed from QA/Review to Verified
  • Requires documentation deleted (Yes)
