Bug #5870

<bioghist> data dropped during import

Added by Creighton Barrett over 8 years ago. Updated over 8 years ago.

Status:VerifiedStart date:10/25/2013
Priority:CriticalDue date:
Assignee:José Raddaoui Marín% Done:

100%

Category:Import/Export
Target version:Release 1.4.0
Google Code Legacy ID: Tested version:
Sponsored:No Requires documentation:

Description

The data in <bioghist> does not appear to be copied from the EAD XML into the ISAAR record. In 1.x I think the import mapped the <bioghist> data in the <archdesc><did> area into the EAC record, but in 2.0 I can't find it anywhere.

ms-2-18_atom.xml Magnifier - File for testing (5.18 KB) Creighton Barrett, 10/25/2013 12:51 PM

History

#1 Updated by Tim Hutchinson over 8 years ago

How are you encoding <bioghist>? I'm not seeing this with either the command line or UI import, using <archdesc><bioghist><p>

#2 Updated by Creighton Barrett over 8 years ago

Here's an example:

  <archdesc level="file">
      <did>
         <unittitle>Newfoundland Geological Survey notebooks of R.L. Cunningham</unittitle>
         <unitid>MS-2-18</unitid>
         <repository>
            <corpname>Dalhousie University Archives</corpname>
         </repository>
         <langmaterial>
            <language langcode="eng"/>
         </langmaterial>
         <unitid>MS-2-18, SF Box 13, Folder 14</unitid>
         <container id="cid506001" type="Box-folder" label="Text">SF Box 13, Folder 14</container>
         <physdesc>
            <extent>4 cm of textual records</extent>
            <extent>(1 folder)</extent>
         </physdesc>
         <unitdate normal="1938/1938">1938</unitdate>
         <abstract id="ref1" label="Abstract">Robert Leonard Cunningham graduated from Dalhousie University in 1936 with a B.Sc degree and was employed by the Newfoundland Geological Survey in 1938. File consists of five notebooks from R.L. Cunningham's employment with the Newfoundland Geological Survey in 1938.</abstract>
         <langmaterial id="ref6" label="Language of Materials">English.</langmaterial>
         <origination label="creator">
            <persname source="local">Cunningham, Robert Leonard, c.1915-1994</persname>
         </origination>
      </did>
      <accruals id="ref7">
         <p>Further accruals are not expected.</p>
      </accruals>
      <scopecontent id="ref4">
         <p>File consists of five notebooks kept by R.L. Cunningham during his employment with the Newfoundland Geological Survey in 1938.</p>
      </scopecontent>
      <bioghist id="ref3">
         <p>Robert Leonard Cunningham graduated from Dalhousie University in 1936 with a B.Sc degree. He was employed by the Newfoundland Geological Survey in 1938. Cunningham passed away in 1994.</p>
      </bioghist>

And here it is after being imported into 2.0:

http://findingaids.library.dal.ca/index.php/newfoundland-geological-survey-notebooks-of-r-l-cunningham

The EAD validates in Oxygen and imported correctly into 1.x.

#3 Updated by Tim Hutchinson over 8 years ago

Can you attach the XML file?

#4 Updated by Tim Hutchinson over 8 years ago

Sorry, I should retest with a different file - I think I might be picking up an existing authority record.

#5 Updated by Tim Hutchinson over 8 years ago

Right - my mistake, I am able to reproduce.

The issue seems to be that the import routine expects <bioghist><chronlist>. That's fine for roundtripping (that's how AtoM now exports), but <bioghist><p> should definitely be supported too.

#6 Updated by Creighton Barrett over 8 years ago

Thanks, Tim. I'm attaching the full file in case it still helps.

#7 Updated by Dan Gillean over 8 years ago

  • Target version set to Release 2.0.2

#8 Updated by Jessica Bushey over 8 years ago

  • Assignee changed from Mike Gale to José Raddaoui Marín
  • Priority changed from Medium to High

To add to this issue:

Creator is listed in EAD.XML and after import into AtoM 2.0 appears as the Creator and an authority record is created. BUT, the related biographical history is NOT imported into AtoM.

See below for the sample EAD.XML that is not importing correctly.

<bioghist><p>Dr Wendy Orr was a registered medical practitioner employed as a Medical Officer with the Department of National Health and Population Development at the office of the District Surgeon in Port Elizabeth.</p><p>Dr Orr brought an APPLICATION against the Minister of Law and Order and others to restrain the respondents from assaulting or threatening to assault detainee held in terms of the regulations of the Public Safety Act of 1953 (Emergency Regulations).</p><p>On 25 September 1985 an Interim Order was granted in the South Eastern Cape Local Division restraining the Respondents from further mistreatment of prisoners.</p><p>The Respondents challenged Dr Orr's Locus Standi (i.e. does the person have sufficient legal interest to bring the application).</p><p>On the 4 February 1980 the interim order was extended to 17 June 1986</p><p>The matter was finally settled out of court because the State of Emergency had been lifted. There was no longer any need to pursue the relief sought in the application.</p></bioghist>

Related information on biographical history tag in EAD is here: [[http://www.loc.gov/ead/tglib/elements/bioghist.html]]

#9 Updated by Dan Gillean over 8 years ago

  • Assignee changed from José Raddaoui Marín to Mike Gale
  • Priority changed from High to Medium
  • Target version changed from Release 2.0.2 to Release 2.0.1

#10 Updated by Dan Gillean over 8 years ago

  • Assignee changed from Mike Gale to José Raddaoui Marín
  • Priority changed from Medium to High

#11 Updated by Dan Gillean over 8 years ago

  • Priority changed from High to Critical

#12 Updated by Dan Gillean over 8 years ago

We should do this fix in the 1.x branch, apply to 1.4 and 2.0.1. It's become important for client data migrations too.

#13 Updated by Dan Gillean over 8 years ago

  • Target version changed from Release 2.0.1 to Release 1.4.0

#14 Updated by José Raddaoui Marín over 8 years ago

Hi Dan,

As we spoke on the IRC chat:

  • The <bioghist> nodes without <chronlist> will be added to the creators inside <origination>, making the relation by order.
  • If there isn't a related creator or it already has the history field populated, a new untitled authority record will be created with the <bioghist> content as history, and it will be added as a creator for the description.

I'll add the fix to 2.x when it gets verified. Thanks.

#15 Updated by José Raddaoui Marín over 8 years ago

  • Status changed from New to QA/Review
  • % Done changed from 0 to 100

AtoM|commit: c2b650b4d9b9ae1cf6fa42ee80008d266f977813

#16 Updated by Dan Gillean over 8 years ago

  • Status changed from QA/Review to Verified

SUCCESS! I tested this with the attached example in 1.x, and the bioghist imported correctly. Then I added 1 additional <persname> and a <corpname> to the same <origination> element (for 3 creator names total), and included 4 bioghist elements - the first 3 mapped correctly, and the 4th was attached to an untitled authority record. I then moved the additional <persname> and <corpname> to a second <origination> element and tried again - all 4 still imported correctly. Marking verified.

Note for use
Because of the challenges in passing YAML Xpath data from 2 different areas to the authority record, this fix depends on the order of names in the <origination> element(s) to appear in the same order that the related <bioghist> elements appear. As Radda notes and my testing confirms, if there are more <bioghist> elements than names, the additional biographical histories that do not map 1:1 to a name will be assigned to an untitled authority record linked to the description. If there are more names than biographical histories, the import will still work, assigning the bioghists on a 1:1 basis. This means it's not possible to import 2 names where the first does not have a bioghist, but the second does - the EAD should be edited to reverse this prior to import if the history is to match the name.

Radda, please apply this fix to the 2.x branch too! Thanks.

#17 Updated by Jessica Bushey over 8 years ago

Radda please apply this fix to AtoM 2.x branch ASAP please.

#18 Updated by José Raddaoui Marín over 8 years ago

It's in 2.x now, sorry for the delay.

Also available in: Atom PDF