Bug #5508

EAD round tripping adds HTML escape characters into the original data

Added by Mike Gale over 8 years ago. Updated over 8 years ago.

Status: New
Start date: 08/26/2013
Priority: Medium
Due date:
Assignee: Mike Gale
% Done: 0%
Category: EAD
Target version: Release 1.4.0
Google Code Legacy ID:
Tested version:
Sponsored: No
Requires documentation:

Description

Currently, AtoM uses the Symfony esc_specialchars() to transform characters such as '<' and '>' in the original metadata to '&lt;' and '&gt;' on export, to avoid some limitations of the XML file format (such characters are not allowed unescaped in user data in XML/HTML).

However, when the EAD XML file is imported again, AtoM does not convert these escape sequences back to their original characters, so the data is inserted into the database with the escape codes still in it. At the very least, we should unescape this data on import so that the original data is not modified.
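To illustrate the round trip being asked for, here is a minimal sketch in Python rather than AtoM's PHP. The standard-library `html.escape()` and `html.unescape()` stand in for `esc_specialchars()` and its missing inverse; the function names `export_field` and `import_field` are hypothetical and do not correspond to anything in the AtoM code base.

```python
import html


def export_field(value: str) -> str:
    """Escape special characters before writing a value into the XML export.

    Stand-in for the escaping step described above (assumption: the real
    export escapes at least '<', '>', '&' and quotes).
    """
    return html.escape(value, quote=True)


def import_field(value: str) -> str:
    """Reverse the export escaping so the stored value matches the original."""
    return html.unescape(value)


original = 'Minutes <draft> & "final" copies'
exported = export_field(original)

# The exported form carries escape sequences instead of the raw characters.
assert "&lt;draft&gt;" in exported

# Importing must restore the original data exactly.
assert import_field(exported) == original
```

Without the `import_field` step, the value stored after re-import would be the escaped string in `exported`, which is the behaviour this bug describes.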

History

#1 Updated by Mike Gale over 8 years ago

The escape characters for the <> keys showed up as well... arrows in here because of how Redmine handles HTML :)

http://www.theukwebdesigncompany.com/articles/entity-escape-characters.php

See above for details.

#2 Updated by Jesús García Crespo over 8 years ago

Mike, is this affecting 1.4 too?

#3 Updated by Jesús García Crespo over 8 years ago

  • Target version changed from Release 2.0.0 to Release 1.4.0
