Correct Unicode handling for in-db ingest
authordbs <dbs@dcc99617-32d9-48b4-a31d-7c20da2025e4>
Wed, 20 Jan 2010 23:15:10 +0000 (23:15 +0000)
committerdbs <dbs@dcc99617-32d9-48b4-a31d-7c20da2025e4>
Wed, 20 Jan 2010 23:15:10 +0000 (23:15 +0000)
commitf761fdf4a31b679b1e7760b35ad91606eeb05b17
treea70b89bdca71f61b258b818560392f30ab4b14d5
parent53700fddb59711be2104444055118f921722d882
Correct Unicode handling for in-db ingest

Ensure MARC::File::XML treats the content as UTF-8; then follow
recommended practice of explicitly decoding the UTF-8 string to a
bytes string before operating on it, then encoding it back to UTF8
when it's returned.

'Québec' now gets naco-normalized to 'quebec' as one would expect. YAY!

git-svn-id: svn://svn.open-ils.org/ILS/trunk@15351 dcc99617-32d9-48b4-a31d-7c20da2025e4
Open-ILS/src/sql/Pg/020.schema.functions.sql
Open-ILS/src/sql/Pg/030.schema.metabib.sql